Академический Документы
Профессиональный Документы
Культура Документы
P. R. Krishnaiah
P. K. Sen
Contributors
xix
xx Contributors
Randomization Procedures
C. B. B e l l a n d P. K. S e n
I. Introduction
2. T h e structure of r a n d o m i z a t i o n tests
(Xt . . . . . X, )
E , = \YI, , Y, " (2.1)
E~ (r) = ( X1 . . . . . X. (2.2)
L,, , Y,.)"
The sample point E , and the set ~ , (of n! permutations) gi,~e rise to a set
~, = {E,(r): r @ ~ } . (2.3)
Randomization procedures 3
The cardinality of this set is equal to n!. Now, under the null hypothesis, Xi
and Y~ are independent, for each i ( = 1 . . . . . n), while the n vectors are also
i.i.d. Thus, the (joint) distribution of E , is the 2n-fold product of the dis-
tributions of its 2n arguments, and, for each r E ~,, E , ( r ) has the same (joint)
distribution as E,. Also, we may write
where ~3, is the group of all n! permutations (transformations) {g,} which map
the sample space (R 2") onto itself. Thus, ~, = {g,. E,: g, ~ ~3,} is the orbit, and
under H0, the distribution of E , remains invariant under the permutation group
~3, (though this distribution may depend on the marginals F(x, oz) and F(oo, y)).
Under H0, the conditional distribution of E , on the orbit ~, is therefore given
by
P { E , = E , ( r ) l ~,, H0} = (n!) -~, r~ ~,, (2.5)
(and 0 mass points elsewhere). Thus, for a given significance level a (0 < a
< 1), if we consider a test function &~(E,) (where 0 <~ 4~(') < 1), such that
dp~(E.(r)) = ( n ! ) a , (2.6)
r~ n
so that
Thus, &,(E,) is a test function of size a (13< a < 1). In this setup, it is not
necessary to assume that the marginal d.f.'s of F are of given forms, and hence,
the test is nonparametric in character. Operationally, to make use of (2.5)-
(2.6), the first step is to characterize the orbit through the appropriate group of
transformations. Next, one may consider a real valued statistic T, = T ( E , ) and
enumerate the set of realizations: { T ( E , ( r ) ) : r ~ ~.}. In this context, one may
choose for T. the classical product moment correlation coefficient, the Spear-
man rank correlation or the Kendall tau statistic, among other possibilities. If
we denote the ordered values of these n! realizations by T O) ~<... ~< T~":) (note
that all of these may not be distinct), then, for a given n (and ~.) and a, we
may find two positive integers M~ and ME, such that
1, T~/> T~M2~,
~,,(T,)= t~ = (M2-(1-o~)(n!))/(M2-M1), Tn = T(~M~, (2.11)
0, Tn <~ T(~M~-I~ ,
then, (2.6) holds, and hence, (2.7)-(2.8) hold. A similar case may be worked out
for the two-sided (randomized) test.
It may be noted that the permutation group ~3, in (2.4) is basically related to
the hypothesis of matching invariance. If both the marginal d.f.'s are con-
tinuous (so that the ties among the observations may be neglected, in prob-
ability), and if we consider the coordinatewise transformations:
where both hi(') and h2(') are monotone, then, we may like to have a test for
H0 which remains invariant under all such (hi, h2). Let X~I)<'"<X~,~ and
Y 0 ) < " " < Yr,) be the two sets of order statistics (for the two rows in (2.2)),
and define Q -- (QI . . . . , On) by letting
Note that the induced-rank vector Q takes on (under H0) each permutation of
(1 . . . . . n) with the common probability (n!)-l, while the two order statistics
vectors are (jointly) sufficient for F, when H0 holds. Thus, under Ho, Q is a
maximal invariant (with respect to the class of transformations in (2.12)). A test
statistic (say, ~b*) which depends on E , through Q only is an invariant test
statistic, and under Ho, this will be genuinely distribution-flee. On the other
hand, a randomization test function, such as the one in (2.11), may not be
solely dependent on Q, and under H0, its unconditional distribution may
depend on the unknown marginal d.f.'s of F. Thus, it need not be genuinely
distribution-free. As in (2.11), the critical values of a randomization test
statistic are generally r.v. (assuming constant values on each orbit) tacitly so
chosen that (2.6) holds. This explains why a randomization test (if not based
solely on the maximal invariant) may fail to be truely distribution-free. Never-
theless, it is, at least, conditionally (Permutationally) distribution-free. We have
also noticed that the group ~3, has n! elements. If in (2.1), the elements in the
same row are not all distinct, the effective number of distinct E,(r) in (2.2) will
be equal to M, where (n !)/Mn is a positive integer. In such a case, an effective
group go of M, transformations {gO} may be defined in a similar manner, and,
in (2.5)-(2.6), n! has to be replaced by M,. The structure of the randomization
test in (2.7)-(2.8) remains the same. In (2.13), the elements of Q may not be all
distinct and adjustments for ties are to be made. The unconditional null
distribution of such adjusted Q may therefore depend on the marginal d.f.'s
(which are not necessarily continuous a.e.), so that genuinely distribution-free
Randomizationprocedures 5
tests based on the adjusted Q may not exist. However, the conditional
(permutational) distribution of the ties-adjusted Q over M, equally likely real-
izations remfiins intact, so that randomization tests can be constructed as in
(2.11). Since (2.5), in this extended form, insures that for some (possibly vector
valued) U, assuming constant values on the different orbits, the condition of
similarity of the test tb(E,) is satisfied and U, behaves as a sufficient statistic
(vector), we may conclude that the randomization tests have Neyman Structure.
We may also remark that as regards the permutational distribution, the i.i.d.
character of the r.v.'s may be replaced by exchangeability, and this may well
arise in sampling from a finite population or in some other sampling schemes
where the independence may not hold. With these observations, we may
summarize the basic structure of randomization tests in the following unified
manner:
Let n be a positive integer and let X, be a random element with the sample
space ~f.. The probability distribution P ( A . ) = P{X. ~ A.} is defined on an
additive class sO. of subsets A . of ~f.. Consider a p a r t i t i o n / / , of ~., which is a
class of mutually exclusive subsets S of ~f., such that every point x. of ~f.
belongs to one of the subsets S. The set O(x.) of all points in ~g. belonging to
the same subset containing x. is termed the orbit of x. (E ~f.), and the number
of points belonging to O(x.) is denoted by M.(x.). Typically, the partitioning
H. (and the orbit O(x.)) are characterized by the invariance of P(A.) under
appropriate groups of transformations which map the sample space (~.) onto
itself. We assume that for every (finite) n and x. E ~., M.(x.) is finite. Let S~.
be the set theoretic union of all those sets S E / / . containing exactly iV/.
elements. Then, we assume that there exist mutually exclusive subsets ~ci) Mn'
i = 1 , . . . , M~, which are measurable, and, the cardinality of S n S~. is equal to
1 for each i (= 1 , . . . , M~). The null hypothesisli(H0) relates to the invariance of
P(A,) under H~. If ~b~(x,) be a test function ( 0 ~< ~b~(.)~ < 1), for which
A set C* is similar with respect to O if there exists an a such that P{C* ] J(.)} = a,
for all J(.) in 12. A collection {Av} of sets is called a MESP (maximal essential
similar partition) with respect to 12 if
(a) each A~ is similar with respect to g2,
(b) each P(A~) > O,
(c) D* = ~ , - U A~ is similar with P(D*) = O,
(d) the {A~} are disjoint, and
(e) no A~ has a nontrivial similar subset.
LEMMA 2.1. (i) C* is similar with respect to f2 if and only if its indicator
function is NPDF. (ii) T is N P D F with respect to 12 if and only if each set {T <~t}
is similar with respect to 12. (iii) Let S ( X , ) be a sufficient statistic for ~. Then, any
statistic T independent of S ( X , ) is N P D F with respect to 12.
THEOREM 2.2. Let S ( X , ) be a complete sufficient statistic for 12. Then, a set C*
is similar with respect to 12 with P(C*) = ot if and only if for almost all s* and all
J(.) in ~, P{C* I S ( X , ) = s*} = a.
Roughly, this says that a similar set must contain a 'proportion' c~ of the a.e.
orbit. Randomization statistics arise as a method of choosing these points on
the orbits. Generally, for nonparametric hypotheses, the orbits are finite, and
one employs permutation functions in selecting the points of the orbits.
is ~> or. <0, and the summation extends over all y in S. (iii)
{[R(h(X,))= r]: r = 1. . . . . k * } = ~ ( h ( - ) ) is called the partition generated by
h(.).
One sees immediately that the B-Pitman function assigns different values to
the points on a.e. given orbit. Hence, it can be employed to select subsets of
points from a.e. orbit. (The 'a.e.' is used here because in the more common
nonparametric cases, ties may cause some difficulties as to the orbit size etc.,
and are excluded.) It is also clear that there are many possible permutation
statistics and associated partitions. The results of several authors (including
Bell and Doksum, 1967; Bell and Donoghue, 1969, and Lehmann and Stein,
1949) allow one to prove the following
The examples given in the next Section illustrate this theorem. The following
results relate to further generalizations of the structure resuRs presented above.
Let 12 admit a MSS, SO(,) and let 8(X,) = [S(X,), N(X~)].
DEFINITION 3. (i) 6(') is called a BDT (basic data transformation) and (ii)
N ( X . ) is called a version of MSN if (a) 8(.) is one-to-one a.e., and (19) S(Xn)
and N ( X . ) are independent.
Here, the MSN is an ancillary statistic which is complementary to the MSS,
and 8(.) gives the new 'coordinate' system. The relation between MSN and
NPDF statistics is given by the following
This means that under appropriate circumstances the family of all per-
mutation (randomization) statistics for a given NP hypothesis can be
parameterized by the family of all B-Pitman functions or by the family of all
versions of MSN. Further, noninvariant B-Pitman functions are associated with
MSN versions which are not (maximal) invariants. (See Section 7.) For the MSS
there is essentially only one version. Some modifications of these theorems are
needed when S(X,) is not complete.
For the sample point E , = (X1 . . . . . X , ) E R", consider the group .~, of trans-
formations {g,}, where, typically,
with each ji = 0, 1 (1 ~< i ~< n) and (i~. . . . . i,) a permutation of (1 . . . . . n). Thus,
~. had(n!)2" :possible elements, and under H(01), for each g, E ~,, g, .E, has
the same (joint) distribution as E,. For any e , - - ( x i . . . . . x,)E R", the orbit
O(e,) is defined by
_ / n \1/2
Note that Ei=l " Xi2 remains invariant under ~3., while, )(. is a symmetric function
of Xi . . . . . X. (and hence, remains invariant under any permutation of the
subscripts). Thus, conditionally on [XI[. . . . , [X.[ given, t. in (3.4) has 2" equally
likely realizations over the 2" equally likely sign-inversions. Thus, we may
proceed as in (2.9) through (2.11), replace n! by 2" and construct an exact size
a test based on this permutation distribution of t. generated by the 2" equally
likely sign inversions. Note that the test function (or the critical region) for such a
randomization test, generally, depends on [X11. . . . . IX.I, and has to be deter-
mined from the given sample. In this setup, standard statistical tables are
therefore of not much use. Further, for this test, we have not used the
permutational-invariance, and hence, it is not necessary to assume that the X~ are
i.i.d.r.v. It suffices to reframe the null hypothesis H~1) as that of the symmetry of
the d.f. of each X~ without imposing the identity of these d.f.'s. Thus, in this
permutation procedure, we not only eliminate the normality assumption needed
with the parametric (Student) t-test, but also allow for possible nonhomogeneity
of the d.f.'s This explains the broad scope of this randomization test.
Suppose now that the d.f. F in (3.1) is continuous. Let R 7 be the rank of IXil
among IX1[. . . . . ]X,[, for i = 1. . . . . n, and let Z I < - . . < Z , be the ordered
r.v.'s corresponding to ]Xd . . . . . IX,]. We denote by
(-1)X~ have the same d . f . F . For p --- 1, this reduces to (3.1), while for p > 1,
this is a natural generalization of the notion of symmetry of F and is less
restrictive than the total symmetry of F. If we define the sample point E~ by
(X1 . . . . . X,) (so that Y(, = R p") and consider the group ca0 of transformations
{gO}, where
then the joint distribution of E , remains invariant under ~d. Note that E~ is a
p n matrix, so that for each row, we may consider a set of order statistics for
the absolute values and define the absolute ranks and signs in the same manner
as in the univariate case. This leads us to three matrices of (i) absolute order
statistics, (ii) absolute ranks and (iii) signs of the observations. Unfortunately,
for p ~> 2, these three matrices are not generally mutually independent. Thus,
the maximal invariance character of the matrices of signs and absolute ranks
may not be true, in general and hence, tests, based on these matrices, are not
generally.genuinely distribution-free. Nevertheless, the group g0 relates to the
invariance under 2 n conditionally equally likely sign inversions (under H~I~),
and hence, a randomization test may be constructed as in (2.14)-(2.15) with
M, = 2 n. For the multivariate sign-test, this intelligent observation is due to
Chatterjee (1966), while, Sen and Puri (1967) adapted this for general multi-
variate signed-rank tests. We may refer to Chapter 3 (by Hugkovfi) for more
details.
against the alternatives that they are not all the same. For the sample point
E , = (X1 . . . . . Xn) (ER"), consider the group ~d, of transformations {g,}, where
and ~ , = {rl . . . . . rn} is the set of all possible n ! permutations of (1 . . . . , n). Thus,
for every e, = ( X l , . . . , x~) ( C R ~), the orbit O(en) is defined by
with at least one strict inequality on a set of measure nonzero; the case of H <
Randomization procedures 13
where the ei are i.i.d.r.v, with d.f. F, defined on R. Under H0, p = 0 and we
may want to test this against p > 0 or ~ 0. Normal theory tests are based on a
suitable version (circular or not) of the serial statistic E ( X H X ~ ) . H e r e also,
under H0, the joint distribution of (X1 . . . . . X,) remains invariant under any
permutation of the arguments, so that a randomization test procedure may be
based on the statistic using (2.14)-(2.15) for M, = n!. More general serial
statistics were considered by M. N. Ghosh (1954) and the theory of ran-
domization tests was developed. Randomization tests based on pure or mixed
rank statistics can also be constructed by the same permutation principle; we
may refer to Chapter 5 (by G. K. Bhattacharyya) for some of these. Tests based
on the pure rank statistics are genuinely distribution-free (when F is con-
tinuous), while the ones based on mixed rank statistics are only permutationally
(conditionally) distribution-free. The model (3.14) can also be generalized to
the multivariate case by replacing the X~, X~-I and e~ by appropriate p-vectors
and p by P, a p x p matrix, where again the null hypothesis relates to P = 0.
For p >/2, the pure rank based tests are generally only permutationally
14 C. B. Bell and P. K. Sen
distribution-free, while the others are so even for p = 1. Note that the structure
of the orbit remains the same for univariate as well as multivariate situations.
that is, X~1) and X! 2) are stochastically independent (though the variates within
the same vector need not be independent). As in (2.1)-(2.2), we have here
(1) )x (~,.,.Ikl
"''':kn(1)
~,(1))
(\ A1 " ' ' X ~ and E~(r)=
E~ = X~). X(2)] X(2)" .X(2) , r ~ ~, (3.16)
n rI rn
(Snll
Sn = \Sn21
8n12~
Sn22]
(3.17)
The largest characteristic root (or the trace) of S* is taken as the test statistic
for testing the null hypothesis. Note that both S, lX and S,22 remain invariant
under ~, while S, x2 has n! possible (conditionally, equally likely) realizations
over the orbit. Hence, for a real valued function T , - - T ( $ * ) of S*, we may
generate the permutational (conditional) distribution of T, by using (2.5), and
with this, we may proceed as in (2.9)-(2.11) to get an exact size a (ran-
domization) test. For p = q = 1 (i.e., the bivariate case), this reduces to the
two-sided version of (2.11). If at least one of p or q is greater than 1, as in
Section 3.1 or 3.2, we may conclude that the randomization tests are only
permutationally distribution-free, while for p = q = 1, we may proceed as in
after (2.12) and obtain some genuinely distribution-free rank tests. Puri, Sen
and Gokhale (1970) have considered multivariate rank tests for this in-
Randomization procedures 15
3.4. H y p o t h e s i s o f e x c h a n g e a b i l i t y
where ~ is the set of all possible (p!) permutations of (1 . . . . . p). In this case,
the sample point E , = (X1. . . . . X,) is a p n matrix. Let ri = (rl~. . . . . rpi)',
i = 1 . . . . . n, be n independent vectors, where each ri takes on the permutations
of (1 . . . . , p) with the common probability (p!) -1. Also, let ~n = {il . . . . . in} be
the set of all possible n ! permutations of (1 . . . . . n). Consider then the group ~d,
of (n!)(p!)" transformations {gn}, where, typically,
The group ~, maps the sample space onto itself, and, under H~04) in (3.19),
g , . E , has the same distribution as E . for every g , E ~3,. Thus, we may
appeal to (2.14)- (2.15) with M n = ( n ! ) ( p ! ) L Here also, if the test function
~b~(En) is symmetric in X1 . . . . . X,, then, we may reduce M, to (pI)". For the
normal theory model, (3.19) reduces to the equality of the means, of the
variances of the p coordinate variates and the equality of the covariances for
any pair of them; in the literature, this is known as the HMvc (viz., Wilks 1946).
The same test statistic may be used in the randomization procedure wherein
the assumed normality of F will no longer be needed; this test will only be
conditionally distribution-free. Alternatively, as in Sen (1967), we may consider
a rank statistic where the Xij in (3.20) are replaced by their ranks (with respect
to all the np = N observations). Since the coordinates in each column of E , are
not necessarily independent, such rank statistics may not be genuinely dis-
tribution-free (under H~4)), although they are conditionally so. This hypothesis
of multivariate interchangeability also arises in a very natural manner in the
analysis of randomized block designs, which we present below.
16 c. B. Bell and P. K. Sen
Suppose that there are n (I>2) blocks of p (>12) plots each where p different
treatments are applied, and let X~j stand for the response of the plot in the ith
block receiving the jth treatment, for i = 1. . . . . n, j = 1. . . . . p. We denote the
d.f. of X/j by F~j, defined on R, and frame the null hypothesis
where the F~ are arbitrary. Again, under H~5), the vectors ri, i = 1. . . . . n,
defined as in before (3.20), are independent, each having p! possible equally
likely realizations (when the F~ are assumed to be continuous). Hence, we have
a group ~3~ of (p!)~ transformations {g~}, where, typically,
\Xl,pl "
and, for every gn E ~3n, g~ E~ has the same distribution as E~. Thus, we may
again appeal to (2.14)-(2.15) with M, = (p!)n, and in this setup, it is not
necessary to assume that the F~ are all the same. It is also possible to replace
the assumption that for each i, X~I. . . . . X~p are independent, by their exchan-
geability, i.e., the joint d.f. of (X~I. . . . . X~p) is symmetric in its arguments, a
case that may arise in 'mixed models' where the block effects may be random
and the treatment effects are nonstochastic. Thus, in a randomization pro-
cedure, the additivity of the block or treatment effects and the independence of
the errors may be eliminated along with the normality of the F~. Randomization
tests based on the classical ANOVA test statistics, dating back to Fisher (1934),
are conditionally distribution-free, while the procedures solely based on the
vectors r/, i = 1 . . . . . n, (such as the tests due to Friedman, 1937; Brown and
Mood, 1951, and others), are genuinely distribution-free when the F/ are all
continuous. One of the characteristics of the 'within block rankings' is that they
do not incorporate the inter-block comparisons. Incorporation of this inter-
block information is possible through suitable alignment processes: Tests are
based on 'ranking after alignments' and rest on a somewhat different group-
invariance structure. If we conceive of additive block effects (i.e., the F~ differ
in shifts only), then, it seems intuitive to eliminate the block-effects through
substitution of their estimates and adapting an overall ranking on the residuals
(or aligned observations). Based on X , . . . . , Xip, let ~ be some translation-
invariant estimator of the block average, and let Y~j = X/j - )~, for j = 1 . . . . . p,
i = 1. . . . . n. If X/ is symmetric in (X/1. . . . . X/p), then, under Hg5), r/l rip
. . . . .
are exchangeable r.v., for each i and these aligned vectors are independent for
different i. Let Y~;1< " " < Y~;p be the ordered r.v. corresponding to the Y~j,
1 ~<j ~<p, for i = 1. . . . . . n, and let Y* be the matrix of these order statistics.
Then, under Hg 5), conditional on Y*, one obtains a set of (p!)~ possible
realizations of the aligned observations, corresponding to the intra-block
permutations of the order statistics, and these are all conditionally equally
likely. Hence, we may again appeal to (2.14)-(2.15) with M . = (p!)n, and rank
R a n d o m i z a t i o n procedures 17
process and the time interval (0, h], and conversely Fo and h determine Lg0 if
X0 = 0, with probability 1. With the Yj defined as in above, the theory
developed in Section 3.2 applies, and hence, randomization procedures based
on the Yj workout for the problems related to such processes with stationary
independent increments.
(b) Processes with stationary symmetric increments. Define the Yj as in (a).
Here, additionally, one has the information that F0 is symmetric. As such, we
may appeal to the theory developed in Section 3.1, and the randomization tests
developed there are applicable for problems related to such processes.
(c) Spherically exchangeable processes. The problems here lead to per-
mutation tests analogous to those in Section 3.4 (relating to tests for sphericity).
(d) Exchangeable processes. For such processes, the Yi, defined as in (a), are
not necessarily independent, but are exchangeable r.v. As such, the theory
developed in Section 3.4 remains applicable here.
It should be mentioned here that throughout this section the tests discussed
are not 'omnibus' tests, but are rather 'directional' in the sense of being
directed towards certain parametric alternatives. In fact, in his original papers,
Pitman (1937a,b; 1938) was concerned with the classical normal alternatives,
and was seeking N P D F tests which would yield high power against these
alternatives. These early tests were based on the classical statistics for the
problem at hand, and hence, the associated B-Pitman functions were closely
related to the classical parametric statistics. In Section 5, we shall discuss the
problem of choosing randomization tests statistics for a specific NP problem,
and consider some optimality criteria too.
where 2 0 = n -1 E7=1X and ~-o = n-, Z7=a y0. Note that r,(Q)E [-1, 1] for all
Q and the permutation distribution of r,(Q) is given by
211/2
- -
and locally most powerful (LMP) rank tests for a variety of situations. However,
the alternatives against which these tests are so optimal are, more often than not,
parametric families. It turns out that for appropriate parametric families of
alternatives, the MP N P D F test is always a permutation test. This optimal test can
only be a rank test if the B-Pitman function is constant on rank sets. In this
context, we have the following result due to Bell and Donoghue (1969).
1 ifR(h(Xn))>K(a,n),
O(X,) = if R ( h ( X , ) ) = K(a, n ) ,
0 ifR(h(X.))<K(a,n),
where h(.) is a B-Pitman function such that L I ( X ' ) < LI(X~) implies h(Xf,)<
h (X") for all X " and X " on the same orbit, and for almost all orbits.
We illustrate this theorem with the following example from Bell and
Donoghue (1969). For the hypothesis H 0 : F 1 . . . . . F, = F (unknown) (see
Section 3.2), consider the parametric alternative such that XI . . . . . X , are
independent with X, - N0z,, O'2), where /z, = tz,(0) = ( - 1 ) ~ cos(rTr/O), r >i 1.
Here,
n
log LI(X,,, O) = - g log(2~r) - ~
r=l r=l
(and X, = (Xa . . . . . X,)). This implies that the MP N P D F test should be based
on the permutation (B-Pitman) statistic with h ( X , ) = E~'=I(-1)'Xr cos(r~r/O).
One may wish to avoid the dependence on 0. For 0 close to 1, one has
{HF/: F~(x, O) = exp{a(O)A,(x) + k(i, O) + t(x, 0)}, with a(O) > 0}.
22 C. B. Bell and P. K. Sen
(ii) If, further, a(O) is continuous in O, a(Oo)= 0 = k(i, 0o), and Q(x, O, i)=
o(a(O)), for all i >1 1 and x, then R ( h ( X , ) ) is the statistic of the L M P N P D F test
for 0 = Oo (randomness) vs. the family
{HF~: F~(x, O) = exp{a(O)Ai(x) + k(i, O) + t(x, O) + Q(x, 0, i)}, 0 > 00}.
As is generally the case, for a composite alternative hypothesis, the density
qn(xn) (as well as the ordering in (5.1)) depends on the unknown (nuisance)
parameters; a very similar case arises with the h ( X , ) in Theorem 5.1 (or 5.2).
Thus, the (generalized) Neyman-Pearson lemma on which the above optimality
property is based may not hold for the entire set of alternatives. The situation
is comparable to the classical Neyman-Pearson testing theory, where a (uni-
formly) most powerful similar region may not generally exist, and, one may
have to choose some restricted optimal one (such as the locally most powerful,
asymptotically most powerful, most powerful unbiased, c(a) test etc.,). Such a
restricted optimal parametric test statistic may then be employed as in Section
3 in the construction of randomization tests whenever under the null hypothesis
the orbit and the conditional probability law on the orbit can be defined
unambiguously. Such a randomization test will have the similar (restricted)
optimality property within the class of randomization tests. For example, for
the two-sample problem with the shift alternative in mind, under normality on
the d.f. (when the alternative holds), the one-sided uniformly most powerful
randomization test is based on the classical Student t-statistic with the rule in
(2.9)-(2.11). For some other d.f., for shift alternatives, we may similarly use the
locally most powerful test statistic and the use of that in the randomization
procedures in Sections 2 and 3 would lead to locally most powerful ran-
domization tests against such specific alternatives. If we use invariant tests
(such as the rank tests), then, we would get locally most powerful invariant
tests in this manner. The recent papers of Sen (1981a) and Basu, Ghosh and
Sen (1983) cast some light on locally optimal tests of this type. Use of
Neyman's c(a)-test statistic in a randomization procedure (permitting the
availability of the orbit) may be generally recommended on the ground of local
asymptotic optimality, where the sample size is taken large and the alternative
hypotheses are chosen in the neighborhood of the null hypothesis, so that the
asymptotic power function does not converge to the degenerate limit 1.
We have discussed the optimality or asymptotic optimality of randomization
or permutation tests for certain parametric alternatives. One very practical
question is: How good are these tests against wider (NP) families of alter-
natives? No attempt is made to answer this question in this article. A more
practical question concerns the amount of computation involved in actually
performing such tests. Some comments on this aspect are made in the next
section.
n! = 3 628000, (b) 2 n = 1024 or (c) (2n)(n!)= 3 715 891 200, depending on the
hypothesis at hand. This being the case, one immediately seeks some
modification of the original form of the randomization tests when n is not very
small. Several such modifications have been treated in the literature. Brief
outlines of only three such modifications will be presented here. They are (a)
random permutations, (b) matching moments and (c) central limit theorems on
orbits. (A rigorous discussion of much of this material is in Puri and Sen
(1971).)
For several practical reasons, one may wish to exclude the identity per-
mutation, e*, from the random sample. That is, one chooses {3'1. . . . . 3'm} to be
a random sample (with replacement) from S-{e*}. In this case, some of the
analysis becomes more tractable. For (6.1) adapted to this sampling scheme,
one has the following
This means that in performing the randomized permutation test with the
statistic in (6.1), the critical region of the form: {R *(h (Xn))> C(a, k*, m)}, for
large m, one may take C(a, k *, m) ~- I-~m+ O'm~'~, where ~-~ is the upper 100a %
point of the standard normal d.f. There are other approaches too, but they will
not be treated here.
(conditional) moments of h2(') are (i) E(rn)= 0, (ii) V(r,)= ( n - 1 ) -1, (iii)
E(r3) = O(n -2) and (iv) E(r~)= 3(n 2 - 1)-111+ O(n-1)]. For large n, these
moments correspond to those in the normal theory case when the population
correlation coefficient is equal to 0. Thus, a permutation test based on hi(') or
h2(') is approximately equal to the (parametric) test based on T =
r , { ( n - 2 ) / ( 1 - r ~ ) } 1/2. A more comprehensive picture is given in Hoettding
(1952).
be asymptotically optimal in the same sense. For invariant tests, the asymptotic
equivalence-results are one-step further generalization in the sense that the
optimal invariant test statistics may be the projections of the optimal un-
conditional test statistics into the space of the maximal invariants, and hence,
the asymptotic equivalence results demand that the proportional loss in the
conditional arguments converges to 0 as the sample sizes increase. For rank
tests, this theory has been developed in Terry (1952), Hfijek and Sidfik (1967,
pp. 63-71), and some further developments are sketched in Sen (1981a).
In the final section, we briefly discuss the issue of invariance of permutation
tests.
DEFINITION 7.1. Let O" be a family of distributions (on some sample space X)-
A group G" of 1-1 transformations (of X onto itself) is called an O"-
generating group, if {J(g"(-)): g " E G"} = ~ " for each J(.) in g2".
One notes immediately, that in Example 7.1, G'ris an O"-generating group.
26 C. B. Bell and P. K. Sen
If a family g~" admits a group G" as in Definition 7.1 above and a finite set
S" with the usual properties, then one can establish some useful results, as in
Bell and Kurotschka (1971), Bell (1964) and Wijsman (1957).
THEOREM 7.1. Let 0 " admit both an JT'-generating group G" of transfor-
mations, and a finite permutation set S" of k* elements with S"(z) being a
sufficient statistic. Then
(i) each set A invariant wrt G" is similar wrt ~",
(ii) the set-valued random variable G"(Z) is N P D F wrt ~", and assumes k *
distinct values each with probability [k *]-l;
(iii) for a.a. z, {G"(y): y E S"(z)} is a maximal (essential) similar partition (of
the sample space X);
(iv) 7(z) = G"(z) n S"(z) is 1-1 a.e.,
(v) 6 ( z ) = [S"(z), G"(z)] is 1-1 a.e.,
(vi) G"(z) and S"(z) are independent, and
(vii) G"(z) is M - S - N .
On the basis of the above result, one might wish to define a 'natural' group
of transformations as one satisfying Theorem 7.1.
Under the Conditions of that theorem, one wishes to construct new groups of
transformations as follows.
Let h(-) be a B-Pitman function wrt J2" and S", and let {[R(h(z))= s], s =
1, 2 . . . . . k*} be the associated maximal (essential) similar partition.
DEFINITION 7.2. (i) for z E { R ( h ) = s}, let gh(Z) = S " ( g " ( z ) ) n { R ( h ) = s}; and
for z @ D * ( h ) = X - U k
1 {R(h) - s}, define gh(z) arbitrarily;
(ii) G ~ = {gh('): g " G G"}.
References
[1] Basawa, I. V. and Rao, B. L. S. Prakasa (1980). Statistical Inference for Stochastic Processes.
Academic Press, New York.
Randomization procedures 27
[2] Basu, A. P., Ghosh, J. K. and Sen, P. K. (1983). A unified way of deriving LMP rank tests
from censored data. J. Roy. Statist. Soc. Set. B 45, 384-390.
[3] Basu, D. (1955). On statistics independent of a complete sufficient statistics. Sankhy(t 15,
377-380.
[4] Basu, D. (1958). On statistics independent of sufficient statistics. Sankhy~ 18, 223-226.
[5] Bell, C. B. (1960). On the structure of distribution-free statistics. Ann. Math. Statist. 31,
703-709.
[6] Bell, C. B. (1964a). Some basic theorems of distribution-free statistics. Ann. Math. Statist. 35,
150-156.
[7] Bell, C. B. (1964b). A characterization of multisample distribution-free statistics. Ann. Math.
Statist. 35, 735-738.
[8] Bell, C. B. (1965). Enige problemen in de verdelingsvrije statistiek. Rapport $340 Mathema-
tisch Centrum, Amsterdam.
[9] Bell, C. B. (1970). Less methods non-parametriques e t a distribution libre. Tech. Rep. Ctr.
Res. Math. Univ. Montreal.
[10] Bell, C. B. (1982). Signal detection for spherically exchangeable stochastic processes. Tech.
Rep. 4-82, Ser. Statist. Biostatist., San Diego State Univ.
[ll] Bell, C. B. and Ahmad, R. (1982). Signal detection for stochastic processes with stationary
independent symmetric increments. Tech. Rep. 2-82, Ser. Statist. Biostatist. San Diego State
Univ.
[121 Bell, C. B. and Bellot, F. (1965). Nota sobre los estadisticos no parametricos de Pitman.
Trabajos de Estadis. 16, 25-39.
[13] Bell, C. B., Blackwell, D. and Breiman, L. (1960). On the completeness of order statistics.
Ann. Math. Statist. 31, 794-797.
[14] Bell, C. B. and Doksum, K. A. (1965). Some new distribution-free statistics. Ann. Math.
Statist. 36, 203-214.
[15] Bell, C. B. and Doksum, K. A. (1967). Distribution-free tests for independence. Ann. Math.
Statist. 38, 619-628.
[16] Bell, C. B. and Donoghue, J. (1969). Distribution-free tests of randomness. Sankhya Set. A
31, 157-176.
[17] Bell, C. B. and Haller, H. S. (1969). Bivariate symmetry tests: Parametric and nonparametric.
Ann. Math. Statist. 40, 259-269.
[18] Bell, C. B. and Kurotschka, V. (1971). Einige prinzipien zur behandlung nichtparameterischer
hypothesen. In: Studi di Probabilita, Statistica, e Ricera Operative in onore di Giuseppe Pompil.i,
Oderrisi-Buggio, pp. 164-186.
[19] Bell, C. B. and Smith, P. J. (1969). Some nonparametric tests for the multivariate goodness of
fit, multisample, independence and symmetry problems. In: P. R. Krishnaiah, ed., Multivariate
Analysis IL Academic Press, New York, pp. 3-23.
[20] Bell, C. B. and Smith, P. J. (1972a). Some aspects of the concept of symmetry in non-
parametric statistics. In: D. S. Tracy, ed., Symmetric Functions in Statistics (Syrup. in honor of
Paul Dwer). Univ. Windsor, pp. 143-181.
[21] Bell, C. B. and Smith, P. J. (1972b). Completeness theorems for characterizing distribution-
free statistics. Ann. Inst. Statist. Math. 24, 435-453.
[22] Bell, C. B., Woodroofe, M. and Avadhani, T. V. (1970). Some nonparametric tests for
stochastic processes. In: M. L. Purl, ed., Nonparametric Techniques in Statistical Inference.
Cambridge Univ. Press, New York, pp. 215-258.
[23] Berk, R. H. and Bickel, P. J. (1968). On invariance and almost invariance. Ann. Math. Statist.
39, 1573-1576.
[24] Brown, G. W. and Mood, A. M. (1951). On median tests for linear models. In: Proc. Second
Berkeley Syrup. Math. Statist. Prob., pp. 159-166.
[25] Capon, J. (1961). Asymptotic efficiency of certain locally most powerful rank tests. Ann. Math.
Statist. 32, 88-100.
[26] Chatterjee, S. K. (1966). A bivariate sign test for location. Ann. Math. Statist. 37, 1771-1781.
[27] Chatterjee, S. K. and Sen, P. K. (1964). Nonparametric tests for the bivariate two-sample
location problem. Calcutta Statist. Assoc. Bull. 13, 18-58.
28 C. B. Bell and P. K. Sen
[28] Dwass, M. (1957). Modified randomization tests for nonparametric hypotheses. Ann. Math.
Statist. 28, 181-187.
[29] Edgington, E. S. (1980). Randomization Tests. Marcel Dekker, New York.
[30] Fisher, R. A. (1934). Statistical Methods for Research Workers. Oliver and Boyd, Edinburgh.
[31] Fraser, D. A. S. (1957). Nonparametric Methods in Statistics. Wiley, New York.
[32] Friedman, M. (1937). The use of ranks to avoid the assumption of normality implicit in the
analysis of variance. J. Amer. Statist. Assoc. 32, 675--701.
[33] Gerig, T. M. (1969). A multivariate extension of the Friedman's X2-test. J. Amer. Statist.
iAssoc. 64, 1595-1608.
[34] Ghosh, M. N. (1954). Asymptotic distribution of serial statistics and applications to non-
parametric tests of hypotheses. Ann. Math. Statist. 25, 218-251.
[35] Hfijek, J. (1961). Some extensions of the Wald-Wolfowitz-Noether Theorem. Ann. Math.
Statist. 32, 506-523.
[36] Hfijek, J. and Sidhk, Z. (1967). Theory of Rank Tests. Academic Press, New York.
[37] Hoeffding, W. (1951a). A combinatorial central limit theorem. Ann. Math. Statist. 22, 558--566.
[38] Hoeffding, W. (1951b). Optimum nonparametric tests. Proc. Second Berkeley Syrup. Math.
Statist. Prob., pp. 83-92.
[39] Hoeffding, W. (1952). The large sample power of tests based on permutations of observations.
Ann. Math. Statist. 23, 169-192.
[40] Jogdeo, K. (1966). On randomized rank score procedure of Bell and Doksum. Ann. Math.
Statist. 37, 1697-1702.
[41] Junge, K. (1976). Charakterisierunggassatze Virteilungsfreier Statistiken und Ahnlicker
Mengen. Diplomarbert, Univ. Gottingen.
[42] Kendall, M. G. and Stuart, A. (1979). The Advanced Theory of Statistics, Vol 2: Inference and
Relationship. Hafner Publ. New York.
[43] Lehmann, E. L. (1959). Testing of Statistical Hypotheses. Wiley, New York.
[44]! Lehmann, E. L. and Schett6, H. (1950). Completeness, similar regions and unbiased estima-
tion, Part I. Sankhy~ 10, 305-340.
[45] I Lehmann, E. L. and Scheff6, H. (1955). Completeness, similar regions and unbiased estima-
tion, II. Sankhyd 15, 219-236.
[46] Lehmann, E. L. and Stein, C. (1949). On the theory of some nonparametric hypotheses. Ann.
Math. Statist. 20, 28--45.
[47] Mielke, P. W., Berry, K. J., Brockwell, P. J. and Williams, J. S. (1981). A class of non-
parametric tests based on multiresponse permutation procedures. Biometrika 68, 720-724.
[48] Motoo, M. (1957). On the Hoeffding combinatorial central limit theorem. Ann. Inst. Statist.
Math. 8, 145-154.
[49] Noether, G. E. (1949). On a theorem by Wald and Wolfowitz. Ann. Math. Statist. 20, 455--458.
[50] Pitman, E. J. G. (1937a). Significance tests which may be applied to samples from any
population. Suppl. JRSS IV, 119-130.
[51] Pitman, E. J. G. (1937b). Significance tests which may be applied to samples from any
population II. Suppl. JRSS IV, 225--232.
[52] Pitman, E. J. G., (1938). Significance tests which may be applied to samples from any
population III. The Analysis of variance test. Biometrika 29, 322.
[53] Puri, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis. Wiley,
New York.
[54] Puri, M. L., Sen, P. K. and Gokhale, D. V. (1970). On a class of rank order tests for
independence in multivariate distributions. Sankhya Ser. A. 32, 271-298.
[55] Quade, D. (1967). Rank analysis of covariance..L Amer. Statist. Assoc. 62, 1187-1200.
[56] Robinson, J. (1980). An asymptotic expansion for permutation tests with several samples,
Ann. Statist. 8 (4), pp. 851-864.
[57] Savage, I. R. (1962). Bibliography of Nonparametric Statistics. Harvard University Press,
Cambridge, Massachusetts.
[58].Savage, I. R. (1969). Nonparametric Statistics: A personal review, Sankhya Series A 31,
107-144.
Randomization procedures 29
[59] Schefff, H. (1943). Statistical inference in the nonparametric case. Ann. Math. Statist. 14,
305-332.
[60] Sen, P. K. (1967). Some nonparametric generalizations of Wilks' tests for HM, Hvc and HMvc,
I. Ann. Inst. Statist. Math. 19, 451-467.
[61] Sen, P. K. (1968). On a class of aligned rank order tests in two-way layouts. Ann. Math.
Statist. 39, 1115-1124.
[62] Sen, P. K. (1969). Nonparametric tests for multivariate interchangeability. Part II. The
problem of MANOVA in two-way layouts. Sankhya Ser. A 31, 145-156.
[63] Sen, P. K. (1981a). On invariance principles for LMP conditional test statistics. Calcutta
Statist. Assoc. Bull. 30, 41-56.
[64] Sen, P. K. (1981b). Sequential Nonparametrics : Invariance Principles and Statistical Inference.
Wiley, New York.
Sen, P. K. (1982). The UI-principle and LMP rank tests. In: B. V. Gnedenko et al., eds.,
[65] Colloquia Mathematica Societatis Janos Bolyai 32: Nonparametric Statistical Inference. North-
Holland, Amsterdam, pp. 843--858.
[66] Sen, P. K. (1983). On permutational central limit theorems for general multivariate linear rank
statistics. Sankhya Ser. A 45, 141-149.
[67] Sen, P. K. and Puff, M. L. (1967). On the theory of rank order tests for location in the
multivariate one-sample problem. Ann. Math. Statist. 38, 1216-1228.
[68] Smith, P. J. (1969). Structure of nonparametric tests of some multivariate hypotheses. Ph.D.
Thesis, Case Western Reserve Univ.
[69] Terry, M. E. (1952). Some rank order tests which are most powerful against specific
parametric alternative. Ann. Math. Statist 23, 346-366.
[70] Wald, A. and Wolfowitz, J. (1944). Statistical tests based on the permutations of the
observations. Ann. Math. Statist. 15, 358-372.
[71] Warner, J. E. (1969). Asymptotic properties of multivariate permutation tests with ap-
plications to signal detection. Ph.D. Thesis, Case Western Reserve Univ.
[72] Watson, G. S. (1957). Sufficient statistics, similar regions and distribution-free tests. J. Roy.
Statist. Soc. Set. B 19, 262-267.
[73] Welch, B. L. (1937). On the z-test in randomized blocks and Latin squares. Biometrika 29,
21-52.
[74] Wijsman, R. A. (196"/). Cross-sections of orbits and their applications to densities of maximal
invariants. Proc. Fifth Berkeley Syrup. Mata. Statist Probab., 1, pp. 389-400.
[75] Wilks, S. S. (1946). Sample criteria for testing equality of means, equality of variances and
equality of covariances in a normal multivariate distribution. Ann. Math. Statist. 17, 257-281.
[76] Wilks, S. S. (1962). Mathematical Statistics. Wiley, New York.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 t~
E~sevierScience Publishers (1984)31-62
V a s a n t P. B h a p k a r
1. Introduction
31
32 Vasant P. Bhapkar
for some p-variate c.d.f. F ; ~[-gi: ([.Z(1)~, . . . , #~))' is then the vector of location
parameters for F~. The distributions F are said to belong to the scale f a m i l y ~ s
if
Fi(x) = F ( ~ ) , i = 1 . . . . . k. (2.4)
k nj
R, = Z Z q'(X,- Xj.), (3.1)
/=lu=l
(i) T h e sample sizes n~ ~ o0 in such a way that n J N ~ p~, 0 < pi < 1, for all
i = 1 . . . . . k.
(ii) T h e S~'s are subject to the linear constraint
k
~'~ ai,,Si,, = rl,, (3.3)
i=1
L
N1/2[S. - n.(F)] ~ ~;(0, T(F)) (3.4)
k
a,rh(F) = 71, T(F)a = 0, (3.6)
i=1
(b) Assumptions (iii) and (iv) only require that the 'asymptotic mean' lq.(F)
of S, and the 'asymptotic covariance matrix' T(F) of N 1/2 S to be distribution-
free in 40. This asymptotic distribution-free characteristic of S will be guaran-
teed automatically if S is, in fact, strictly distribution-free in 40.
(c) 7/.(F) ----r/(F) and r/. ~- 71 for all n for many rank statistics. For instance,
this is the case with those based on generalized U-statistics (see Section 6). This
is also true for statistics based on linear rank statistics (see Section 5) for
location parameters with scores defined in a skew-symmetric manner.
The rank statistics that have been proposed in the literature (see, e.g.,
Kruskal and Wallis, 1952, 1953; Bhapkar, 1961; Puri, 1964; Bhapkar and
Deshpande, 1968; Puri and Sen, 1971) for homogeneity of several populations
have one of the general forms
To = N ( s - r i d ) ' r - , ( s - n . J ) , (4.1)
or
Ta = N ( S - r l j ) ' T - ( S - "oJ). (4.2)
for sufficiently large values of To, say for To > T0,~ at level of significance a.
The distribution-free property of the rank statistic To for F E 40 guarantees
that T0.~ is a constant independent of F. Theoretically, T0~ can be determined
by considering the N! equally likely values of To corresponding to distinct
permutation vectors r of (1, 2 , . . . , N)'. In practice one need consider only the
N ! / n l ! . . . nkl distinct combinations for samples of size nl.
For large number of combinations, one uses the approximation provided by
36 Vasant P. Bhapkar
Xk_l,a
,2 the upper 100a % critical value of chi-squared distribution with k - 1
degrees of freedom (dr). This approximation is justified in view of the next
theorem.
THEOREM 4.1. Assume the regularity conditions ~I for univariate random sam-
L
ples X(.). If F E ~:o, then To.---~X~_x. Furthermore, in case 71. ~ 71, if ~7. - rl =
L
o(N-1/2), then T~d, ~ X2_l when F ~ ~o.
PROOF. The proof for To. follows immediately from (3.4), (3.5) and (4.1) since
T has rank k - 1.
In case rl. r/, N1/E('qn - r/)--~ 0 and, thus, N I / E ( S . - r l j ) has the same limiting
distribution as N 1 / E ( S n - Tinj ) , which is X(0, T) when F E ~-0. Then the limiting
distribution of T~. is X~-r []
The T0-test (or, equivalently, the T~-test) is seen to be consistent against all
alternatives F for which 7 1 ( F ) / ~ j , in case TI,(F)=-71(F ) for all n. This
assertion depends on the following lemma:
L
LEMMA 4.1. Suppose N 1 / 2 ( y N - ~N)--~3;( 0, 0 ) as N - ~ ~ and Q N =
N ( Y, - 5N)'A( YN - 5N), where A is positive definite. Assume that the constants ~N,
5~, ~ and 5 satisfy the conditions NI/2(S~N - ~) = 0(1), N1/2(SN - 5) = 0(1) as N -~ oo.
p
Then Q, ~ , in the sense that P[QN > c] -~ 1, as N -~ ~, for every fixed c, if and
only ifl~# ~$.
where S ' = (Sb S*'). Letting ~/'(F)= [rh(F), r/ * '(F )], we note from (4.3) that
p
To. ~ ~ whenever r/*(F) / ~/j, in view of Lemma 4.1. However, r/*(F) = r/j if
and only if r / ( F ) = j as a consequence of relation (3.6). Thus it follows that
p
To. ~ oo whenever r/(F) / nj. Thus we have established
for all F in 0%.For F in 0%0,the condition (4.4) ensures r/. - T/= o(N-1/2), which was
used as a condition for To and T ] to have the Xk-12 limiting distribution under
~0: F E 0%0, in Theorem 4.1.
Under the condition (4.4), N1/2[rI,(F)- r/(F)] ~ 0 and, then, N1/2[Sn-1/(F)]
has" the limiting distribution N(0, T(F)) in view of (3.4). Arguing as in the proof
of Theorem 4.2, the consistency of T0-test follows for F for which 7 / ( F ) ~ ~TJ.
Furthermore, the same is true for the T~-test, since To. and T ] . have the same
limiting distributions. Thus we have established
For the investigation of asymptotic power of To, as a test statistic for Y(0, one
needs to consider a sequence of alternatives, say F(N) in 0%(N), converging to
some F in 0%0 at a suitable rate. Thus, we have
L L
Ton ~ x2(k - 1, ~),
The proof is straightforward in view of relations (3.4), (4.5) and (4.6) and,
hence, the details are omitted. Note here in view of the first relation in (3.6)
that we have
k
ai~i = 0 . ~.7)
i=l
5. U n i v a r i a t e t e s t s b a s e d o n l i n e a r r a n k s t a t i s t i c s
One important class of rank statistics To (or T~) is based on linear rank
statistics. H e r e we take
with JN constant valued over intervals ( u / ( N + 1), (u + 1)/(N + 1)], and suppose
1
J ( v ) = lim J N ( v ) , 0 < v < 1,
N.-~o~
k
rl =
f0 J(v) dr,
(5.3)
g ( x ) : ~ piFi(x) .
i=1
Then it can be shown that the conditions ~(ii)-(iv) are satisfied, provided
r/n -- r/ = o ( N -1/2) and some regularity conditions are satisfied for scores su (or
functions JN, J; see the chapter by Ghosh in this volume). Then, we have
1 N
71hi(F) = ~ E & & ( F ) , J ( H ( x ) ) dFi(x),
u=l
(5.4)
T =/~[ap I - J],
1 k
T~ = X ~= ni(Si - r/) 2 ; (5.6)
Hajek and Sidak (1967) have considered a modified version of To, say T;, as
a test criterion for N0; here
1 ~ ni(Si- ~)2
T;.=--~N i= 1
where
1 N
AN - N - 1 ~" (su - g)2. (5.7)
In view of (5.2), (5.3) and (5.5) we note that AN-~A and the limiting disc
tributional properties of T;, are the same as those of To,.
It can be shown (Purl and Sen, 1971) that the conditions (4.5) and (4.6) hold
for sequences of location and scale alternatives, under regularity assumptions.
L o c a l l y m o s t p o w e r f u l rank tests
Consider the class of criteria To (or T ; , T;) where the scores su are defined
by the formula
thus the large-sample /1( 2 t e s t with I df, using criterion To, is equivalent to the
two-sided standard normal test using Z = ( S I - ' 0 , ) ( N n l / n 2 A ) m. It is known
(see, e.g., Capon, 1961; Hajek and Sidak, 1967) that the right tailed standard
normal test using Z is the asymptotic version of the locally m o s t p o w e r f u l rank
(l.m.p.r.) test for the hypothesis ~f0:F1 = F2 against o n e - s i d e d alternatives
Su = [- g'(v.)]
g(V~)J' u = 1. . . . . N ; (5.11)
40 VasantP. Bhapkar
1 X
oo x : , go,x
s , = E [ - V , ~g ( V
] ,)]' u = 1, . . . , N . (5.12)
H e r e we consider some specific T ] (or T6) criteria of the type (5.6) or (5.7)
based on linear rank statistics (5.1) for the hypothesis ~ 0 : F 1 - - F 2 . . . . . Fk
against location alternatives ~ : F~(x) = F ( x - tzg), i = 1 . . . . . k for some F, with
not all/zi equal.
Consider the scores s, -- E ( Y ~ ) , u = 1 . . . . , N where Y1 < Y2 < ' " < YN are
order statistics in a random sample of size N from a distribution which is
symmetric around zero; then ~/n --= "O = 0. In this case, as pointed out in R e m a r k
3 on the conditions M in Section 3, the criterion T ] coincides with To. Two such
criteria are presented below.
K r u s k a l - W a l l i s H-statistic. H e r e su = u / ( N + 1 ) - for u = 1 , . . . , N, and
1
thus J N ( v ) =- J ( v ) = v - for 0 < v < 1. Then -q, -- g = 0 = "O for all n and A - 12.
Also S~ = E~'L~R ~ J n i ( N + 1) - = {R~ - ( N + 1 ) / 2 } / ( N + 1), /~ being the average
rank for the i-th sample, and we have
To = T~ - (N + 1)2 ni Ri -
i=1
)2
12 ~ n~ / ~ - + 1 H say. (5.13)
T~ - N ( N + 1) 2
i=1
alternatives ~g~ w i t h / z , -/-/,2 > 0 for G ( x ) = e*/(1 + e*), -oo < x < ~, which is the
c.d.f, of logistic distribution.
N o r m a l scores statistic. H e r e the scores are given by (5.11), w h e r e V, < V 2 <
" < V/v are the o r d e r statistics in a r a n d o m sample of size N f r o m the
standard n o r m a l distribution with c.d.f, qb. T h e n s, = E ( V , ) , u = 1 . . . . . N.
Since the standard n o r m a l distribution is s y m m e t r i c a r o u n d zero, ft. =-'0 = 0.
N o t e that here
su = E ( V . ) = E [ O - ' ( Y ~ ) ] ;
for this system of scores, we have J ( v ) = ~ - l ( v ) , 0 < v < 1, in view of (5.2) and
(5.3). H e n c e h = 1, and thus
k
To = T~ = ~'~ niS]. (5.14)
i=l
This test statistic or, rather, the T~ version of the statistic is originally due to
Fisher and Yates (1938). An asymptotically equivalent form due to Van der
W a e r d e n (1953) uses approximate scores su -- dP-I(u/(N + 1)).
F o r the case k = 2, the right-tailed test in terms of Z = S~(Nnl/n2) 1/2 is
1.m.p.r. test for Y(0: FI = F2 against one-sided location alternatives Y(, with
]-~1 - - ]-/'2 > 0 for G ( x ) = P(x).
(i) Su =
u
-~-~-~ - ,
(ii) s,=
(u
~-T-~-
)2, (5.15)
U 2
(iii) s, = [~b-' (~--~--~)] ,
For the case k = 2, the right-tailed Z-test, given by (5.9) with the + sign for
the square root, happens to be a 1.m.p.r. test for Y(0:F1 = F2 against one-sided
scale alternatives ~2 for which 01/02 > 1 and G = ', for the normal scores
su = E(V~) or its approximate version (iii).
, ( x l , x2 . . . . . xk) = ( r , ) , (6.2)
r~ being the rank of x~ in the k-tuplet {xl . . . . . Xk}. The condition (3.3) is
k
satisfied for a~, =- al = i/k, and r/n ~- r/= Ei= 1 (j)/k.
It can be shown under suitable regularity assumptions (see Chapter 8 by
Ghosh in this volume; Bhapkar, 1961, 1966) that conditions ~ (iii)-(iv) are
satisfied for
k
n,.(F)--- . , ( r ) = G 4(/);,j(r),
j=l
h (6.3)
T = ~(k - 1) [k2a;'- k q j ' - kjq'+ qJ].
~ ( k - 1) ( k - 1) 2
= j=l l=l 4(J)4(1) J - 1 I- 1 B(j+ I- 1,2k-j- 1+ 1)--q
here the expectation is for i.i.d.r.v.'s Xi, except that X 1 = Xk+l. After replacing
Pi by hi~N, the criterion To (or T~) simplifies to (see, e.g., Bhapkar, 1961, 1966)
Univariate and multivariate multisample location and scale tests 43
1 (k ;21) 2 k 1 k
For the scale tests with common (possibly unknown) location/x in the family
(2.4), the functions suggested corresponding to tests D and C are
(6.8)
~bc(j)= j
r-I
k+l
The location tests V or B Can also be used as criteria for testing the equality of
scale parameters for asymmetrical distributions, e.g., those encountered in life
studies or survival analysis. Reference may also be made to Chatterjee (1966)
for scale test based on U-statistics.
Deshpande (1970) has considered the problem of determining the relative
'weights', say b(j)/Eiq~(i), so as to maximize the efficiency (see Section 11) of
the T~-test for testing the hypothesis ~0 against location (or scale) alternatives
for the normal family.
For the special case k = 2, the T2-statistic (6.5) (except those for which ~b(j)
is constant valued, e.g. for the D test) reduces essentially to the two-sided
Mann-Whitney (1947) criterion. The Mann-Whitney U is defined as the num-
ber of pairs (Xltl, X2t2) for which X2t2"~Xltl, and it can be shown that the
Mann-Whitney test is equivalent to the Wilcoxon rank sum test for two
samples.
k
a~ m ~q(a)=,Ca)
in "t n '
a=l,., "~
p.
i=1
(i) S a m e as M(i).
(ii) T h e Sill's are subject to the linear constraint
k
~'~ ah,Si. = r/., (7.1)
i=l
L
N1/2[S. - ~/.(F)] ---}At(0, T(F)), (7.2)
k
~', airli(F) = r/ (7.4)
i=1
To = N ( S - j , . ) ' ( Z - L - X ) ( S - j . . ) , (7.5)
or T ; with 1/ replacing ~. in (7.5). Here L is some consistent estimator of
A (F) under N0. As in section (4), T] = To whenever ~/. = ~/; furthermore, To is
invariant under any choice of g-inverse Z-, while T] is invariant only asymp-
totically, in case ~/. ~ r/.
The use of To (or T]) as a large sample X2 criterion with (k - 1)p d.f. for
testing N0 is justified in view of
p
THEOREM 7.1. Assume conditions ~ and suppose F E ~o. If L--~A (F), then
L L
To. ~X2((k - 1)p). If ~. ~l and [In, - ~/1[= o(N-m), then T],---~xZ((k - 1)p).
THEOREM 7.2. Assume conditions c and suppose I1.. - .11 = o(N-I/2). Then the
To (or T~) test is consistent against all alternatives for which ~ (F) ~ j ~.
L
Then under the sequence of distributions {F(N)}, To. ~ X2((k - 1)p, ~:), where
= 8'(v)(x- a - l ( v ) ) n ( v ) . (7.7)
If Jill. - 7111= o(N-la), then T~. has the same limiting distribution.
46 Vasant P. Bhapkar
1 1
S} ~) = ~_, s (~) : ~'~ s(~)Z! ~) , (8.1)
ni t=l Rit(a) nl u=l u ~u
= A-1 _ j , (82)
P
In (8.2), H (~, F (~,~/refer to marginal c.d.f.'s for variables a, (a,/3) respectively for
distributions H, F etc., A ( f ) = [A~(F)] and
~ (~)
, , ( F ) = P e [ Z , , (el - 1 ] , u=l ..... N. (8.3)
It can be shown (see Puri and Sen, 1971) that, under suitable regularity
assumptions, ~(iii) and (iv) are satisfied for S. defined by linear rank statistics
(8.1); furthermore, the conditions (7.6) are satisfied for suitable sequences of
location and scale alternatives
The estimates L of A (F) can be obtained on replacing F (~), F (~'m in (8.2) by
the corresponding sample distribution functions.
For the special case where s (~
u
= su, we have J(~)= J, -n- / i (~ = r/n and
As in Section 5, the T] and To criteria coincide when rl, ~- rl, which is often
the case with linear rank statistics for the location problem. In particular, for
the multivariate analog of Kruskal-Wallis statistic (5.13) we have
here s(~)=s
u - - u = u/(N+ 1)-1, Ri is the vector of average ranks for the i-th
sample, and the T; analog is the multivariate version
1 nl nk
s !)=
' nln2''' nk ,1=1
Z " " tkg=1 ~)i()( X l t l ..... Xktk) ' (8.8)
k
71n = ' I -(~)-
--(~)- -- --, rl !~)(F) =--rll~)(F) = ~_~4a(~)(j)v~;)(F),
/=1 k " " " j=l
(8.10)
1
Z - ~(k - 1) [k2AT'l- k q j ' - kjq'+ qS],
with RI ") the rank of XI ~) in the k-tuplet {X]~). . . . . X~")}, Xi's being in-
dependent random vectors with c.d.f.'s F~, respectively. Furthermore,
where X1 . . . . . XZk are i.i.d, vectors with c.d.f. F except that X, = Xk+~.
If the same kernel function 4) is used for all variables a, we have ,7(~) = rl and
,L~ ( F ) = E[4,2(R~))]- n 2= Z,
48 Vasant P. Bhapkar
(8.13)
i=l
(2)
i r-a /~i -
. . . . .
/z. (9.1)
for i = 2 , . . . , k. Test criteria are available (see, e.g., Morrison, 1976), based on
the characteristic roots of a suitable determinantal equation, for testing such a
hypothesis of parallelism of profiles. This hypothesis is of interest with c o m -
m e n s u r a b l e variables, e.g. repeated measurements at different time points.
Discarding the distributional assumption of normality, Bhapkar and Pat-
terson (1977) formulated a nonparametric analog of the hypothesis of paral-
lelism, say ~ , in the form
for ~,!~.)fFI defined by (8.11). For the location family ~L (2.1), if the variables a
U x J
Univariate and multivariate multisample location and scale tests 49
again implies (9.2). (9.3) may be interpreted as the condition of parallelism for
parametric scalar profiles.
The formulation (9.2) is convenient for criteria based on U-statistics. Bhap-
kar and Patterson (1977) have proposed the criterion
in the notation of Section 8, for statistics $ defined by (8.8) with the same
kernel ~b(~)= ~b for all variables a. Here g = 1 / j ' E - l j and To is the statistic
(8.13) to be used as a X2((k - 1)/9) criterion for ~0, while 7'1 is proposed as a
X Z ( ( k - 1)(/7 - 1)) criterion for testing parallelism hypothesis formulated as ~1.
7"1 may be interpreted as a statistic testing i n t e r a c t i o n between populations and
variables; if this interaction is absent and, thus, the profiles may be considered
to be parallel, then 7"2 may be used as a c o n d i t i o n a l criterion for testing that the
profiles coincide, given that they are parallel.
For criteria based on linear rank statistics, Bhapkar (1982) has proposed the
corresponding formulation of nonparametric analog of parallelism hypothesis,
say
~i: ~:~u(F)-
(1) _ ~12u)(F). . . . . ~)(F), i = 1, ..., k , u _-- 1, ..., N,
(9.5)
for sc~u
t~)(F) defined by (8.3). The corresponding T1 criterion offered by Bhapkar
is
where S's are linear rank statistics defined by (8.1), To is (8.5) and g = 1 / j ' E - l j .
As with U-statistics, T1, 7"2 are to be used as X 2 ( ( k - 1)(p - 1)) and )(2((k - 1))
criteria for testing Y(~ and Y(0, conditional on Y(~ respectively. Alternatively, we
could consider T (or T~) on replacing r/, in (9.6) by ~/ (or h by AN in (5.7)).
The use of T1 criteria (9.4) or (9.6) for testing Ytl or Yg[ respectively, is
justified in view of Theorems 10.1 and 10.2 in the next section.
50 VasantP. Bhapkar
Tm, o = N---
A [S. - 7/.j]'[M@ Q]'[(M,~M'fl@ (QEQ') -1]
x [M @ Q][S. - 7 . i l , (10.2)
assuming that the same comparisons are made across all the variables, as in
remark (2) in Section 7. It is assumed that M ~ M ' is of full rank m < k. In (10.2) it
is the understanding that for the case m = k, e.g. with M = Ik, the inverse of
M,~M' is to be understood as a g-inverse.
The statistic TM, o has been suggested (Bhapkar, 1982) as a large-sample
xZ(mq) criterion for m < k, and X2((k - 1)q) criterion for m = k, for q ~<p.
Observe that T1k,~p= To, the criterion given by (7.5) with j @ 7/. replaced by
~.j for this special case of the same comparisons for all the variables. We also
note that Tl~,p = T1, the criterion given by (9.4) or (9.6) for parallelism of
profiles, when P is a ( p - 1)x p matrix of rank (p - 1) such that Pj = 0. This
assertion holds in view of the identity
THEOREM 10.1. Assume conditions ~ for {S.} and suppose that the sequence of
distributions {F(m} satisfies the conditions (7.6). Then, under {F(N)}, for the
sequence of statistics defined by (10.2)
Univariate and multivariate multisample location and scale tests 51
form >k,
L t X2(mq' ~)
TM, O,N
[ M @ Q I 6 ( F ) = 0.
The proof of Theorem 10.1 is straightforward for the case m < k, and follows
for the case m = k in view of the condition Zi ai6}~)(F) = 0, which follows from
the condition (7.4). These details are omitted.
THEOREM 10.2. Under conditions c~ with the same comparisons for all the
p
variables, TM, o,. ~ ~ for all F in ~ for which
[ M @ Q] [ r / ( F ) - -qj] / 0.
The proof is straightforward for the case m < k in view of (10.1) and (4.4). For
the case m = k, it follows in view of the relation (7.4). These details are
omitted.
These techniques are illustrated in Section 15 with a numerical example.
There are a couple of points that should be noted regarding the statistics of
type TM, O, given by (10.2), which include as special cases statistics T1, given by
(9.4) or (9.6). Here in Theorem 10.1, the sequence of distributions {F(N)} is
explicitly assumed to satisfy condition (7.6); then E is some consistent estima-
tor of the correlation matrix P(F), where A ( F ) = AP(F), as in Section 8. It is,
then, desirable to use some weighted 'within samples' type estimator E, as at
the end of Section 8, in view of (7.3).
The second point to be noted is the increase in the over-all significance level
if such statistics Tu, o are to be used simultaneously for several subhypotheses
with different M and/or Q, each at the same nominal significance level. T o
guard against the possibility that the over-all significance level increases beyond
the nominal level, it is desirable to restrict such subhypotheses tests only to
those planned in advance. Furthermore, the significance level, say cq, for the
j-th subhypothesis being tested, should satisfy the restriction
2
j=l
0~j = O r ,
52 Vasant P. Bhapkar
When two or more statistics are available for testing a given hypothesis, one
statistic is considered more efficient if it is more powerful than other statistics,
using the same level of signification, at the same fixed alternative.
Such a comparison of powers for two statistics based on the same data is
usually dependent on a, the level of significance, N, the sample size (or some
measure of sample sizes with several samples) and the fixed alternative at
which the powers are compared. In order to define a suitable measure of
efficiency an alternative approach is adopted comparing the corresponding
sample sizes necessary to attain equal power, say/3, at the same alternative for
two tests using the same level a. A limit argument is usually needed for this
measure to be independent of particular values a,/3; furthermore, one needs to
use then a sequence of alternatives converging to the null hypothesis at a
suitable rate in order to come up with a meaningful definition.
Accordingly, the asymptotic relative efficiency (ARE) measure e(St, $2) of
test $1 relative to test $2, developed by Pitman (1948), is defined as
where N1, N 2 ~ oo is such a way that the tests S~ at level a, based on samples of
size Ni, have the same limiting power/3 for a suitable sequence o f alternatives.
It is understood that the limit e(S1, $2) is independent of a and/3 ; furthermore,
it should also be independent of the sequence of alternatives if possible.
Consider the simple case of null hypothesis ~ 0 : 0 = 00, for a real parameter
0, and tests Sj rejecting ~0 if the corresponding test statistic Sj >I cj-~, j = 1, 2.
Under some regularity conditions it has been shown (see, e.g., Noether, 1955)
that
L
Nm(Sj. - r/j.(F)) --* N(0, T(F)),
where ~/j.(F)~ qj(F). Let T0j be the test rejecting Y(0 if T0j/> cj~, where the
statistic T0j is given by the expression (4.1) in terms of Sj, j = 1, 2. Assuming
conditions (4.5) and (4.6), from Theorem 4.4 we have under the sequence {F(m}
L
Toj, ~ x 2 ( k - l, ~j) , j = l , 2
where the noncentrality parameter ~:j is 8~T-Sj. It can be shown (see, e.g.,
Andrews, 1954) that the A R E of test T0a relative to 7"o2is given by
where not all /zi are equal with Zi/zi = 0. For the test criteria To in Section 5,
based on linear rank statistics, it can be shown (e.g. Andrews, 1954; Puri, 1964)
that ~3 in (4.5)is then equal t o TL(F)~, SO that sc = (T2L/A)~,ipiOzi-/2) 2, where
/2 = Ei pilzi, A is given by (5.5) and
~,L(F)=f~{~J[F(x)l}dF(x). (11.4)
For the tests using statistics To, given by (6.6) based on U-statistics,
in (4.5) can be shown to be again equal to y~(F)l.t, so that s =
{[y~(k - 1)]2/Ak2}Zip~(IXi-/2) 2, where A is given by (6.5) and
Note that this expression is the same for all k, the number of samples. In
particular, for the KruskaI-Wallis H-test, given by (5.13), we have
for a given by (6.5) and yl~ by (11.5). For any two tests, say T01 and T02 , the
efficiency can be obtained as e(T01, T02)= e(Tm, T)/e(To2, T).
For the univariate scalar problem, consider the sequence of scale alternatives
where not all Ai are equal and Ei zl~ = 0. Then for statistics in Section 5,
8 = ys(F)A, where
simple and clear-cut as in the univariate case. The A.R.E. of test T1 with
respect to T2 for testing some hypothesis ~ is still given by the formula (11.2),
provided T1 and T2 have the same (viz. X2 distribution with the same d.f.)
distribution under ~ and noncentral - X 2 distributions, under a suitable
sequence of alternatives, with noncentrality parameters ~:1 and ~:2, respectively.
However, in the multivariate case the ratio ~q/~:2 depends not only on F, the
family of distributions under conditions (7.6), as in the univariate case, but also
on the 'directions' in which the sequence of alternative distributions {F(m }
converges to the null case F.
For instance, consider the location model with the sequence of location
alternatives F(m = [Fum . . . . . F~(m ], where
k
= E P,(i~i- ft)'IF'(F)A-I(F)F(F)(/,,-/.2), (11.13)
i=1
or
= (k Z21)2 Ek Pi(Iati- ~t)'IF'(F)A-I(F)IF(F)(I&- ~t), (11.14)
i=1
for linear rank statistics and U-statistics cases respectively, with ~ = Eipi/~i and
A (F) defined as in Section 8. We note here that the expressions for A (F), as
for F ( F ) , differ from one statistic to another.
In the special case with the same comparison function (or system of scores)
for all variables a, the expressions (11.13) and (11.14) reduce respectively to
~2 k
= ~- ~ P,(I*,- Pi)'P-I(F)(P t, - ki), (11.15)
and
= Y2( kak2- 1)2 ~ P,(/*, - ~)'p-*(F)(I,t,- fi) (11.16)
i=1
In this chapter we have described location and scale tests for several samples
based on linear rank statistics and U-statistics. Some other types of statistics
have been considered, mostly in the univariate case, in this problem of
comparing several samples.
One such class of statistics is an extension of Kolmogorov-Smirnoff statistics
and Cram6r-von Mises statistic based on the empirical distribution functions
(see, e.g., Kiefer, 1959; Birnbaum and Hall, 1960). The distribution theory of
such statistics based on empirical distribution functions uses techniques
different from the ones used here; also the limiting distributions, even in the
null case, turn out to be somewhat nonstandard compared to the standard
chi-squared (or, in simpler cases, normal) distributions encountered in this
chapter. For this reason, tests based on empirical distribution functions have
not been discussed in this chapter.
Another class of tests uses statistics which are based on counts of obser-
vations from various samples that happen to fall between the specified Order-
statistics of a particular sample (see, e.g., Sen and Govindarajulu, 1966). Some
other tests use counts of observations from the sample that contains the largest
observation (see, e.g., Mosteller, 1948). Again, most of these tests deal only
with the univariate case; furthermore, the techniques and the limiting dis-
tributions happen to be different. For these reasons, and also in view of
possible arbitrary choice of particular samples, such procedures have not been
discussed here in this chapter.
Also, in this chapter we have considered rank tests based on U-statistics of
only the simplest type, i.e., which have kernel function ~b defined for k-tuplets
with just one observation selected from each of the k samples. In theory,
U-statistics can be based on more general type of kernel functions. Apart from
the notational complexity and minor modifications needed to accommodate such
more general kernel functions, the essentials are covered by our discussion of
the simplest case in Sections 6 and 8. As illustrations of univariate rank tests
based on U-statistics with more general kernel functions for the case of two
samples, see for example Sukhatme (1958) and Tamura (1960).
Univariate and multivariate multisample location and scale tests 57
14. Studentization
The rank statistics discussed so far in the earlier sections compare the k
samples with respect to the feature of interest assuming that the population
distributions have identical other features, e.g., the shape or functional form.
Thus, in the location problem, we postulate the location family ~L defined by
(2.1) with common c.d.f.F. This does presuppose equal scale parameter 0~~),
i = 1. . . . . k, for each variable a if a more general location and scale family ofLS
defined by (2.3) is taken into consideration.
Similarly, in a scalar problem we postulate the scale family ffs defined by
(2.2); in the more general ~Ls family it does then, in effect, presuppose equal
locations or known locations, if they are unequal.
The rank statistic discussed earlier need some modifications if these are to be
used for the more general location and scale family ~LS with unknown, and
possibly unequal, location or scale parameters, while comparing the scale or
location parameters, respectively, of the k populations. Such a process of
modification is usually termed studentization of the corresponding statistics.
Furthermore, additional regularity assumptions are usually needed in order
that the modified tests remain at least asymptotically distribution-free.
For instance, consider the scalar problem with k univariate samples. Exact
58 Vasant P. Bhapkar
For reference purposes we first give the sample means and standard deviations
of these variables for different groups. See Table 1.
Table 1
Sample/Variables 1 2 3 4
Table 2
Sample/Variables 1 2 3 4
Note here that large values of sample mean generally correspond to small
values of S}") in view of the definition of function ~bv in (6.7). Thus, the sample
profiles plotted in terms of S}~) show general patterns opposite to those shown
by sample means; however, the differences among S-values are commensurable
in contrast to those among sample means.
The value of the over-all statistic, given by (8.13), for testing the hypothesis
Y(0 of homogeneity is
V0 =114, d.f.=20,
each on 5 d.f. The first two are not significant, while the latter two are highly
significant.
For testing the parallelism hypothesis Yga, as formulated by (9.2), the value of
statistic (9.4) is
1/1=80, d.f.=15.
This also is highly significant; indicating that the population profiles are not
parallel.
Suppose we look at the sets of variables {1, 2} and {3, 4} separately, on the
basis of the nature of these variables. In the notation of Section 10, we are then
considering the two choices for matrix Q as
01
[~ 10 ~] and [ ~ 0 0 0]
{1, 2} {3, 4}
V0 = 9 V0 = 85 d.f. = 10
V1 = 1 V1 = 12 d.f. = 5;
again the results for {1, 2} are not significant, while they are highly significant
for the set {3, 4}.
Next, we wish to consider specific comparisons among the 6 samples. First
we wish to compare Girls with Boys, i.e. the first three populations with the
latter three. This can be accomplished by the technique in Section 13 by letting
M = [1 1 1 - 1 - 1 -1]. Taking Q = I4 and P in (10.2) with P defined as in
Section 3, we have
respectively. These values indicate that the sex effect is significant on the 4
variables, and so is the sex-variables interaction.
For testing the occupation effects, we take
M=[11 0 1 1
0-1 1 0- '
References
Andrews, F. C. (1954). Asymptotic behavior of rank tests for analysis of variance. Ann. Math. Stat.
25, 724-736.
Bhapkar, V. P. (1961). A nonparametric test for the problem of several samples. Ann. Math. Star.
32, 1108-1117.
Bhapkar, V. P. (1966). Some nonparametric tests for the multivariate several sample location
problem. In: Proc. 1st Internat. Syrup. Muir. Analysis, pp. 29-42.
Bhapkar, V. P. (1979). Nonparametric tests for scalar profile analysis of several multivariate
samples. Annals Inst. Stat. Math. 31, 9-20.
Bhapkar, V. P. (1980). ANOVA and MANOVA: Models for categorical data. Handbook of Statistics,
Vol. 1. North-Holland, Amsterdam, pp. 343-387.
Bhapkar, V. P. (1982). On nonparametric profile analysis of several multivariate samples. Dept. of
Stat., Univ. of Kentucky, Technical Report No. 194.
Bhapkar, V. P. and Deshpande, J. V. (1968). Some nonparametric tests for multisample problems.
Technometrics 10, 578-585.
Bhapkar, V. P. and Patterson, K. W. (1977). On some nonparametric tests for profile analysis of
several multivariate samples. J. Multiv. Anal. 7, 265-277.
Bhapkar, V. P. and Schwartz, J. H. (1979). Efficient computation of Bhapkar's V Statistic and
significance points for its use in small samples. J. Star. Comp. Simul. 10, 1-14.
Birnbaum, Z. W. and Hall, R. A. (1960). Small sample distributions for multisample statistics of the
Smirnov type. Ann. Math. Stat. 31, 710-720.
Capon, J. (1961). Asymptotic efficiency of certain locally most powerful rank tests. Ann. Math.
Stat. 32, 88-100.
Chatterjee, S. K (1966). A multisample nonparametric scale test based on U-Statistics. Bull.
Calcutta Stat. Assoc. 15, 109-119.
Chinchilli, V. M. and Sen, P. K. (1982). Multivariate linear rank statistics for profile analysis. J.
Multiv. Anal. 12, 152-171.
Deshpande, J. V. (1970). A class of multisample distribution-free tests. Ann. Math. Stat. 41,
227-236.
Fisher, R. A. and Yates, F. (1938). Statistical Tables for Biological, Agricultural and Medical
Research. Oliver and Boyd, Edinburgh.
Fligner, M. A. and Policello, G. E. (1981). Robust rank procedures for the Benrens-Fisher
problem. J. Amer. Statist. Assoc. 76, 162-168.
Ghosh, M. (1982). Rank statistics and limit theorems. Handbook of Statistics, Vol. 4. North-
Holland, Amsterdam.
Hajek, J. and Sidak, Z. (1967). Theory of Rank Tests. Academic Press, New York.
Hodges, J. L., Jr., Kretch, D. and Crutchfield. R. S. (1975). Statlab. McGraw-Hill, New York.
Hodges, J. L., Jr. and Lehmann, E. L. (1956). The efficiency of some nonparametric competitors of
the t-test. Ann. Math. Stut. 27, 324-335.
Iman, R. L., Quade, D. and Alexander, D. A. (1975). Exact probability levels for the Kruskal-
Wallis test. Selected Tables in Mathematical Statistics 3, 329-384.
Kiefer, J. (1959). K-sample analogues of the Kolmogorov-Smirnov and Cramer-von Mises tests.
Ann. Math. Stat. 30, 420--447.
Kruskal, W. H. (1952). A nonparametric test for the several sample problem. Ann. Math. Stat. 23,
525-540.
62 Vasant P. Bhapkar
Kruskal, W. H. and Wallis, W. A. (1952). Use of ranks in one-criterion variance analysis. J. Amer.
Statist. Assoc. 47, 583--621.
Mann, H. B. and Whitney, D. R. (1947). On a test of whether one of two random variables is
stochastically larger than the other. Ann. Math. Stat. 18, 50-60.
Milton, R. C. (1964). An extended table of critical values for the Mann-Whitney (Wilcoxon)
two-sample statistic. 3. Amer. Statist. Assoc. 59, 925-934.
Mood, A. M. (1954). On the asymptotic efficiency of certain nonparametric two-sample tests. Ann.
Math. Stat. 25, 514--522.
Morrison, D. F. (1976). Multivariate Statistical Methods. McGraw-Hill, New York, 2nd ed.
Mosteller, F. (1948). A k-sample slippage test for an extreme population. Ann. Math. Star. 19,
58--65.
Noether, G. E. (1955). On a theorem of Pitman. Ann. Math. Star. 26, 64-68.
Patterson, K. W. (1975). A study of multivariate U statistic techniques with applications to profile
analysis. Unpublished Ph.D. Dissertation, Univ. of Kentucky.
Pitman, E. J. G. (1948). Notes on nonparametric statistical inference. Unpublished notes, Columbia
University, New York.
Puri, M. L. (1965). Asymptotic efficiency of a class of c-sample tests. Ann. Math. Stat. 35, 102-121.
Puri, M. L. and Sen, P. K. (1969). A class of rank order tests for a general linear hypothesis. Ann.
Math. Stat. 40, 1325-1343.
Puri, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis. Wiley, New
York.
Raghavachari, M. (1965). The two sample scale problem when the locations are unknown. Ann.
Math. Star. 36, 1236-1242.
Rust, S. W. and Fligner, M. A. (1982). A modification of the Kruskal-Wallis statistic for the
generalized Behrens-Fisher problem. Dept. of Stat., University of Kentucky, Technical Report
No. 192.
Sen, P. K. and Govindarajulu, Z. (1966). On a class of c sample weighted rank-sum tests for
location and scale. Ann. Inst. Star. Math. 18, 87-105.
Sugiura, N. (1965). Multisample and multivariate nonparametric tests based on U-statistics and
their asymptotic efficiencies. Oskaka J. Math. 2, 385-426.
Sukhatme, B. V. (1958). Testing the hypothesis that two populations differ only in location. Ann.
Math. Stat. 29, 60-78.
Tamura, R. (1960). On the nonparametric tests based on certain U-statistics. Bull. Math. Stat.
(Japan) 9, 61-68.
Tamura, R. (1966). Multivariate nonparametric several sample tests. Annals Math. Star. 37,
611-618.
Van der Waerden, B. L. (1953). Ein neuer test fur das Problem der zwei Stichproben. Math.
Annalen. 126, 93-107.
Verdooren, L. R. (1963). Extended tables of critical values of Wilcoxon's test statistic. Biometrika 50,
177-186.
P. R. Krishnaiah and P. K. Sen, eds., Handbookof Statistics, Vol. 4 '~
J
Elsevier Science Publishers (1984)63-78
Hypothesis of Symmetry
Marie H u g k o v d
1. Introduction
63
64 Marie Hu~kovd
tive distribution function F E o~p, where ,~p is the family of p-variate con-
tinuous diagonally symmetric distribution functions.
The hypothesis Hl,p is invariant under the groups of transformations Gnl and
(3,2. The group G,~ consists of all transformations
where the hi's are continuous, odd, and strictly increasing. The group Gn2
consists of all mappings
i.e. Gn2 consists of 2" distinct elements. The maximal invariant under G,1 and
G.2 is the set
where R)~ is the rank of [Xjil in the sequence [Xli [. . . . . [Xn/] and sign(x) = - 1 if
x < 0, sign(x) = 0 if x = 0, sign(x) = 1 if x > 0 (see Lehman, 1959). Thus the test
for {HLp, Kp(O+)} invariant under G,1 and G,2 will depend on X1. . . . . X,
through {Rj~, sign(X~i);j = 1. . . . . n, i = 1 . . . . . p}. The statistics generated by
this maximal invariant are called rank statistics for hypothesis of symmetry and
the test generated by these statistics are called the rank statistics for Hz,p.
The most frequent alternative hypothesis is hypothesis of shift in location,
i.e.:
Kp(O): X1. . . . , X, are independent, identically distributed, p-dimensional
random vectors, Xj is distributed according to the distribution function
F(x - 0), where F E ofp, 0 = (01. . . . ,0~)' is a vector parameter, O E 69, t~ # {9 (~ Rp.
The most frequent alternatives correspond to
REMARK 1. Bell and Haller (1969) and Riischendorf (1976) among others were
interested in different concept of multivariate symmetry hypothesis. They
considered the hypotheses generated by some group G of transformations for
the class o%~ of distributions such that P ~ o~c:>gP ~ @o for all g @ G.
REMARK 2. Other alternatives than Kp(O) were also considered; e.g. (see
Hu~kovfi 1971):
Hypothesis of symmetry 65
REMARK 3. It should be noted that most statistics for H,,p arise as analogs of
statistics for the hypothesis of randomness. Many properties (not all) of
statistics for the hypothesis of randomness can be easily transferred to statistics
for Hl,p. Consequently, not all results for statistics for ~/1,p have been pub-
lished.
In the next two sections tests for the univariate and the multivariate
hypotheses of symmetry are surveyed separately.
2. Univariate case
We shall write here /-/1, K(O), and X/ instead of H1,1, KI(0 ) and Xjl,
respectively.
Usually, the test for H1 against K ( O ) is based on the signed rank statistic
S + = ~ sign(Xi)an(R+), (2.1)
i=1
where a , ( 1 ) , . . . , an(n) are known scores. These scores are often related to
some square-integrable function ~0, defined on (0, 1), in the following way:
where [b] denotes the largest integer not exceeding b. Then we shall write
S~(~0) instead of S~+.Typically, one chooses
where U m < . < U(n} is the ordered sample corresponding to a sample of size
66 Marie Hugkovd
0< ~ +~
-00
( f ' ( x ) ) 2 ( f ( x ) ) -' dx < +oo.
T h e c o r r e s p o n d i n g h y p o t h e s e s are d e n o t e d by H T and K * ( O ) .
T h e main results on the a s y m p t o t i c distribution of S~+ can be s u m m a r i z e d as
follows.
lim m a x a an = 0 (2.6)
n--~ l~j<-n "=
Hypothesis of symmetry 67
then
S. a.(.i)) H, ~ N(O, 1) as n ~ .
as n ~oo
is locally optimal (more precisely, locally most powerful test for {H1, K*(O)+};
H~ijek and Sidfik, 1967, p. 174). The test with critical region
S+~ = ~] ci s i g n ( X i ) a . ( R ~ { ) ,
i=1
where c~. . . . . c, are known regression constants, is used. The properties of Sc+
are analogous to those of S +.
n
EH, S + = 0, varn, S~+ = a2(i) ~_, c 2 .
i=1 j=l
If hm._.=
' maxl~i~, ci(Zj=l
2 n c]) -1 = 0 and (2.6) are satisfied then
cj sign(Xj)a,(R~-, f ) ,
j=l
where aN(i) is given by (2.8), provides both the locally and the asymptotically
optimal tests for HT against (1.4) with 0 = {A ; A > 0} and O, = {An-m; A > 0},
respectively. These results can be derived in a manner analogous to the case
{Hb K(O)} or {HT, K*(O)}. For other results the reader is referred to, e.g.
Hu~kovh (1971).
There is also another class of statistics for //1 based on the empirical
distribution function and introduced by Smirnov (1947). They are called
Kolmogorov-Smirnov type statistics for /-/1. They can be used for a rather
broad class of alternatives; for example testing {H1, K(O)}, O-arbitrary
nonempty or for testing/-/1 versus the general alternative:
V, = 2 -1 f n(F,(x)+ F , ( - x ) - 1)2 d F . ( x ) .
F ( x ) + F ( - x + ) = 1 for all x,
S + = ~ sign(X/),
j=l
called the sign statistic. S + equals the difference of the number of positive and
negative X/. The test based on S + is called the sign test. Under H1,
W + = ~ sign(Xj)R].
i'=1
The corresponding test is called the one-sample Wilcoxon test. Under H1,
W+, leads to both the locally and asymptotically optimal test if the underlying
distribution is logistic. The test was introduced by Wilcoxon (1945). For tables
see Wilcoxon (1947), Hfijek (1955), Owen (1962), Selected tables in mathema-
tical statistics, Vol. 1. Normal approximation is good for n t> 15.
The Fraser test. This test employs the statistic
S n+ -- - 2 sign(Xj)a,(R~)
j=l
where a.(i) = E I V[(i), with IVl(1) ~ . - . ~ [V[(,) being the ordered absolute values
corresponding to a sample from N(0, 1) of size n. Under H1,
If the underlying distribution is normal S+, leads to the locally and asymptotic-
ally optimal test. The test was derived by Fraser (1957) and later studied and
tabulated by Klotz (1963).
The one-sample van der Waerden test. This test is also asymptotically optimal
for the normal distribution and is determined by the statistic
Under H1,
This test was introduced by van Eeden (1963). For tables see Owen (1962).
3. Multivariate case
The tests for Hl.p in the multivariate case are based on the vector of
univariate rank statistics, i.e. on
+
S n+ = (Sn+l . . . . . Snp) , (3.1)
Hypothesis of symmetry 71
where X;,n is the generalized inverse matrix of "~p,n, is usually used. Denote by
~n the minimal tr-field generated by
where X2(p) is the upper critical value of the x2-distribution with p degrees of
freedom, can be used for {Hi,p, Kp(O)}. In contrast to the univariate case, the
vectors
(sign(Xn) . . . . . sign(Xn!), , sign(Xlp) . . . . . sign(Xnp))'
72 Marie Hugkovd
and
(R-~I, . . . . Rnl+ . . . . . Rlp,..,R,p),+ + ' p>~2,
En,.,(S,+ I ~ , ) = E~/L,($,+) = o,
varn,,,(S+ I ~,)= ,~p,,, varH1,S + = E~,.,~p,,,
n-l(Xp., - varnLpS,+)~ 0 in probability as n ~
holds. Denote by Fj and Fjk the distribution functions of Xj and (Xj, Xk),
j, k = 1 , . . . , p, resp.; and by F~' and F ~ the distribution functions of IXj[ and
([xA, [Xk[), L k = 1. . . . . p, resp. Put
: fo dF (x, y),
sign(x)~j(FT(lxl))f~(x) dx.
In the following two theorems the main results on the asymptotic distributions
of S + and Q+ under H~,o and under local alternatives are summarized.
ThEOReM 3.1. Let (3.3) be satisfied, and let the matrix ~ defined by (3.10) be
positive definite. Then, under H~,p,
5~(S+[H,,e)L Np(O, ~ ) as n ~ ,
and
5g(Q+ I HI.p)Z> xZ(p) as n-->~,
where ~ ( . . . I Hi,p) denotes the conditional distribution, given sg,, under Hi,p, Iv
is the p p identity matrix and X2(p) is the x2-distribution with p degrees of
freedom.
~(n-1/2S+)~Np(p,Z) as n ~ ,
and
:> x2(p, .'Z-it,),
where X2(., .) denotes the noncentral x2-distribution.
The simplest test for {Hl,p, Kp(O)} are the multivariate extensions of the
univariate sign test which are due to Hodges (1955), Blumen (1958), and
Bennet (1962) among others. Bickel (1965) considered a quadratic rank statistic
based on the coordinatewise one-sample Wilcoxon statistics. Puri and Sen
(1967) introduced and studied the general class of quadratic rank statistics Q~+.
They derived the asymptotic distribution of S~+ and Q~+ under general alter-
natives. Hu~kovfi (1971) treated more general rank statistics (for testing Hl,p
against the alternative (1.4)). Namely, she considered
QC -S~Zp,~S~
__ +~ -- + and S~+ = (S~ . . . . . S~)
+ , I
where
S~ = ~ cj~ sign(Xj~)a~(R+), i = 1. . . . . p,
]=1
and
~p,~ = varnl~(Sc [M~),
and obtained the asymptotic distribution of S~+ and Q~+ under the hypothesis
Hl,p under local alternatives, and under general alternatives.
Most of the known results on rank tests for Hl,p can be found in the book by
Puri and Sen (1971).
The multivariate sign test. This test is based on Q~+ given by (3.6) with
By direct computations one obtains the elements of the variance matrix of S+~
under HI,,:
The multivariate sign test is often used for testing the hypothesis that the Xj
have zero median vectors against the alternative hypothesis (the distribution
74 Marie Hugkovd
rl 4-
S +, = (n + 1)-1 E Rji sign(Xji), i = 1 , . . . , p, (3.12)
i=1
and
O'p.lk = (n + 1)-z 2 R j +
i R j k+ sign(Xji) sign(Xjk), i, k = 1, . . . . p . (3.13)
j=l
O'ik = Fi(x)Fk(y)dFik(x,y), i, k = 1 . . . . . p.
f0 ~ f0 ~ ~
x@-I +1 2 , k , i = 1. . . . . p. (3.15)
O'ik = fo f; crP-l(F*(x)2+1))~_l(F~(y)+
\ 2 1))dF/~(x ' y),
i,k=l ..... p.
Hypothesis of symmetry 75
w 2
2a(T~IH~,p)--,a, (p;O) as n ~
and
m w 2
~(TZ~ l gp(~/nO))--> X (p; O'W-IO) as n ~ .
e + 2 /~'X-1/~
o.,r. O,W-tO ,
where/L and X are defined by (3.10) and W is the variance matrix of Xj.
If the distribution of Xj is multivariate normal with nonsingular variance
matrix W then
lim e o,.r,
+ 2 = 1,
for Q,+ with S:i and O'p,ik given by (3.14) and (3.15) resp.
Some inequalities for further cases can be found in Puri and Sen (1971),
Bickel (1965), Bhattacharya (1967).
References
Albers, W. (1979). Asymptotic expansions for the power of one-sample against contiguous
nonparametric alternatives. In: Proceedings of the 2nd Prague Symposium on Asymptotic Statis-
tics. North-Holland, Amsterdam, 105-117.
Albers, W., Bickel, P. and van Zwet, W. R. (1976). Asymptotic expansion for the power of
distribution free tests in the one-sample problem. Ann. Statist. 4, 108-156.
Anderson, T. W. (1966). Some nonparametric multivariate procedures based on statistically
equivalent blocks. In P. R. Krishnaiah, ed., Proc. 1st lnternat. Symp. Mult. Analysis, pp. 5-27.
Bell, C. B. and Haller, H. S. (1969). Bivariate symmetry tests: parametric and nonparametric. Ann.
Math. Statist. 40, 259-269.
Bennet, B. M. (1964). On bivariate sign test, JRSS B 26, 457-461.
Bennet, B. M. (1962). On multivariate sign tests. JRSS B 24, 159-161.
Bhattacharya, G. K. (1967). Asymptotic efficiency of multivariate normal score test. Ann. Math.
Statist. 38, 1753-1758.
Bickel, P. (1965). On some asymptotically nonparametric competitors of Hotelling's T 2. Ann.
Math. Statist. 36, 160-173.
Blumen, J. (1958). A new bivariate sign test..IASA 53, 448-456.
Butler, C. C. (1969). A test for symmetry using the sample distribution function. Ann. Math. Statist.
40, 2209-2210.
Chatterjee, S. K. (1966). A bivariate sign test for location. Ann. Math. Statist. 37, 1771-1782.
Conover, W. J. (1973). Rank test one sample, two samples and k samples without the assumption
of a continuous distribution function. Ann. Statist. 1, 823-831.
Conover, W. J., Wehnmanen and Ramsey, J. B. (1978). A note on the small-sample power
functions for nonparametric tests of location in the double exponential family. Journ. Amer.
Statist. Assoc. 73, 188-190.
Davidson, R. R. and Bradley, R. A. (1969). Multivariate paired comparisons; the extension of a
univariate model and associated estimation and test procedures. Biometrika 56, 81-95.
Davidson, R. R. and Bradley, R. A. (1970). Multivariate paired comparisons: Some large sample
results on estimation and tests of equality of preference. In: M. L. Purl, ed., Nonparametric Tech.
in Statist. Inf. Cambridge Univ. Press, pp. 111-125.
Dietz, E. J. (1982). Bivariate nonparametric tests for the one-sample location problem. J. Amer.
Statist. Assoc. 77, 163-169.
Dixon, W. J. and Mood, A. M. (1946). The statistical sign test. J. Amer. Statist. Assoc. 41, 557-566.
Doksum, K. and Thompson, R. (1971). Power bounds and asymptotic minimax results for one
sample rank tests. Ann. Math. Statistis. 42, 12-34.
Fellingham, S. A. and Stoker, D. J. (1964). An approximation for the exact distribution of the
Wilcoxon test for symmetry. J. Amer. Statist. Assoc. 59, 899-905.
Finch, S. J. (1977). Robust univariate test of symmetry. J. Amer. Statist. Assoc. 72, 387"392.
Fraser, D. A. S. (1957). Most powerful rank-type tests. Ann. Math. Statist. 28, 1040-1043.
Gupta, M. K. (1967). An asymptotically nonparametric test of symmetry. Ann. Math. Statist. 38,
849-866.
Hfijek, J. (1955). Some rank distribution and their use (Czech.) Casopis pro p~st. matematiky 80,
17-31.
Hfijek, J. (1969). Nonparametric Statistics. Holden-Day, San Francisco.
Hfijek, J. and Sidfik, Zb. (1967). Theory of rank tests. Academia, Prague.
Hemelrijk, J. (1952). A theorem on the sign test when ties are present. Indagationes Math. 14,
322-326.
Hill, D. L. and Rao, P. V. (1977). Tests for symmetry based on Cram6r-von Mises Statistics.
Biometrika 64, 489-494.
Hodges, J. L. Jr. (1955). A bivariate sign test. Ann. Math. Statist. 26, 523-527.
Hollander, M. (1971). A nonparametric test for bivariate symmetry. Biometrika 58, 203-212.
Hollander, M. and Wolfe, D. A. (1973). Nonparametric statistical methods. Wiley, New York.
Hu~kovfi, M. (1970). Asymptotic distribution of linear rank statistics used for testing symmetry. Z.
Wahrsch. Verw. Gebiete 14, 308-322.
Hypothesis of symmetry 77
Hugkovfi, M. (1971). Asymptotic distribution of rank statistics used for multivariate testing
symmetry. J. of Multivariate Analysis 1, 461-484.
Irle, A. and K16sener, K. (1980). Note on the sign test in the presence of ties. Ann. Statist. 5,
1168-1170.
Joffe, A. and Klotz, J. (1962). Null distribution and Bahadur efficiency of the Hodges bivariate sign
test. Ann. Math. Statist. 33, 803-807.
Klotz, J. (1963). Small sample power and efficiency for the one sample Wilcoxon and normal scores
test. Ann. Math. Statist. 34, 624-632.
Klotz, J. (1964). Small sample power of the bivariate sign tests of Blumen and Hodges. Ann. Math.
Statist. 35, 1576-1582.
Klotz, J. (1965). Alternative efficiences for signed rank tests. Ann. Math. Statist. 36, 1759-1766.
Koul, H. L. and Staudte, R. G. Jr. (1972). Asymptotic normality of signed rank statistics. Zeitschrift
Wahrsch. Verw. Gebiete 22, 293-300.
Koul, H. L. and Staudte, R. G. Jr. (1972b). Power bounds for Smirnov statistics in testing the
hypothesis of symmetry. Michigan State University.
Koziol, J. (1979). A test for bivariate symmetry based on the empirical distribution function.
Comm. Statist. A 8, 207-221.
Koziol, J. (1980). On a Cramer-von Mises type statistic for symmetry. J. Amer. Statist. Assoc. 75,
161-167.
LeCam, L. (1960). Locally asymptotically normal families of distributions. Univ. Calif. Publ. in Stat.
3, 37-98.
Lehman, E. L. (1959). Testing Statistical Hypothesis. Wiley, New York.
Mac Kinnon, W. J. (1964). Table for both the sign test and distribution-free confidence intervals of
the median for sample sizes to 1000. J. Amer. Statist. Assoc. 59, 935-956.
Orlov, A. J. (1972). On testing symmetry of distribution. Th. Probab. Applic. 17, 357-361.
Owen, B. (1962). Handbook of Statistical Tables. Addison-Wesley, Reading, MA.
Prfigkovfi, Z. (1976). Asymptotic expansion and local limit theorem for the signed Wilcoxon
statistic. CMUC 17, 335-344.
Pratt, J. W. (1959). Remark on zeros and ties in some nonparametric tests. Ann. Math. Statist. 26,
368-386.
Puri, M. L. and Sen, P. K. (1969). On the asymptotic normality of one sample rank order statistics.
Teoria Verojatnost. i Primenen. 1, 167-172.
Puri, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis. Wiley, New
York.
Puri, M. L. and Shane, H. D. (1970). Statistical inference in incomplete block designs. In: M. L.
Puri, ed., Nonparametric Techniques in Statistical Inference. Cambridge Univ. Press, pp. 131-153.
Putter, J. (1955). The treatment of ties in some nonparametric tests. Ann. Math. Statist. 26,
368-386.
Rothaman, E. D. and Woodroofe, M. (1972). A Cramrr-von Mises type statistic for testing
symmetry. Ann. Math. Statist. 34, 2035-2038.
R/ischendorf, L. (1976). Hypothesis generating groups for testing multivariate symmetry. Annals of
Statist. 4, 791-795.
Russel, C. T. and Puri, M. L. (1974). Joint asymptotic multinormality for a class of rank order
statistics in multivariate paired comparisons. J. Multivariate Analysis 4, 88-105.
Savage, J. R. (1959). Contribution to the theory of rank order statistics; one-sample case. Ann.
Math. Statist. 30, 1018-1023.
Selected Tables in Mathematical Statistics VoL 1. Edited by the Institute of Mathematical Statistics,
Vol. 2, pp. 171-260.
Sen, P. K. (1967). Nonparametric tests for multivariate interchangeability. Part I. Sankhya Ser. A
29, 351-372.
Sen, P. K. (1970). On the distribution of the one-sample rank order statistics. In: M. L. Puri, ed.,
Nonparametric Tech. Statist. Inf. Cambridge Univ. Press, pp. 53-72.
Sen, P. K. (1971). On a class of aligned rank order tests for multiresponse experiments in some
incomplete block design. Annals of Ma~h. Statist. 42, 1104-1112.
78 Marie Hugkov6
Sen, P. K. (1979). On some distribution-free tests for affine symmetry. Calcutta Statist. Assoc. Bull.
27, 105-108, 59-79.
Sen, P. K. and David, H. A. (1968). Paired comparison for paired characteristics. Ann. Math.
Statist. 39, 200-208.
Sen, P. K. and Puri, M. L. (1967). On the theory of rank order tests for location in the multivariate
one sample problem. Ann. Math. Statist. 38, 1216-1228.
Shane, H. D. and Puri, M. L. (1969). Rank order tests for multivariate paired comparisons, Ann.
Math. Statist. 40, 2101-2117.
Shirahata, S. (1974). On tests of symmetry for distrete populations. Austral. J. Statist. 6, 83-90.
Smirnov, N. V. (1947). On criteria for the symmetry of distribution laws of random variables.
Doklady Akademii Nauk SSSR, 11-14.
Snijders, T. (1981). Rank tests for bivariate symmetry. Ann. Statist. 9, 1087-1095.
Srinivan, R. and Godio, L. B. (1974). A Cram6r-von Mises type statistic for testing symmetry.
Biometrika 61, 196-198.
van Eeden, C. and Benard, A. (1957). A general class of distribution-free tests for symmetry
containing the tests of Wilcoxon and Fisher I, II, III. lndagationes Math. 19 (Proc. Kon. Nederl.
Akad. Wet. 60), 381-391,392-400, 401-408.
van Eeden. C. (1963). The relation between Pitman's asymptotic relative efficiency of two tests and
the correlation coefficient between their test statistics. Ann. Math. Statist. 34, 1442-1451.
van der Waerden, B. L. and Nievergelt, E. (1956). Tafeln zum Vergleich zweier Stichproben mittels
x-Tests und Zeichentest, Springer, Berlin.
Vorli~kov~, D. (1972). Asymptotic properties of rank tests of symmetry under discrete dis-
tributions. Ann. Math. Statist. 43, 2013-2018.
Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics Bull. L 80-83.
Wilcoxon, F. (1947). Probability tables for individual comparisons by ranking methods. Biometrics
3, 119-122.
Wilcoxon, F. (1949). Some rapid approximation procedures. American Cyanamid Company, Stan-
dord.
Yanagimoto, T. and Sibuya, M. (1972). Test of symmetry of a one-dimensional distribution against
positive biasedness. Ann. Inst. Statist. Math. 24, 423-434.
Yanagimoto, T. and Sibuya, M. (1976). Test on symmetry of a bivariate distribution. Sankhya Ser.
A 38, 105-115.
Walsh, J. E. (1951). Some bounded signification level properties of the equal-tail sign test. Ann.
Math. Statist. 22, 408-417.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 A
O Elsevier Science Publishers (1984) 79--88 --I"
Measures of Dependence
Kumar Joag-Dev
O. Introduction
79
80 KumarJoag-Dev
1. Measures of concordance
With the same notation as in the Introduction, the general idea of construc-
ting a measure is to develop a notion of deviation between F and the product
measure FI"/:2. Thus for the fixed marginals if F deviates more, the measure
would be higher. Usually for every such a concept, there is a population
version and a sample version of a measure. In most cases the population
version is either the mean or the asymptotic mean of the sample version. We
begin with the
(a) Spearman's rank correlation coefficient. This is one of the oldest measure
based on ranks. Spearman proposed this in 1904 and the idea came from
Francis Galton's use of the 'grades' (or the percentiles) of a distribution. Let Ri
and & denote the ranks of X~ and Y~ among X and Y observations respec-
tively and write
di=Ri-&.
The initial proposal of Spearman was to use E [d/l, now known as Spearman's
'foot rule'. However, due to mathematical convenience, the product moment
correlation between Ri and &, namely
0 02
190.12 I~(X, y : 0.12) = ~ ~/(X, y : Or12) , (1.4)
0 6
00.12 psy ( N ) - 7rk / 4 - 2 2 '
and hence
6
Ps,e ( N ) = -- arcsin(o'12/2). (1.5)
"iT
[ 1 if x > 0 ,
s(x)= ~l i f x = O , (1.6)
-1 ifx<O.
Then
(1.7)
and hence
1
z = n ( n - 1) ~'~ ~" s(Xi - Xi)s(Yi - Yj). (1.9)
i#]
0 2
Ocq2 rp(g12) = 4O(0, O, aq2) = ~r~/1 - 0"122 (1.10)
Hence
2
Tp (0"12) = -- arcsin 0"12. (1.11)
and Kendall's z, they also studied a metric I(zr, o-) between two permutations zr
and o', obtained by considering the minimum number pairwise transpositions
required to achieve certain alignment. They obtained some interesting in-
equalities for these metrices.
Some multivariate analogs of the above measures have been studied in the
literature. Ehrenberg (1952) did some numerical comparisons of the following
two measures of concordance between m judges who were asked to rank n
objects. The first measure is obtained by taking an average of the ('~) rank
correlations while the second is obtained by using Kendall's ~- instead of the
rank correlation. Hays (1960) studied sampling distributions of the above
measures and made a numerical comparison with some h"2 approximations.
More recently Alvo, Cabilio and Feign (1982) made the asymptotic study of
these measures and showed that the average 7 is superior (asymptotically) to
average rank correlation coefficient.
(c) Blomquist's q. Blomquist (1950) proposed a measure which is similar in
the spirit as 'tau', where symmetric differences Xi - Xj and Yi - Yi are replaced by
the deviations from the sample medians X', and Y, respectively. Thus
n l - n2 (1.13)
q - nl + n2"
Such a measure was also considered by Mosteller (1946), who called it as one
of the 'useful inefficient statistics'. The measure can be construed as the
bivariate version of the so called 'median test' for testing the hypothesis
whether the two random samples came from the same population.
The population analog of q is clearly
The evaluation of this for the case of the bivariate normal distribution is the
same as given in (1.10) and hence 2/(7r arcsin (rlz).
(d) Hoeffding's A. The following measure, proposed by Hoeffding (1948b), is
similar to the well known notion of the 'distance' between two distribution
functions suggested by Kolmogorov, Smirnov, CramSr and yon Mises.
where the sum is taken over all 5 tuples with a~ = 1 , . . . , n, a ~ aj; for i~ j and
nPk is same as n !/(n - k)!. Although, the expression in (1.15) has an advantage
of being recognized as a U-statistic, Hoeffding (1948b) showed that an alter-
native expression in terms of ranks can be given as follows. Let
2
A = ~ (R, - 1)(R, - 2)(S~ - 1)(S~ - 2), B = ~] (R~ - 2)(Si - 2)T~,
1 1
C = ~] T~(T~ - 1),
1
or the n u m b e r of pairs in the lower quadrant with the vertex (Xi, Y~). T h e n
Blum, Kiefer and Rosenblatt (1961), while studying the asymptotic behaviour,
p r o p o s e d a statistic asymptotically equivalent to D,,
n
B . = - ~ ~ , [ N , ( i ) N 3 ( i ) - N2(i)N4(i)l 2 ,
i=1"=
where N1, Nz, N3, N4 are the n u m b e r of pairs residing in the four quadrants of
the plane f o r m e d at (X/, Y/).
Measuresof dependence 85
It should be noted that unlike the measures defined in (a), (b), (c), the
measures (1.14) or (1.15) due to its similarity with the notion of distance, do not
take negative values.
Other distance measures can be easily constructed by replacing dF(x, y) in
(1.14) by dF(x)dF(y). Other notions such as Kolmogorov-Smirnov distance
could also be used for measuring the deviation.
cov(X, Y) >/0,
The most commonly used yardstick for measuring the dependence between
two variables is the product moment correlation coefficient. It is well known
that this measures linear dependence. If the goal is to measure functional
dependence, such as Y-= g(X), or in general, whether h(X, Y) equals con-
stant, then correlation coefficient is certainly unsuitable. The standard example
where the mass is uniformly distributed on xZ+ y2= 1 shows that the cor-
relation coefficient can be 0 while a perfect functional dependence exists.
There have been several articles, where one starts with certain desirable
conditions on the measure of dependence and then examining how some of the
measures commonly used live up to those conditions. By modifying some of
these conditions some authors have proposed new measures. Hall (1970) has
given an excellent survey of the developments in this direction and has
proposed a new measure having different desirable properties. We give a brief
outline of these results.
The following is a list of the conditions which one would like a measure of
dependence ~:(X, Y) to satisfy. The systematic search of a measure via such
conditions was first made by Renyi (1959).
(a) ~(X, Y) = ~( Y, X).
(b) 0 ~ < 1 .
(c) ~ = 0 if and only if X and Y are independent.
(d) ~ = 1 if either X = g(Y), almost surely, or Y = g(X) almost surely, for
some measurable function.
(e) ~:(f(X), g(Y))<~(X, Y).
(f) If (X, Y) has a bivariate normal distribution then sc agrees with the
absolute value of the correlation coefficient.
In many instances the condition (a) may be undesirable. This is clear if one
Measuresof dependence 87
References
[1] Alvo, M., Cabilio, P. and Feign, P. D. (1982). Asymptotic theory for measures of concordance
with special reference to average Kendall tau. Ann. Statist. 10, 1269-1276.
88 Kumar Joag-Dev
[2] Bhuchongkul, S. (1964). A class of nonparametric tests for independence in bivariate popu-
lations. Ann. Math. Stat. 35, 138-149.
[3] Blomquist, N. (1950). On a measure of dependence between two random variables. Ann.
Math. Stat. 21, 593-600.
[4] Blum, J. R., Kiefer, J. and Rosenblatt, M. (1961). Distribution free tests of independence
based on the sample distribution function. Ann. Math. Star. 32, 485-498.
[5] Diaconis, P. and Graham, R. L. (1977). Spearman's footrule as a measure of disarray. J. Roy.
Statist. Soc. Ser. B. 39, 262-268.
[6] Ehrenberg, A. S. C. (1952). On sampling from a population of rankers. Biometrika 39, 82-87.
[7] Esscher, F. (1924). On a method of determining correlation from the ranks of the variates.
Aktur. tids. 7, 201-219.
[8] Fr6chet, M. (1951). Sur les tableux de corr61ation don les marges sont donn6es. Ann. Univ.
Lyon Sect. Ser. 3. 14, 53-77.
[9] Gebelein, H. (1941). Das Statistiche Problem der Korrelation als Variations und Eigenwert-
problem und sein Zusaminenhang mit der Ausgleichungsrechnung. Zeit. fiir Agnew. Math.
und Mech. 21, 364-379.
[10] Hall, W. J. (1970). On characterizing dependence in joint distributions. In: Essays in
Probability and Statistics, Univ. of N.C. Press. 339-376.
[11] Hays, W. L. (1960). A note on average tau as a measure of concordance. J. Amer. Statist.
Assoc. 55, 331-341.
[12] Hoeffding, W. (1940). Masstabinvariante Korrelationstheorie. Schriften des Math. Inst. des
Inst. fiir Ange. Math. der Univ. Berlin 5, 179-233.
[13] Hoeffding, W. (1948a). A class of statistics with asymptotically normal distribution. Ann.
Math. Stat. 19, 293-325.
[14] Hoeffding, W. (1948b). A nonparametric test of independence. Ann. Math. Stat. 19, 546-557.
[15] Hotelling and Pabst (1936). Rank correlation and tests of significance involving no assump-
tions of normality. Ann. Math. Stat. 7, 29-43.
[16] Kendall, M. (1938). A new measure of rank correlation. Biometrika. 30, 81-93.
[17] Kimeldortt, G. and Sampson, A. R. (1978). Monotone Dependence. Ann. Stat. 6, 895-903.
[18] Lehmann, E. L. (1966). Some concepts of dependence. Ann. Math. Stat. 37, 1137-53.
[19] Lindeberg (1925). Uber die Korrelation. Skand. Mathematiker kongres i Kobenhaun. 1,
437-446.
[20] Lindeberg (1929). Some remarks on the mean error of the percentage of correlation. Nordic
Stat. J. 1, 137-41.
[21] Linfoot, E. H. (1957). Inf. Control. 1, 85-89.
[22] Mosteller, F. (1946). On some useful 'inefficient' statistics. Ann. Math. Stat. 17, 377.
[23] Plackett, R. L. (1954). A reduction formula for normal multivariate integrals. Biometrika 41,
351-360.
[24] Renyi, A. (1959). On measures of dependence. Acta-Math. Acad. Sc. Hung. 441-51.
[25] Saramonov, O. V. (1958a). Maximum correlation coefficient (symmetric case). Dokl. Akad.
Na. SSSR 120, 715-718.
[26] Saramonov, O. V. (1958b). Maximum correlation coefficient (nonsymmetric case). DokL
Akad. Na. SSSR 121, 52-55.
[27] Slepian, D. (1962). The one sided barrier problem for Gaussian noise. Bell. Syst. Tech. J. 41,
463-501.
[28] Yanagimoto, T. and Okamoto, M. (1969). Partial orderings of permutations and monotonicity
of a rank correlation statistic. Ann. Inst. Stat. Math. 21, 489-506.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4
Elsevier Science Publishers (1984) 89-111 J
Gouri K. Bhattacharyya
I. Introduction
89
90 Gouri K. Bhattacharyya
F is a fixed df and 01 ~ ' ' " ~ ON are unknown location parameters. A linear
regression structure 0i = c~ +/3b~ is a further special case where the b/s are
known and increasing constants, and a and/3 are unknown parameters. The
null hypothesis corresponds to/3 = 0, and the alternative is/3 > 0.
A fairly large number of nonparametric tests for trend have been proposed
in the literature. However, they can be grouped into two broad classes, albeit
with some overlap. In the first, the tests are motivated from heuristic reasoning
where simplicity of the procedure and ease of computation are the prime
considerations. These tests are based on the signs of the differences of some or
all pairs (Xi, Xj). Letting Uq = 1 (0) according as X / < (>) Xj, the test statistics
have the general form
c~jUo (2.1)
l<~i<j<~N
where the nonnegative constants Q are viewed as the weights attached to the
signs of (Xi - X i ) . Some prominent members of this class are
M= Uo, D= Z (j-i)U~j,
l<<.i<j<~N l<~i<j<_N
N/2 N/3 (2.2)
CS1 : E (N - 2i + 1)Ui,N-i+l, CS2 : E Ui,2N/3+i.
i=1 i=1
The first two are due to Mann (1945) and Daniels (1950), respectively. The last
two are among a subclass of tests studied by Cox and Stuart (1955) where the test
statistic is of the form (2.1) with the added requirement that only the disjoint
U0's be involved, that is, c0ei,j,= 0 whenever i, j, i' and j' have any element
repeated.
The null distributions of these statistics derive from the probability structure
that all N ! orderings of Xa . . . . . XN are equally likely. Moreover, being linear
functions of the indicator variables Uq, their exact means and variances are
easy to calculate, and they are asymptotically normal under mild conditions on
the eq's. In particular,
Kendall'stau: T=(2M-N)/(N),
N+I',
Spearman's rank correlation: rs = [12/(N 3 - N)] ~ i----T-)RNi,
i=l
92 Gouri K. Bhattacharyya
Tables of critical values of r and rs are available, and these can be used to
perform the M and D tests for trend. Cox and Stuart (1955) motivated the tests
CS1 and CS2 by the features that these are 'simplified versions' of D and M,
respectively, and they do not incur a substantial loss of asymptotic efficiency. In
fact, their Pitman asymptotic relative efficiencies (ARE) with respect to D are
0.87 and 0.84, respectively, for the normal linear trend model.
The tests in the other class are based on the linear rank statistics
N
TN = ~'~ (CNi -- gN)aN(RNi) (2.3)
i=I
where U ) < -.. < --NlftN)denote the order statistics of a random sample of size
N from the uniform (0, 1) distribution, and f'(x) = df(x)/dx. Following
Theorem II.4.8 of Hfijek and Sid~ik (1967), the locally most powerful (LMP)
rank test of H0:/3 = 0 rejects the null hypothesis for large values of the statistic
N
T?v = ~'~ (bi - bN)aN(RN,, f ) (2.4)
i=1
which obviously belongs to the class of linear rank statistics defined in (2.3). In
particular, if the trend is linear in time (bi = i), the LMP rank tests for the
logistic and normal parent distributions are respectively given by
where W () are the order statistics from the standard normal df 4. The first
expression in (2.5) is a linear function of rs and hence of D as well. Con-
sequently, Daniel's test is the LMP rank test for a linear trend in location when
the underlying distribution is logistic. The second statistic in (2.5) gives the
normal scores test for trend.
Tests for randomness against trend or serial correlations 93
In regard to the asymptotic efficiencies of the tests for trend, some scattered
results were only available in the early literature confined especially to the
normal distribution (cf. Cox and Stuart, 1955, and Stuart, 1956). Evidently, a
rank test of the form (2.3) involves a choice of the regression constants cng and
the score function an('). In the case of a location-trend model, its power
properties would depend upon the underlying distribution F as well as the
rapidity of the growth of the trend. Aiyar et al. (1979) provide a comprehensive
study of the effects of these factors upon the Pitman A R E ' s of various trend
tests including the linear rank test (2.3) and some relatively important members
of the class (2.1). Considering the local alternatives F d x ) = F ( x - f l n b i ) such
that for some A > 0 , O<limn_,~NaflN < ~ , and assuming some regularity
conditions on F, b~ and cni, the A R E of the test TN relative to the asymptotic-
ally most powerful test T } is given by
Note that Daniels' statistic D can be written in the two equivalent forms
12
LN -- N (N2 - 1) ~
N (i_____ll)RNi
N 2+
/3, = sup{b: LN (x - be) > 0}, /3" = inf{b: LN (x - by) < 0},
y~= wj toj, i = 1. . . . . k ,
j=l
define the integer m by the requirement that y,,-1 <~< Ym. Then
i l L = Ym if T i n > l ,
(2.8)
= (Y,, + Y,.+1)/2 if y,, = .
Thus /3L is the weighted median obtained from the ratios (Xj - X i ) / ( j - i) with
the associated frequency counts ( j - i ) . It has an interesting similarity with
the usual least squares estimator which is the weighted mean of the ratios
(Xj - Xi)/(J' - i) with the corresponding weights (j - i).
In a similar manner, one can also derive the H o d g e s - L e h m a n n estimator of
13 from the test statistic M. The resulting estimator /3M turns out to be the
(unweighted) median of the ratios ( X j - X i ) / ( j - i ) . For the more general
situation where F~(x)= F ( x - I x - / 3 t ~ ) and the ti's are distinct but not neces-
Tests for randomness against trend or serial correlations 95
Such median estimators of the slope of a line were originally proposed by Theft
(1950) on intuitive ground. Sen (1968) and Bhattacharyya (1968) highlight their
relation to the test statistics M and L, and derive some distributional proper-
ties. The A R E of these estimators relative to the least squares estimator is the
same as the A R E of the M (or L) test relative to the t test. In particular, when
Fi(x) = F ( x - fli), the A R E has the lower bound 0.953 for all continuous F, and its
value is 0.985 when F is normal.
b=[12/(N3-N)]~_,(i-~)Xi
i=1
obtained by ranking each row of X separately. The basic ingredients for the
construction of a generalized D a n i e l s - S p e a r m a n statistic are
Note that L~N and O~e are the Spearman rank correlations for the pairs (i, X~i),
i = 1. . . . . N and (X~i, Xt3i), i= 1. . . . . N, respectively. Under the null hypo-
thesis H0: FI . . . . . FN and conditionally given the collection of the column
vectors of Ru, all N! permutations of the columns are equally likely. From this
conditional or permutation probability measure, one can verify that the mean
vector and the covariance matrix of LN are 0 and (N - 1)ON, respectively. In
analogy with the normal theory test statistic WN, the p-variate Daniels-
Spearman statistic is then defined as the quadratic form
L~ ) = ( N - 1)LkQ~ILlv (2.11)
with its large values leading to the rejection of H0. Unlike the univariate
situation, the rank statistic L~ ) is not unconditionally distribution-free under
H0. However, a strictly distribution-free test is obtained by performing the test
conditionally on the observed configuration of RN, that is by referring to the
permutation distribution of Luo). Note that the column permutations of RN only
affect LN while the matrix QN remains invariant. If ON is singular, a general-
ized inverse is to be used in (2.12) in place of the regular inverse. The choice is
of little consequence in large samples because the probability of ON being
singular tends to zero as N ~ o0 whenever the parent distribution F(x) does not
degenerate to a dimension less than p.
As N ~ % the permutation distribution of L~ ) converges weakly to X~, the
central chi-squared distribution with p degrees of freedom. So, in large
samples, an approximate level c~ test is given by the rejection region L~ ) >/c
where c denotes the upper c~-point of X 2. The permutation test is asymptotic-
ally power-equivalent to this unconditional test, and for this reason, both will
be referred to as the L~ ) test.
The L~ ) test remains consistent for a wide class of general trend alternatives
where the trend in at least one variable does not damp out too fast.
Specifically, the property of consistency holds for all sequences {Fu} such that,
for some rl < and some 1 ~< ~ ~<p,
A,~t~= 12 ~ f~ F~(x)Ft3(y)dF~,~(x, y ) - 3.
Tests for randomness against trend or serial correlations 97
where U~ ) are order statistics from the uniform (0, 1) distribution. Then the
rank test, that maximizes the average power fi(A) uniformly for 3 in a positive
neighborhood of 0, rejects for large values of the statistic
N
T~-- ~'~ (O~- ON)au(RN~, f ) . (3.2)
i=l
Comparing (3.2) with (2.4) we observe that the LMP rank test for the trend
problem has a similar structure as that of the optimal rank test for the change
problem of this section. However, the regression constants b~ in the former
relate to the assumed trend model. In the latter, the analogue of the regression
constants are the terms Q~ which relate to the weights used to average the
power. For the special case of uniform weights qi = 1/(N - 1), i = 2 . . . . , N, the
test statistics
N N
To) = ~ ( i - 1)RN,, T(2) = ~ ( i - 1)E~(W (RN~))
i=1 i=1
correspond to the choices of the logistic and normal scores, respectively. Note
that T(1) is equivalent to the Daniels-Spearman test. Moreover, for the
degenerate weight function qm = 1, qi = 0, iS m, T(1) and T(2) reduce to the
two-sample Wilcoxon and normal scores tests, respectively.
Motivated by (3.2) one can consider a general rank statistic of the form
N
TI~ = ~, (Oi - O~)aN(RNI) (3.3)
i=l
N
lim ~ ( Q i - O u ) / N = a < ~ ,
N~ i=m
N
lim Z ( O l -- Q N ) 2 / N = c 2, 0 < c 2 < oo
N - ~ i=1
Tests for randomness against trend or serial correlations 99
then the limiting distribution of N-1/2TN (under kN) is N(/z, c2d 2) where
In the setting of a normal distribution with known o-, Chernoff and Zacks
(1964) propose the test ZN = Z ( i - 1)(Xi- )~) from the likelihood ratio cri-
terion. Confining to the uniform weights, and using the above results and
asymptotic normal distribution of ZN, it follows that the A R E of TN relative to
ZN is the same as the A R E of the corresponding two-sample rank test relative
to the t test. In particular,
where U o = 1 (0) according as Xi < (>) Xj. Using a heuristic argument, Pettitt
states that N-I{3/(N + 1)}l/ZBu is asymptotically distributed as the Kolmogorov
goodness of fit statistic, and recommends the use of the Kolmogorov-Smirnov
table. On the other hand, Sen and Srivastava (1975) only provide some
estimated critical values based on Monte Carlo Simulation. Asymptotic pro-
perties of these tests remain to be investigated.
We digress for a moment to remark that a somewhat different testing
problem arises when the initial level of the process is known. Let 320 denote the
process at the initial time t = 0, and assume that the df F0 of X0 is symmetric
100 Gouri K. Bhattacharyya
with the known center P.o. Based on the later observations XI . . . . . XN, we wish
to test if the center has shifted above #0 at some time point m (1 ~< m ~< N). For
this problem, Page (1955) proposes a nonparametric test based on the statistic
i-1
j=l
where U/i---1 (0) according as Xj < (>)X~. Although the same symbol R is
again used for these new ranks, we stress that the distributional properties of
the sequential ranks are altogether different from those of the ordinary ranks.
In particular, the sequential ranks of iid random variables are themselves
independent random variables with Ri uniformly distributed on the integers
{1 . . . . . i} so E(R~) = (i + 1)/2 and Var(Ri) = (i 2 - 1)/12. Let
Z~=i_ 1 R _ i 1 Vk = ~, Z i
i~l
(3.7)
VN (t) = (12/N)1/2{ g[Nt I + (Nt - [Nt]) VW,>,}, 0~<t~<l.
where B(t) is the standard Brownian motion process, 8 = lim N~/2f ( F - )dGN
and I(0, t) = 1 (0) according as 0 <~ (>) t. Since 6 = 0 under the null hypothesis
of no change, and since
the control limit is given by c = q~-~((1 - a)/2) where q) is the standard normal
df. The nonparametric detection scheme based on the sequential ranks is then
asymptotically equivalent to the
Logarithmic stopping rule: Stop at the smallest t for which B(t)>! q,(t) where
parametric detection scheme relative to the control chart based on the cumula-
tive sums. The asymptotic comparison is effected by studying the functions 0(t)
and qJ*(t), defined in (3.8) and (3.9), for some important models for change
such as a location shift, scale change, or a contamination of two distributions.
The behavior of the function h ( O ) = [d~O(O)/dO][dO*(O)/dO] -1 turns out to be
crucial in determining whether or not one method is superior to the other.
Interestingly, in the case of a location shift, h2(0) iS the same as the Pitman
A R E of the Wilcoxon test relative to the t test for the two-sample problem. It
is found that for distributions with heavy tails, the nonparametric scheme is
superior to the scheme based on the cumulative sums both in the sense of
asymptotic power and the expected stopping time.
N-1
r, = ~ . (X~ - 2 ) ( X ~ + , - 2 ) I S 2 (4.1)
i=1
where
N N
2=~'~Xi/N and S2=~(X~-2) 2.
i=1 i=l
N-1
Ta = x,x,+,. (4.2)
i=1
I,~a = N - I ( A ~ - A2),
Also, if F has a finite moment of the order 4 + 6 for some 6 > 0 , then the
unconditional limit distribution of (7"1-/Za)/O'a is N(0, 1).
REMARK. Permutation tests based on the serial correlations of lag h/> 1 were
proposed by Wald and Wolfowitz (1943) for the problem of detecting a trend
or cyclic movement in a series. However, these tests have extremely low
asymptotic efficiency for the alternatives of location trend. Indeed, they are
104 Gouri K. Bhattacharyya
more sensible tests to detect a serial dependence than trend, and for this reason
they are included in the present section.
12(N3- N) -1 ~ Ri
N+I)(Ri+l N+I)
2 2 "
N-1
72 = E RtRi+I (4.5)
i=1
More general serial rank statistics can also be constructed by using some
scores in place of the ranks. For instance, a van der Waerden type normal
scores statistic for serial correlation is
The aforementioned tests draw from intuitive reasoning in which the normal
theory test is used as the starting point, and then the distribution-free property
is attained by means of either a permutation argument or use of the ranks.
Gupta and Govindarajulu (1981) derive a locally most powerful (LMP) rank
test for the autoregressive process
where E / s are iid N(0, 0"2), hi(p) are nondecreasing and hi(0)= 0. For the
simple special case k = 1, the LMP rank test is based on the statistic
N-1
T4 = .~'~ aN(Ri, Ri+I) (4.7)
i=l.
where aN(a, ~ ) = E ( W ~ ) W ~ )) and W(~ denotes the order statistics from the
Tests for randomness against trend or serial correlations 105
2 1N N
Var(Z4) = N~-I -]- N-/a~__ 1 ~__1 a 2(oil,/~) - -N -N + l ~ ] a ~ ( ~ ' a ) } l
ot=l
The statistic
N-1
T ] = ~, aN(Ri)aN(Ri+l),
i=1
where g ( x ) = q~[~iD-l(F(x))] and q~(x) is the pdf of N 0 , 1). For all F satisfying
the regularity conditions, we have e3(F)/> 1, and the lower bound 1 is attained
by F = 4~. Also, e2(F)~> 9~4/1024-~ 0.8561.
K n o k e (1977) reports an extensive Monte Carlo study to examine the
adequacy of the normal approximation under the null hypothesis and to
compare the power of these tests for the first and second order models under
the normal as well as some non-normal parent distributions. A few other tests
which are not strictly distribution-free are also included in this study. Overall,
the rl test is found to be fairly robust and generally more powerful than 7"2
except for extreme departures from normality.
All the preceding rank tests of this section are applicable when the null
hypothesis of randomness only specifies that the random variables X1 . . . . . XN
are iid with a continuous df F. It is often the case that the investigator has
knowledge about the level of the process so the median of F can be assumed
known. If, in addition, it is reasonable to assume that F is symmetric, then an
altogether different class of nonparametric tests holds considerable promise for
reason of its certain elegant features.
To describe these tests we now assume that the random variables X~ . . . . . XN
have symmetric and continuous (possibly non-identical) df's F~. . . . . FN, res-
106 Gouri K. Bhattacharyya
where, in the first expression of (4.10), u ( z ) = 1 (0) if z > (~<)0, R+,i is the rank
of Iz, I among Izd . . . . . Iz.I, and a+~(-) denotes a score function. In the second
expression, V~ = 1 (0) according as the absolute rank i is associated with a
positive (negative) Z. Under the null hypothesis the distribution of a one-
sample linear rank statistic holds in this case even though the Z~'s are
dependent. In fact, the Zfls are symmetrically distributed around O, and
(u(Z1) ..... u ( Z n ) ) and R + are independent. Consequently,
max
l<<.i~n
+2( -
an .l
.=
+2
an ,q
l -->0.
The principal advantages of Dufour's tests are their simplicity, rapid con-
vergence to normality, and the availability of extensive tables for some special
cases. In addition, the validity of these tests does not require that X1 . . . . . Xu
be identically distributed. On the other hand, the assumption of symmetric
distributions with a common known median is a serious limitation.
An exact level a test can be based on the conditional distribution (5.1) with the
rejection region set on the lower tail.
Lehmann (1959) demonstrates a strong optimality property of this run test in
the context of a stationary Markov chain which is defined by the structure
P[Xi = 1] = Po/(Po4-ql)
where P0 and p~ are the same for all i. With this model, the probability of any
realization (Xl. . . . . xN) is given by
po+ ~ X-l~v~n-v~u~m-u
tlU k'OF1 t/lt/0
where u and v are the numbers of runs of O's and l's, respectively. The null
hypothesis of independence is equivalent to Ho:Po=Pl. Because of an
exponential-family structure of the likelihood, a uniformly most powerful
unbiased test exists for testing/4o versus/-/1: po<pl, and the test is based on
the statistic ( U + V ) = R, conditional on m. This is precisely the run test
described above. U n d e r the null hypothesis, R is asymptotically normal with
mean 2 N A ( 1 - A) and standard deviation 2 A ( 1 - A)N 1/2 where A = m/N. So a
normal approximation can be used to perform the run test in large samples.
We now return to the problem of testing randomness in a series XI, , XN
where the Xfls, instead of being dichotomous, are continuous variables. T h e
run test can be adapted to this situation once we devise a reasonable m o d e of
converting the continuous m e a s u r e m e n t s to a dichotomy. One natural criterion is
the sample median X, and we write 1(0) for an Xi according as X~> (<)X. Then the
run statistic R counts the n u m b e r of runs above and below the median of the
original sequence. Its null distribution is given by
Tests for randomness against trend or serial correlations 109
( [ ~ 2 ] - 1 ) 2 / ( [ N ])
PIR = 2k] = 2 1 N/2 '
where [N/2] denotes the integer part of N/2. For large N, R is approximately
normal with mean = N/2 and standard deviation = N1/2/2.
By introducing a dichotomy with reference to the median of X1, . , XN one
is prone to forfeit useful information regarding the pattern of progression of the
series. On the other hand, such information is captured more by the differences
(X/+I- X~) of the successive members. Another class of run statistics arises
when we consider the signs of these differences. This species of runs is quite
different from the runs above and below the median, especially in regard to the
distribution theory.
By assuming continuous distributions we ignore the possibility of ties among
X~. . . . . XN, and define the vector of signs of the successive differences
TK,, = Z2K + Z~
References
Hfijek, J. and Sidfik, Z. (1967). Theory of Rank Tests. Academic Press, New York, and Academia,
Prague.
Hodges, J. L., Jr. and Lehmann, E. L. (1963). Estimates of location based on rank tests. Annals of
Mathematical Statistics 34, 598-611.
Kander, Z. and Zacks, S. (1966). Test procedures for possible changes in parameters of statistical
distributions occurring at unknown time points. Annals of Mathematical Statistics 37, 1196-1210.
Knoke, J. D. (1977). Testing for randomness against autocorrelation: alternative tests. Biometrika
64, 523-529.
Lehmann, E. L. (1959). Testing Statistical Hypotheses. Wiley, New York.
Levene, H. (1952). On the power function of tests of randomness based on runs up and down.
Annals of Mathematical Statistics 23, 34-56.
Levene, H. and Wolfowitz, J. (1944). The covariance matrix of runs up and down. Annals of
Mathematical Statistics 15, 58-69.
Mann, H. B. (1945). Nonparametric tests against trend. Econometrica 13, 245-259.
Moore, G. H. and Wallis, W. A. (1943). Time series significance tests based on signs of differences.
Journal of the American Statistical Association 38, 153-164.
Noether, G. E. (1950). Asymptotic properties of the Wald-Wolfowitz test of randomness. Annals of
Mathematical Statistics 21, 231-246.
Page, E. S. (1955). A test for a change in a parameter occurring at an unknown point. Biometrika
42, 523-526.
Pettitt, A. N. (1979). A nonparametric approach to the change-point problem. Applied Statistics 28
(2), 126-135.
Sen, P. K. (1968). Estimates of the regression coefficient based on Kendall's tau. Journal of
American Statistical Association 63, 1379-1389.
Sen, A. and Srivastava, M. S. (1975). On tests for detecting change in mean. Annals of Statistics 3,
90-108.
Stuart, A. (1956). The efficiencies of tests of randomness against normal regression. Journal of the
American Statistical Association 51, 285-287.
Theil, H. (1950). A rank-invariant method of linear and polynomial regression analysis. Proc. Kon.
Ned. Akad. v. Wetensch. A. 53, 386-392, 521-525, 1397-1412.
Wald, A. and Wolfowitz, J. (1943). An exact test for randomness in the nonparametric case based
on serial correlation. Annals of Mathematical Statistics 14, 378-388.
Wolfowitz, J. (1944). Asymptotic distribution of runs up and down. Annals of Mathematical
Statistics 15, 163-172.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4
Elsevier Science Publishers (1984) 113-121 t./
J. L e r o y F o l k s
1. The setting
We have obtained k independent test statistics T~ for the null hypotheses H0i,
i = 1, 2 . . . . . k. For each test the significance level Li (also called the P-value or
observed significance level) is given by an upper tail probability. That is,
113
114 J. Leroy Folks
In all of these examples, we usually assume that the test statistic is con-
tinuous (approximately so for the contingency table chi square). In cases like
these, the significance levels L~ are independent uniform (0, 1) random vari-
ables when the H0i are true. With two-sided chi square and F tests, for
example, this will not be true.
2. Combination methods
m 1 - L(,,)'~
L=P X~k_m+ 1 L(m) ]
( k )
L = P X2k ~> - 2 ~ log Li .
i=1
Fisher made it clear that the test was based upon the product II~L~ of
significance levels. Pearson (1933) proposed what appears to be the same test
but suggested that Li should be the minimum of the right and left tail areas.
Once this defect was corrected (David, 1934), Fisher's test and Pearson's test
were the same. Even today some writers refer to Fisher's test as Pearson's Px
test.
S, = r~(1 - L,)
( k ))
L=P S~EF-(1-L,
i=l
where S ~ F(Eai, ).
The choice of a ' s is entirely arbitrary and can be done to weight the L's
differently. If ai is integer valued then S ~ - X 22a i" Even if all the a~ are not
integer valued but Z~i is, then S - X 22]~oti and the combined significance level is
given by
k
x 2 r;](1 - .
i=1
k
2 '
i=1
(
L : P Z ~> ~'~ aiqO-l(1 - Li)/X/
i=1
.
This gives another way of weighting the Li unequally. Previously Stouffer et al.
(1949) had suggeSted using the unweighted sum
k
L = ~ A i e -wIll
i=1
where
Ai = a~ -1 a i - aj).
,=
Zelen (1957) was interested in combining the inter and intrablock F tests for
equality of treatments in an incomplete block design and proposed weights 1
and a2. Zelen and Joel (1959) determined that the type II error is minimized
over a wide range of parameters when a2 is proportional to the ratio of
intrablock and interblock variances.
2. 7. George-Mudholkar method
George and Mudholkar suggest transforming the Li to logistic variables by
using the transformation Yi = - l o g [ L i / ( 1 - L i ) ] . Then each Y/ is a logistic
variable and a probability statement can be made concerning the sum EYe.
They propose the approximation
i
Combination of independent tests 117
L=e(z-<
where Z is a standard normal variable.
3. Example
We now illustrate the combination of L's for the methods just discussed with
the following example:
Fisher-Pearson method
L=P(x2>~-2~logLi)=P(x2>~ 14.9751)=0.020451.
i=1
Lancaster method
Li 1- Li Ol i F ,1(-12, a i (1 - Li)
0.07 0.93 2 8.66
0.20 0.80 1 3.22
0.04 0.96 3 13.20
Sum 25.08
L = P(X{z I>25.08)= 0.014448.
118 Jr. Leroy Folks
Liptak-Stouffer method
Good-Zelen method
George-Mudholkar method
L=P(tsk+4)>-rr~'~log~)=O.O17297.
i
Edgington method
/2-0.50 610.31_0.50],
~/1/(12k) L 3
4. Comparison of methods
Extensive studies of power have been made when the distribution of the T~'s
is assumed. Bhattacharya (1961) and Koziol and Perlman (1978) assumed that
the T~'s were independent X2 variables. They found that Fisher's test performed
well over a wide range of parameter values and that the Liptak-Stouffer
inverse-normal method performed very poorly. Marden (1982) obtained
theoretical results proving that under the chi-square assumption, Fisher's
method and Tippett's method are admissible. The Liptak-Stouffer method,
George-Mudholkar method and Edgington method were shown to be inad-
missible. It was conjectured that Lancaster's method was admissible.
Zelen and Joel (1959) and Pape (1972)suggest using the G o o d - Z e l e n method
for combination of tests when the T~'s are F statistics. Monti and Sen (1976)
present combinations of F statistics with locally optimum properties. Marden
(1982) showed that Fisher's method is admissible if and only if all degrees of
freedom are greater than one and that Tippett's method is always admissible.
Oosterhoff (1969) found a complete class of tests in a problem involving the
combinations of nonindependent noncentral t-tests.
Littell and Folks (1971) showed that Fisher's test was optimal in the sense of
Bahadur efficiency. Essentially this means that as the sample sizes for the T~'s
go to infinity the overall level, L, converges to zero under the alternative as fast
or faster than any other test. Berk and Cohen (1979) showed that there are
many methods which are Bahadur efficient, e.g. Lancaster's method and the
G e o r g e - M u d h o l k a r method. On the other hand the G o o d - Z e l e n method is not
Bahadur efficient.
Cohen, Marden and Singh (1982) study second-order optimality properties of
classes of statistics which are Bahadur efficient. If both Tx and 7"2 are Bahadur
efficient, consider D = log L 1 - log L2. Then T2 is said to be more efficient
(second order) than Ta if under the alternative hypothesis the limit of D as
n -->~ is positive with probability one. If the limit in probability is greater than
zero, T2 is said to be weakly superior to 7"1. Under some weak conditions, it is
shown that the statistic based on normal transforms has the greatest second
order efficiency.
5. Other methods
Several authors have assumed that not only the levels but the statistics, T~,
and their distributions are available. Van Zwet and Oosterhoff (1967) consider
the case where the T~'s are asymptotically normal and use the corresponding
asymptotically optimal methods for the small-sample case.
Littell and Louv (1981) consider inversion of combined tests as a way of
generating confidence intervals.
Several authors consider methods, particularly Bayesian methods, other than
the ones described in this paper.
Finally, we should mention that Lancaster (1949) and E. S. Pearson (1950)
studied the effect of discreteness upon the combination of tests.
120 J. Leroy Folks
6. Summary
Fisher's method is strongly supported by the literature. It has good power for
a large set of alternative parameter values. Tippett's method has good power
against alternatives for which a few of the null hypotheses H01, H02. . . . . Hok
are false but not for which many are false.
For almost all problems studied, Fisher's method and Tippett's method are
admissible.
References
Berk, R. H. and Cohen, A. (1978). Asymptotically optimal methods of combining tests. J. Amer.
Stat. Assoc. 74, 812-814.
Bhattacharya, N. (1961). Sampling experiments on the combination of independent x2-tests.
Sankhya, Set. A 23, 191-196.
Birnbaum, A. (1954). Combining independent tests of significance. J. Amer. Stat. Assoc. 49,
559-574.
Cohen, A., Marden, J . I. and Singh, Kesar (1982). Second order asymptotic and non-asymptotic
optimality properties of combined tests. J. Statist. Plann. Inference 6, 253-256.
David, F. N. (1934). On the P,, test for randomness: remarks, further illustration, and table of P*n
for given values of -Iogl0An. Biometrika 26, 1-11.
Edgington, E. S. (1972). A normal curve method for combining probability values from in-
dependent experiments. J. Psych. 82, 85-89.
Fisher, R. A. (1932). Statistical Methods for Research Workers. Oliver and Boyd, Edinburgh, 4th
Edition.
George, E. O. and Mudholkar, G. S. (1977). The logit method for combining independent tests.
Inst. Math. Stat. Bull. 6, 212.
Good, I. J. (1955). On the weighted combination of significance tests. J. Roy. Stat. Soc. Set. B 17,
264-265.
Koziol, J. A. and Perlman, M. D. (1978). Combining independent chi-squared tests. J. Amer. Stat.
Assoc. 73, 753-763.
Lancaster, H. O. (1949). The combination of probabilities arising from data in discrete dis-
tributions. Biometrika 36, 370-382.
Lancaster, H. O. (1961). The combination of probabilities: An application of orthonormal
functions. Austral. J. Statist. 3, 20-33.
Littell, R. C. and Folks, J. L. (1981). Confidence regions based on methods of combining test
statistics. J. Amer. Statist. Assoc. 76, 125-130.
Littell, R. C. and Louv, W. C. (1981). Confidence regions based on methods of combining test
statistics. J. Amer. Statist. Assoc. 76, 125-130.
Marden, J. I. (1982). Combining independent noncentral chi squared or F tests. Ann. Statist. 10,
266-277.
Monti, K. L. and Sen, P. K. (1976). The locally optimum combination of independent test statistics.
J. Amer. Statist. Assoc. 71, 903-911.
Oosterhoff, J. (1969). Combination of One-Sided Statistical Tests, Mathematical Centre Tracts, 28,
Amsterdam.
Pape, E. S. (1972). A combination of F-statistics. Technometrics 14, 89-99.
Pearson, E. S. (1950). On questions raised by the combination of tests based on discontinuous
distributions. Biometrika 37, 383-398.
Pearson, K. (1933). On a method of determining whether a sample of size n supposed to have been
drawn from a parent population having a known probability integral has probably been drawn at
random. Biometrika 25, 379-410.
Combination of independent tests 121
Stouffer, S. A., Suchman, E. A., DeVinney, L. C., Star, S. A. and Williams, R. M. (1949). The
American Soldier: Adjustment During Army Life (Vol. I). Princeton Univ. Press, NJ.
Tippett, L. H. C. (1931). The Methods of Statistics. Williams and Norgate, London, 1st ed.
Van Zwet, W. R. and Oosterhoff, J. (1966). On the combination of independent test statistics. Ann.
Math. Stat. 38, 659-680.
Wilkinson, B. (1951). A statistical consideration in psychological research, Psych. Bull. 48, 156-158.
Zelen, M. (1957). The analysis of incomplete block designs. J. Amer. Statist. Assoc. 52, 204-216.
Zelen, M. and Joel, L. S. (1959). The weighted compounding of two independent significance tests.
Ann. Math. Statist. 30, 885--895.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 "7
/
Elsevier Science Publishers (1984) 123-173
Combinatorics
Lajos Takdcs
n!=l.2.--n.
n! ~ ~/2,rrn(n/e)"
123
124 Lajos Takdcs
for 1 ~< k ~< n; these arrangements are called combinations without repetition.
The number of ordered arrangements of k different objects chosen among n
distinct objects is
for 1 <~ k ~< n ; these arrangements are called variations without repetition.
The number of unordered arrangements of k objects chosen in such a way
that each object may be any of n distinct objects is
for k = 1,2 . . . . and n/> 1; these arrangements are called variations with
repetition.
See Table 1 for a display of the above formulas. For more details see Netto
[38], Riordan [42], and Ryser [45].
Binomial coefficients. The oldest combinatorial problems are connected with
the notion of binomial coefficients. For any x the k-th binomial coefficient
Combinatorics 125
(k = 1, 2 . . . . ) is defined as
(k)=X(X-1)...(x-k+k! 1)
a n d (~)= 1.
In the early age of m a t h e m a t i c s b i n o m i a l coefficients a p p e a r e d in three
different disguises.
T h e first a p p e a r a n c e is c o n n e c t e d with the solution of the p r o b l e m of finding
the n u m b e r of ways in which k o b j e c t s can b e chosen a m o n g n distinct objects
w i t h o u t regard to order. A s we have already seen, the solution is (~) for
,~c = 1, 2 . . . . . n. See T a b l e 2.
Table 2
nk 0
0
1 0
1 2
0
3
0
4
0
1 1 1 0 0 0
2 1 2 1 0 0
3 1 3 3 1 0
4 1 4 6 4 1
F kl= F k + F ~ + . . . +F k
n i1
Table 3
Fk
n
k 1
0
1
1
1
2
1
3
1
4
1
5
1
2 1 2 3 4 5 6
3 1 3 6 10 15 21
4 1 4 10 20 35 56
5 1 5 15 35 70 126
W e have
126 Lajos Tak~cs
Fk= ( n +
k-1
(a + b) ~ = ~ Ckakb n-k
n
k=O
k ! ( n - k)!
Table 4
Arithmetic triangle
1
1 1
1 2 1
1 3 3 1
1 4 6 4 1
The numbers (~), 0 ~< k ~< n, gained importance when it was recognized that
they a p p e a r as coefficients in the expansion of (a + b) n. Apparently, this was
known to O m a r Khayyam, a Persian poet and mathematician, who lived in the
eleventh century. See W o e p c k e [63]. In 1303 Shih-chieh Chu [8] refers to the
numbers (~), 0 ~< k ~< n, as an old invention. (See Y. Mikami [32, p. 89], and J.
N e e d h a m [37, p. 133].) T h e numbers (~,), 0 ~< k ~< n, arranged in the form of a
triangular array, appeared in 1303 at the front of the b o o k of Shih-chieh Chu.
Unfortunately, the original b o o k was lost, but it was restored in the nineteenth
century. T h e triangular array first appeared in print in 1527 on the title page of
the b o o k of P. Apianus (see Smith [47, p. 509]). In 1544, M. Stiefel showed that
in the binomial expansion of (a + b) n the coefficients C kn (0 <~k <~n) can be
calculated by the recurrence equations
Corn binatorics 127
c k + l = Ckn ..~ c k + l
n+l n
where C On
= C nn = 1 for n >/0. He arranged the coefficients C nk ~ 0 ~ k <~n, in a
triangular array. In 1556 N. Tartaglia claimed the triangular array as his own
invention.
The numbers C kn ' 0<~ k <-n, appeared in the seventeenth century in con-
nection with combinations. In 1634 H6rigone knew that
+ (c + - c
i=0 j ] (a + bj) n- j / (c + b(n - j))
=(a+c+bn) a+c
n (a + c + bn)"
The method of inclusion and exclusion. One of the most powerful com-
binatorial methods in probability theory and in mathematical statistics is the
method of inclusion and exclusion. To formulate the main result, let us suppose
that O is a finite set and A1, A2 . . . . . An are subsets of /2. Denote by Hk
(0 ~< k <~ n) the set of all those elements o f / ' / which belong to exactly k sets
among A1, A 2 , . . . , An. The goal is to find N(Hk), the number of elements of
Hk, provided that we know N(/2), the number of elements of /-2, and
N ( A i l A h . . . A i i ) , the n u m b e r of elements in the intersection of the sets
Ai t, A t . . . . . Aij for all 1 ~< il < i2 < ' " < ii ~< n. We have
N(Hk)=~.(-1)'-k(~)S, (1)
]=k
where
So = N(12),
$1 = N ( A , ) + N ( A 2 ) + " " " + N ( A n ) ,
$2 = N(A1A2)+ N ( A 1 A 3 ) + " " + N(An-IAn),
or, in general,
k=j
EXAMPLE (see M o n t m o r t [34, 35] and Tak~ics [58]). D e n o t e by 12 the set of all
the n ! p e r m u t a t i o n s of 1, 2 . . . . . n. Let Ai (i = 1, 2 . . . . ,n) be the set of all those
p e r m u t a t i o n s of 1, 2 . . . . . n in which the i-th element is i, or briefly, Ai is the
set of p e r m u t a t i o n s of 1, 2 , . . . , n in which there is a match at the i-th place.
T h e n N ( 1 2 ) = n! and
n! +
N ( H k ) = k! j~=k (J - k)!
for k = 0, 1 . . . . . n.
Let us suppose that a box contains n cards m a r k e d 1, 2 . . . . . n and that all
the n cards are drawn o n e by o n e without replacement. All the n! possible
o u t c o m e s are supposed to be equally probable. D e n o t e by u, the n u m b e r of
matches in the p e r m u t a t i o n of 1, 2 , . . . , n o b t a i n e d by the n drawings. T h e n
P { v . = k} = N ( H k ) / N ( 1 2 ) , that is,
n -- "-
N(a) j!
for j = 0, 1. . . . . n.
Corn binatorics 129
P{v~=k}=(-1)'-k(~)Bj fork=0,1 . . . . . n,
j~k
and
where B0 = 1 and
for j = 0, 1 , . . . , n.
If in the sums for P{~'n = k} and P{vn I> k} we add less than n - k + 1 terms,
the error is of the same sign, and has smaller absolute value than the first term
neglected.
Stirling numbers. In 1730 Stirling [49, 50] introduced some remarkable num-
bers which we call today Stirling's numbers of the first kind and Stirling's
numbers of the second kind. See Jordan [24], Riordan [42] and P61ya and
Szeg6 [40].
Stirling's numbers of the first kind will be denoted by S(n, k) for n/> 0 and
k / > 0. See Table 5.
Table 5
S(n, k)
nk 0
0
1
1
0 0
2
0
3 4
0
5
0
6
0
1 0 1 0 0 0 0 0
2 0 1 1 0 0 0 0
3 0 2 3 1 0 0 0
4 0 6 11 6 1 0 0
5 0 24 50 35 10 1 0
6 0 120 274 225 85 15 1
130 Lajos Takdcs
where n 9 0 and k/> 1. The initial conditions are S(n, 0 ) = 0 for n/> 1,
S(0, k ) = 0 for k/> 1 and S ( 0 , 0 ) = 1.
The numbers S(n, k ) (1 ~< k ~< n) have a simple combinatorial interpretation,
namely, S(n, k ) is the number of those permutations of 1, 2 . . . . . n which
decompose into k disjoint cycles.
We have the following explicit formula
n~
S(n, k ) 2., kl!k2! . . . k.!lk12 k2" " " n k" ' ka+ 2k2+ "'" + n k , = n ,
kl + k z + ' " + k,, = k ,
for l~<k ~<n where the summation extends over all nonnegative integers
kl, k2 . . . . . k, which satisfy the stated conditions.
We have, for any x and n 1> 1,
~S(n,k)x" 1 ( 1 )k
,=k n! ----k-~. l g ( 1 - ~ "
J
for l ~ < i < j ~ < n , then
P(~,, = k} = S(n, k )
n!
/'/ F/ "
Combinatorics 131
Table 6
@(n, k)
nk 0
0
1
1
0
2
0
3
0
4
0
5
0
6
0
1 0 1 0 0 0 0 0
2 0 1 1 0 0 0 0
3 0 1 3 1 0 0 0
4 0 1 7 6 1 0 0
5 0 1 15 25 10 1 0
6 0 1 31 90 65 15 1
~(n,k)= (-1)k'~"(-1)i(k)i~
k! i=o
for 0 ~ k ~ n. Also
1 n[
~(n, k ) = ~ ; h ! j ~ ! _ :.j~!, 1~+'"+1~ = n
k=l
Xk
~_, ~(n, k)x" =
n=k
(1 - x)(1 - 2x). (1 - kx)"
The numbers ~(n,k) (O<-k<~n) frequently appear in the solutions of
132 Lajos Takgtcs
T, = ~ ~ ( n , k ) ,
k=0
n = 0, 1. . . . . are called Bell numbers. W e can interpret T, as the n u m b e r of
different partitions of a set of n elements, say (1, 2 . . . . . n), into any n u m b e r of
disjoint subsets. W e have To = T~ = 1, T2 = 2, T3 = 5, T4 = 15, T5 = 52, T6 =
203 . . . . .
T h e following recurrence f o r m u l a can be used in calculating T, for n i> 1:
Tn+l --2 ( 7 ) r ]
j=0
F o r n / > 1, we have
A ( n + 1, k) = (k + 1)A(n, k ) + (n + 1 - k ) A ( n , k - 1)
Table 7
A(n, k)
0 1 2 3 4 5
1 1 0 0 0 0 0
2 1 1 0 0 0 0
3 1 4 1 0 0 0
4 1 11 11 1 0 0
5 1 26 66 26 1 0
6 1 57 302 302 57 1
Combinatorics 133
I
i
Fig. 1.
k+~ ( )
A ( n , k ) = A ( n , n - l - k ) = ~ ( - 1i=0
)' n +il (k + 1 - i) n
for0<~k<~n-l, andA(n,k)=0fork1>n~>l.
If n/> 1, then for any x we have
A(n, k = x n.
k=0
for 1 <~ k ~< n, that is, (~-~) is the n u m b e r of solutions of the equation
t'n-~__o/ml(-1)J(~)(n-kllJm )
k=l
Partitions. T h e a b o v e f o r m u l a s a r e c o n c e r n e d with o r d e r e d a r r a n g e m e n t s . It
is m o r e difficult to find t h e n u m b e r of u n o r d e r e d d e c o m p o s i t i o n s of a p o s i t i v e
i n t e g e r n as a s u m k (or a n y n u m b e r ) of p o s i t i v e integers. D e n o t e by p(n, k)
t h e n u m b e r of s o l u t i o n s of (3) in p o s i t i v e i n t e g e r s a~, az . . . . . ak satisfying the
r e q u i r e m e n t s a l <~ a2 ~<" ~< ak. O b v i o u s l y p(k, k ) = 1 for k i> 1, a n d p(n, 1) = 1
for n / > 1, a n d we can d e t e r m i n e p(n, k) for all 1 ~< k ~< n by t h e r e c u r r e n c e
equation
k
p(n, k) = ~_. p(n - k, i) .
i=1
Table 8
p(n, k)
kn 1
1
1
2
1
3
1
4
1
5
1
6
1
7
1
2 0 1 1 2 2 3 3
3 0 0 1 1 2 3 4
4 0 0 0 1 1 2 3
5 0 0 0 0 1 1 1
6 0 0 0 0 0 1 1
The sum
p(n) = ~ p(n, k)
k=l
is t h e n u m b e r of d e c o m p o s i t i o n s of n as a s u m of any n u m b e r of p o s i t i v e
i n t e g e r s w i t h o u t r e g a r d to o r d e r . W e d e f i n e p ( 0 ) = 1.
L. E u l e r [16, p. 318] k n e w t h a t
for n = 1, 2 . . . . where
If n/> 576 and if we replace the infinite sum Z=k=l by a finite sum E~= 1 where
N = [ ~ / n ] , then the error is less than , and the integer p(n) can be determined
precisely. If n is large, the following asymptotic formula can be used for the
approximation of p(n):
, , 1 _e,, 2~,/3
p[n) 4n~/3
a s /,/....> oo.
Success runs. Let us consider a sequence of independent and identical trials
in which each trial has two possible outcomes, namely, success, which has
probability p, and failure, which has probability q, where p > 0, q > 0 and
p + q = 1. In this case we speak about a sequence of Bernoulli trials with
probability p for success. Denote by i,, the number of successes in the first n
trials. Then
for j = 0 , 1 . . . . . n.
136 Lojos Takdcs
+ 1 [(j-k)/m]
(_l)i(ki)(]-ktm_?l)}p,qm_j.
.
(n-~){ ~
i=0
a (a+2k\ ~+k k
P{p(a)=a + 2k}= a + 2~ k }P q
for k = 0, 1, 2 . . . . . and
P{~l*=2j-n}=[(7)-(jn_a)]P'q"-J
if 2j < n + a, and
a
P{rl * = a} = P { p ( a ) <~ n} = P{rln >i a} + P{% < -a}.
for - b < 2 j - n < a. H e r e the sum is formed for all k = 0, _1, ---2. . . . for which
the binomial coefficients are not 0. We note that (7) = n ! / j ! ( n - j ) ! if j =
0, 1 , . . . , n and (7)= 0 otherwise.
For more details see Feller [18].
De Moivre numbers. In 1708 A. De Moivre [9] solved the following problem
of games of chance: Two players, A and B, agree to play a series of games. In
each game, independently of the others, either A wins a coin from B with
probability p or B wins a coin from A with probability q, where p > 0, q > 0
and p + q = 1. Let us suppose that A has an unlimited n u m b e r of coins, B has
only k coins, and the series ends when B is ruined, i.e., when B loses his last
coin. D e n o t e by p ( k ) the duration of the games, i.e., the n u m b e r of games
played until B is ruined. A. De Moivre discovered that
k (k +2j)
P { p ( k ) = k + 2j} - k + 2j J pk+jqj (4)
for k i> 1 and j / > 0 . D e Moivre stated (4) without proof. Formula (4) was
proved only in 1773 by P. S. Laplace and in 1776 by J. L. Lagrange. In 1802 A.
M. A m p e r e expressed his view that formula (4) is remarkable for its simplicity
and elegance.
It is convenient to write
L(j' k ) - k + 2j j /
for j I>0 and k 1> 1, L ( 0 , 0 ) = 1 and L(j, 0 ) = 0 for j / > 1. The numbers L(j, k)
might appropriately be called De Moivre numbers. They can also be expressed
as
138 Lajos Takds
L(j,k)=(k+2j-1)-(k+
j
2j-1)
j-1
Table 9
L(j, k)
kJ 0
0
1
1
0
2
0
3
0
4
0
5
0
1 1 1 2 5 14 42
2 1 2 5 14 42 132
3 1 3 9 28 90 297
4 1 4 14 48 165 572
5 1 5 20 75 275 1001
M(j,k)=(2j+k) k+l
j j+k+l
ways such that for every r -- 1, 2 . . . . . 2j + k a m o n g the first r letters there are
at least as m a n y A as B. Obviously, M(j, k) = L(j, k + 1)/(j + k + 1).
Combinatorics 139
P(a, b) = (a - b)/(a + b)
provided that all the possible voting records are equally probable.
P(a, b) can be expressed as N ( a , b)/(a+b), where N ( a , b) is the n u m b e r of
favorable voting records, and (a+ab) is the n u m b e r of possible voting records.
B e r t r a n d ' s f o r m u l a follows f r o m the obvious observation that N ( a , b ) =
L(b, a - b).
In 1960 L. Takfics [51,52] generalized B e r t r a n d ' s ballot t h e o r e m in the
following way:
Let us suppose that a box contains n cards m a r k e d al, a2 . . . . . a, where
al, a 2 , . . . , a, are nonnegative integers with sum a~+ a 2 + ' ' " + a, = k, where
k ~< n. W e draw all the n cards without replacement. Let us assume that every
o u t c o m e has the same probability. T h e probability that the sum of the first r
n u m b e r s drawn is less than r for every r = 1, 2 . . . . . n is given by
P(n, k ) = (n - k )/n .
P ( a + b, 2b) = (a - b)/(a + b ) .
Order statistics. Most of the problems in order statistics are connected with
the comparison of a theoretical and an empirical distribution function or with
the comparison of two empirical distribution functions.
To consider the first case, let ~1, ~2. . . . . ~:n be mutually independent random
variables each having the same distribution function F(x). Denote by Fn(x) the
empirical distribution function of the sample (sc~,so2. . . . . ~n). Define
does not depend on F(x). The probability P{6+(m, n) ~< x} was found by B. V.
G n e d e n k o and V. S. Koroljuk [19] for m = n, and V. S. Koroljuk [27] for
n = mp where p is a positive integer. If n = rap, where p is a positive integer,
and if k is a nonnegative integer, then by the generalization of Bertrand's
ballot theorem we obtain that
,
~" (n + k + 1 - sp) s m -
s) "
(k +l)/p<~s<<.m
t r i b u t e d r a n d o m v a r i a b l e s . T h e f o l l o w i n g t w o t h e o r e m s a r e v e r y u s e f u l in such
studies.
In 1953 E. S p a r r e A n d e r s o n [1] d i s c o v e r e d t h e f o l l o w i n g c o m b i n a t o r i a l
t h e o r e m . See also W . F e l l e r [17].
L e t cl, c2 . . . . . cn b e real n u m b e r s a n d c o n s i d e r t h e n! p e r m u t a t i o n s of
(Cl, c2 . . . . . cn). T h e r e a r e as m a n y p e r m u t a t i o n s in w h i c h p r e c i s e l y k (k =
0, 1 . . . . . n ) a m o n g its n successive p a r t i a l s u m s a r e strictly p o s i t i v e ( n o n -
n e g a t i v e ) as t h e r e are p e r m u t a t i o n s in w h i c h t h e first (last) m a x i m a l e l e m e n t in
t h e s e q u e n c e of 0 a n d t h e n s u c c e s s i v e p a r t i a l s u m s o c c u r at t h e k - t h p l a c e
(k --0, 1,...,n).
In 1956 F. S p i t z e r [48] d i s c o v e r e d a n o t h e r c o m b i n a t o r i a l t h e o r e m w h i c h t o o
has m a n y a p p l i c a t i o n s in f l u c t u a t i o n t h e o r y .
L e t cl, c2 . . . . . cn b e real n u m b e r s such t h a t cl + c2 + + cn = 0 b u t ci, + c~ +
..+ciJ0 for l < ~ i l < i 2 < . . . < i i < ~ n if j < n . For each k=0,1 ..... n-1
t h e r e is e x a c t l y o n e cyclic p e r m u t a t i o n of (c~, c2, , cn) such t h a t e x a c t l y k of
its p a r t i a l s u m s a r e positive.
References
[17] Feller, W. (1959). On combinatorial methods in fluctuation theory. In: Ulf Grenander, ed.,
Probability and Statistics. The Harald Cramrr Volume. Almqvist and Wiksell, Stockholm and
Wiley, New York, pp. 75-91.
[18] Feller, W. (1968). An Introduction to Probability Theory and its Applications, Volume 1, third
edition. Wiley, New York.
[19] Gnedenko, B. V. and Koroljuk, V. S. (1951). On the maximum discrepancy between two
empirical distribution functions. (In Russian) Dokl. Akad. Nauk SSSR 80, 525-528. [English
translation: Selected Translations in Mathematical Statistics and Probability, IMS and AMS, 1
(1961) 13-16.]
[20] Hall, M. Jr. (1967). Combinatorial Theory. Blaisdell Publishing Co., Waltham, MA.
[21] Hardy, G. H. and Ramanujan, S. (1918). Asymptotic formulae in combinatory analysis.
Proceedings of the London Mathematical Society 17, 75-115. [Collected Papers of G. H. Hardy.
Vol. 1. Oxford University Press, 1966, pp. 306-339.]
[22] Johnson, N. L. and Kotz, S. (1977). Urn Models and Their Application. A n Approach to Modern
Probability Theory. Wiley, New York.
[23] Jordan, C. (1867). De quelques formules de probabilitr. Comptes Rendus Acad. Sci. Paris 65,
993-994.
[24] Jordan, Ch. (1939). Calculus of Finite Differences. Budapest. [Reprinted by Chelsea, New York,
1947.]
[25] Jordan, K. (1927). A val6sziniisrgszfimitfis alapfogalmai (Les fondements du calcul des
probabilitirs). Mathematikai ~s Physikai Lapok 34, 10%136.
[26] Jordan, K. (1972). Chapters on the Classical Calculus of Probability. (English translation of the
Hungarian original published in 1956.) Akadrmiai Kiad6, Budapest.
[27] Koroljuk, V. S. (1955). On the discrepancy of empirical distribution functions for the case of two
independent samples. (In Russian) Izv. Akad. Nauk SSSR Set. Math. 19, 81-96. [English
translation: Selected Translations in Mathematical Statistics and Probability, IMS and AMS, 4
(1963) 105-121.1
[28] Leibniz, G. W. (1666) Dissertatio de Arte Combinatoria.
[29] MacMahon, P. A. (1908). Second memoir on the compositions of numbers. Philosphical
Transactions of the Royal Society of London. Set. A 207, 65-134.
[30] MacMahon, P. A. (1915-1916). Combinatory Analysis. I-II. Cambridge University Press.
[Reprinted by Chelsea, New York, 1960.]
[31] MacMahon, P. A. (1978). Collected Papers, Volume I: Combinatorics. The MIT Press,
Cambridge, Mass.
[32] Mikami, Y. (1913). The Development of Mathematics in China and Japan. Teubner, Leipzig.
[Reprinted by Chelsea, New York, 1961.]
[33] Mohanty, S. G. (1979). Lattice Path Counting and Applications. Academic Press, New York.
[34] Montmort, P. R. (1708). Essay d'Analyse sur les Jeux de Hazard. Paris.
[35] Montmort, P. R. (1713). Essay d'Analyse sur les Jeux de Hazard, second edition. Paris. [Reprinted
by Chelsea, New York, 1980.]
[36] Narayana, T. V. (1979). Lattice Path Combinatorics with Statistical Applications. University of
Toronto Press, Toronto.
[37] Needham, J. (1959). Science and Civilisation in China, VoL 3: Mathematics and the Sciences of the
Heavens and Earth. Cambridge University Press.
[38] Netto, E. (1901). Lehrbuch der Combinatorik. B. G. Teubner, Leipzig. Second edition, 1927.
[Reprinted by Chelsea, New York, 1964]
[39] Pascal, B. (1665). Trait~ du Triangle Arithmdtique. Pads.
[40] P61ya, G. and Szegr, G. (1972 and 1976). Problems and Theorems in Analysis, Vol. 1-11. Springer,
New York.
[41] Rademacher, H. (1937). On the partition function p(n). Proceedings of the London Mathematical
Society (2) 43, 241-254. [Collected Papers of Hans Rademaeher. Vol. II. Edited by E. Grosswald.
The MIT Press, Cambridge, Mass., 1974, pp. 108-122.]
[42] Riordan, J. (1958). A n Introduction to Combinatorial Analysis. Wiley, New York.
[43] Riordan, J. (1968). Combinatorial Identities. Wiley, New York.
Combinatorics 143
[44] Rota, G.-C. (1964). The number of partitions of a set. American Mathematical Monthly 71,
498--504. [Reprinted in Finite Operator Calculus by Gian-Carlo Rota. Academic Press, New
York, 1975, pp. 1--6.]
[45] Ryser, H. J. (1963). Combinatorial Mathematics. The Carus Mathematical Monographs No. 14.
The Mathematical Association of America.
[46] Smirnov, N. V. (1944). Approximate laws of distribution of random variables from empirical data.
(In Russian) Uspechi Mat. Nauk 10, 179-206.
[47] Smith, D. E. (1925). History of Mathematics, Vol. 11. [Reprinted by Dover, New York, 1958.]
[48] Spitzer, F. (1956). A combinatorial lemma and its application to probability theory. Transactions
of the American Mathematical Society 82, 323-339.
[49] Stirling, J. (1730). Methodus Differentialis, sive Tractatus de Summatione et Interpolatione
Serierum lnfintarum. London.
[50] Stirling, J. (1749). The Differential Method: or, A Treatise Concerning Summation and
Interpolation of Infinite Series. London.
[51] Takfics, L. (1961). The probability law of the busy period for two types of queuing processes.
Operations Research 9, 402--407.
[52] Takfics, L. (1962). A generalization of the ballot problem and its application in the theory of
queues. Journal of the American Statistical Association 57, 327-337.
[53] Takfics, L. (1962). Ballot problems. Zeitschrift fiir Wahrscheinlichkeitstheorie und verwandte
Gebiete 1, 154-158.
[54] Tak~ics, L. (1967). Combinatorial Methods in the Theory of Stochastic Processes. Wiley, New York.
[55] Takfics, L. (1967). On the method of inclusion and exclusion. Journal of the American Statistical
Association 62, 102-113.
[56] Takfies, L. (1969). On the classical ruin problem. Journal of the American Statistical Association
64, 889-906.
[57] Takfics, L. (1979). A generalization of the Eulerian numbers. Publicationes Mathematicae
(Debrecen) 26, 173-181.
[58] Takfics, L. (1980). The problem of coincidences. Arehive for History of Exact Sciences 21, 229-244.
[59] Takfics, L. (1982). Ballot problems. In: S. Kotz and N. L. Johnson, eds., Encyclopedia of Statistical
Sciences, Vol. 1. Wiley, New York, pp. 183-188.
[60] Takfics, L. (1982). Combinatorics. In: S. Kotz and N. L. Johnson, eds., Encyclopedia of Statistical
Sciences, Vol. 2. Wiley, New York, pp. 53--60.
[61] Wallis, J. (1693). De Algebra Tractatus. Oxford. [John Wallis: Opera Mathematica. II. Georg
Olms Verlag, Hildesheim, 1972,]
[62] Whitworth, W. A. (1879). Arrangements of m things of one sort and n things of another sort,
under certain conditions of priority. Messenger of Mathematics 8, 105-114.
[63] Woepcke, F. (1851). L'Alg~bre d'Omar Alkfiyyami. French translation. Duprat, Paris.
[64] Worpitzky, J. (1883), Studien fiber die Bernoullischen and Eulerschen Zahlen. Journal fiir die
Reine und Angewandte Mathematik 94, 203-232.
P. R. Krishnaiah and P.K. Sen, eds., Handbook of Statistics, Vol. 4
u
Elsevier Science Publishers (1984) 145-171
Malay Ghosh
I. Introduction
N
Su = ~ c ( i ) a ( R i ) , (1.1)
i=l
where the c(i)'s are referred to as regression constants, and the a(i)'s are
referred to as scores. Such statistics arise quite naturally in obtaining locally
most powerful rank tests against certain regression alternatives (see e.g. H~jek
and Sid~k, 1967). The special case when c(1) . . . . . c ( m ) = 0 and c(m + 1)=
*Research supported by the Army Research Office Durham Grant Number DAAG29-82-K-
0136.
145
146 Malay Ghosh
N
Sgr = ~'~ c(i) sgn Z i a ( R ~ ) , (1.2)
i=1
and such statistics are referred to as signed rank regression statistics. When
c(1) . . . . . c ( N ) = 1, the statistics are used for one-sample location tests. In
this case the important special choices for the score function a are (i) a ( 1 ) =
. . . . a ( N ) = 1 which corresponds to the sign test statistic, and (iii) a(i) = EIZ[~i)
where IZI0)< --- < IZI(N) denote the order statistics corresponding to the ab-
solute values in a random sample of size N from the N(0, 1) distribution. The
resulting test statistic is referred to as the one sample normal scores test statistic.
We shall review the literature on limit theorems for linear rank statistics and
signed linear rank statistics. Some of these statistics are closely related to the
so-called U-statistics. For example, the Wilcoxon signed rank statistic can also
be expressed as
(see Tukey, 1949), and we shall see later that (,~)-1 times the above statistic is
expressible as a weighted average of two one-sample U-statistics. The Wil-
coxon rank sum statistic W is expressible as W = U + (,~1), where U =
Ei%~ ~j%~ Iwj~Xil, where U is the celebrated M a n n - W h i t n e y U-statistic. In the
above, and in what follows Ia = 1 if the event A occurs, and Ia = 0, otherwise.
The M a n n - W h i t n e y U-statistic is an exampl e of a two-sample U-statistic.
Section 2 presents the central limit theorem, rates of convergence and laws
of large numbers for one-sample U-statistics. Multisample extension of these
results is given in Section 3. Section 4 deals with jackknifed and bootstrapped
U-statistics, while Section 5 provides functional central limit theorems for
U-statistics. Some miscellaneous remarks related to U-statistics are made in
Section 6.
Rank statistics and limit theorems 147
If (2.1) holds, we say that O(F) is estimable. If O(F) is estimable, the smallest
sample size, say m for which (2.1) holds, is called the degree of O(F), and
~b(X1. . . . . Xm) is called the kernel of 0(1:). Without any loss of generality, we
may assume that ~b is symmetric in its arguments, as otherwise, we can define
~b0(X1. . . . . X,,) = (m !)-1E ~ b ( X ~ , . . . , X~m), where the summation extends over
all possible permutations (al . . . . . am) of the first m positive integers.
Corresponding to a symmetric kernel ~b of O(F), we define a one-sample
U-statistic (see Hoeffding, 1948) by
Such statistics have proved to be quite useful in the theory of point and
confidence estimation, and in hypothesis testing. For a very detailed account of
U-statistics, the reader is referred to Purl and Sen (1971, Chapter 3), Randles
and Wolfe (1979, Chapter 3), and Serfling (1980, Chapter 5).
Examples of one-sample U-statistics include the sample mean
i=1
i=1 l~i<j~n
148 Malay Ghosh
can be expressed as
(n + 1)-~(2U,~ + (n - 1)U,2)
where
n
Unl :" n -1 ~ sgn Xi
i=1
Both U,x and Un2 a r e U-statistics. This fact was mentioned in Section 1.
Hoeffding (1948) obtained a very useful central limit theorem for suitably
normalized U-statistics. Later, Hoeffding (1961) introduced a decomposition
for U-statistics which facilitated considerably the development of the asymp-
totics for U-statistics. Similar decomposition for more general statistics (not
necessarily symmetric) is available in the later work of Hfijek (1968), Efron and
Stein (1981), and Karlin and Rinott (1982).
To arrive at Hoeffding's decomposition, we first introduce the following
notations. For any c = 1 . . . . . m, let
m "
)
(2
Z Z ~,2(xj, x ; ) + . - - + - -
(m")
XE ' ' " E ~//m(Xj, . . . . . Xj,). (2.8)
l<.jl<...<jm<.n
THEOREM 1. Assume that E[t~2(X1 . . . . ,Xm)] <o0 and ~rl = V[~Ol(X1)] > 0 .
Then
L
~ / n ( U , - O(F))/(m2~l) u2---~ N(O, 1).
L
where ~ means convergence in distribution.
The above theorem, first obtained by Hoeffding (1948), follows easily from
(2.8) by applying the classical central limit theorem on the mean like term
m n -1Ej-"_-I ~01(Xj) and showing that E(RZ,)--~ 0, where R , is the remainder term
in (2.8) in the decomposition of U, - O(F). This proof involves the 'projection'
idea of breaking up a statistic as a sample mean plus a remainder term
converging to zero at a rate faster than the centered sample mean.
As an application of Theorem 1, consider the example when
where /z4 = E(X1 - tz)4. Accordingly, if or4 </x4 < ~, applying Theorem 1, one
gets
L
~v/-~(Un - 0"2)/(].,4 - 0-4) 1/2---)' N O , 1).
One may make a note that (2.10) is not the usual way the variance of a
U-statistic is expressed in the literature (see, for example, Puri and Sen, 1971;
Randles and Wolfe, 1979; or Serfling, 1980). Writing (c = V[cb(X1 . . . . . Xc)],
V ( U , ) is usually expressed as V ( U , ) = (m")-I Ec"=l (m)(7,--"~)~'c.
Situations could arise though when ~rl = 0. For instance, in the sample
variance example, one could have/.t4 = o-4. In such a situation, Theorem 1 is
not applicable.
Hoeffding (1948) obtained the inequalities ~c/c <~~d/d for 1 ~< c ~< d ~< m. If
~1 . . . . . ~'c--0 < ~'~+1, one can still obtain asymptotic distribution of suitably
normalized U,.
The case ~'1 = 0 < ~'2 has been addressed by Gregory (1977) (see also Serfling,
1980). If E~b2(X1 . . . . , Xm) < o~" and ~1 : 0 < ~2, then
L
n(U,, - O(F))--> (~) Y ,
where Y is a weighted average of possibly infinitely many chi-squared random
variables each with one degree of freedom. As an application, consider once
again the sample variance example where the Xi's are Bernoulli random
variables with probability of success equal to .
Recently, interest has been focussed on obtaining rates of convergence to
normality for U-statistics. The problem has been addressed by Grams and
Serfling (1973), Bickel (1974), Chan and Wierman (1977), and Callaert and
Janssen (1978), the latter obtaining the sharpest rate as follows.
(i) EF[~2(/1 . . . . . /m)]<oo, and (ii) v<oo. Note that the condition
Ev[4~(X1. . . . . Xm)]3 < oo implies both conditions (i) and (ii), but not vice versa.
An example to this effect appears in a paper of Borovskikh (1979).
Central limit theorems for U-statistics with random indices were obtained by
Sproule (1974), and rates of convergence to normality for such random U-
statistics were obtained by Ghosh and Dasgupfa (1982). Callaert, Janssen, and
Veraverbeke (1980) obtained certain asymptotic Edgeworth type expansions
for U-statistics.
Next, in this section, we mention briefly the laws of large numbers for one
sample U-statistics. The classical strong law of large numbers generalized to
U-statistics is as follows.
THEOREM 3. If EF[c~(X1 . . . . . Xm)] < o~ then U,-+ O(F) almost surely as n-,oo.
The strong convergence result was first obtained by Sen (1960) under the
stronger moment condition EI~b(X1 . . . . . Xm)12-m-~< o. Later, Hoeffding (1961)
obtained the result under the assumption of finiteness of the first moment of ~b
using the decomposition given in (2.8). Berk (1966) showed that U-statistics are
backward martingales, and proved Theorem 3 using the strong convergence
results for backward martingales.
For most practical purposes, it is useful to obtain an estimate of the variance
of U, or at least o f m2~l, since nV(Un)/m2(xol as n0~. The following
Jackknifed estimator of V(Un) was proposed by Sen (1960). To introduce Sen's
estimator, first let
where the summation extends over all possible 1 ~<a 2 < " " < am <<-nwith aj:~ i
for any j = 2 , . . . , m. Define
S~n = ( n -- 1) -1 ~ [ U ( 1 ) ( X i ) - Un] 2 .
i=l
Sen (1960) (and later Sproule, 1969) showed that if E4~2(X1 . . . . . Xm) < 0% then
P
S2.-+sq as n o .
Accordingly,
L
- o(F))/(mS,.)--, N(0,1).
Very recently, rate of convergence to normality for such studentized U-statistics
has been obtained by Callaert and Veraverbeke (1981). They show that if
E [ ~ ( X 1 , . . . , Xm)[4'5 < % then,
0(F))
supl '
For a generalization of Theorem 1 to vector-valued U-statistics, consider a
152 Malay Ghosh
vector (Unl ..... Unk), w h e r e U,i is a o n e sample U-statistic with kernel ~b~ and
d e g r e e rn~, i.e.
.,i=(n),
rnl ~ ~I(Xj,,..., Xjmi) ,
l~Jl<'"<Jmi<-n
(xo, . . . . . x%)
C(al . . . . . tc)fll . . . . . m-c
x%)
~C(ffl. . . . . . c)fll . . . . . tim-c; Y, ..... "Ym-c = [E6c(al ...... c).B,..... B,,,_c (xa'
6 (x~ . . . . . x%)],
C(tl. . . . . ~c)Yl '" " " ' )'m-c
{( n ' ~ ( 2 m - c ' ~ ( 2 m - ~ c ) } -1
~,~
2m - c/', c /', m ~ c(ot 1. . . . . . )~1 . . . . . a m - c ; ' y l . . . . . "Ym-c
THEOREM 4. Suppose
(i) supl,,,,<...<%~, E~b2(X,~,. . . . . X%) < oo;
(ii) EIol(o(X~)I- 2+8 < ~ for all i = 1 , . . . , n, and
Rank statistics and limit theorems 153
3. Multisample U-statistics
where the summation extends over all possible 1 <~ al < ' - " < am1~< nl, and
1 ~<131 < " " </3m2 < n2.
Important examples of two-sample U-statistics are I7",~-Jr,1 and the Mann-
Whitney U-statistic
nl n2
Unl.~2 = (n,n2) -1 ~'~ Z I t y ~ x d .
i=l j=l
V[ U.,,,~] as
nl -1 n2 -1 ml
V[U""~]= (rnl) (m2) ~=o~ (m''~(nx-m''~(mz'~[n2-rne~"
' d=okC ] \ m l - c J k d ] k m 2 - d ] ~c'a'
(3.4)
where st0.0is defined as 0. The central limit theorem for two-sample U-statistics
is now stated as follows,
THEOREM 5. If
(i) Eck2(X,,..., Xnl , YI .... , Yn2) < 00,
(ii) 0 < A = !imN_~ nl/N < 1 (where N = nl + n2) and
(iii) ~2 = m2,o/A + m~'o1/(1 - A) > 0,
L
then, X/N[U,~.,~- O(F, G)]/cr ~ N(O, 1) as N--~o~.
Un 1 = (nl~-l... ( n c ~ -1
..... nc \~ll / \?Tic/
where the two sets (ajl . . . . . C~imi) and (fljl . . . . . fljmj) have no integers in com-
mon whenever j ~ i and exactly one integer in comnon when j = i. Let
where the only 1 in the subscript of 6o..... 0,1,0..... 0 is in the ith position. Also, let
N = Zk=ln~. The following c-sample U-statistic theorem is due to L e h m a n n
(1963).
and
E F 1 . . . . . F ~ b 2(X 11 . . . . . Xlni,. .., Xcl,.. .~ X c % ) < o0 ,
L
c
then N/-N(UN-O)--> N(O, tr2), w h e r e o'2= ~i=l m 2 iA - lir ~0 ..... o, 1..... O.
The ~i's are called pseudo-values by Tukey. Then the jackknife estimator of ~7
156 Malay Ghosh
is given by
I .....
i
Cn-1
where C,-1
i denotes that the summation is over all combinations (/3i . . . . , fl~) of
m integers chosen from (1 . . . . . i - 1 , i + 1. . . . . n). Let g be a real valued
function, and let
n
1i = nln - (n - 1)~/_1 , ~n = n-1 E ~i
i=1
and
S2g=(n-l)-1~(~,-~.)2.
i=I
The following two theorems are proved by Arvesen (1969). The first one
provides a central limit theorem for suitable normalized "0,. The second
theorem provides a consistent estimator of the variance of the limiting dis-
tribution of @,.
Bickel and Freedman (1981) have developed a very extensive central limit
theory for many bootstrapped statistics including U-statistics. Under the assump-
tion Edpz(x1 . . . . . X,,) < 0% they showed that
L
V'N(Ufv- U.)---> N(0, m2~'1)
as min(n, N)---> ~. Following Sen (1960), Athreya, Ghosh, Low and Sen (1984)
proposed the estimator
2
S=n=(N - I)-' ~" ~[(mN
' , = 1- 11)-' "'" ~ d~(X*'X~2' . . . , X*,.) - uNjl
1<J2<"" <Jm <N
Jk#i
(4.4)
for (1. This estimator multiplied by m 2 c a n be easily recognized as the
jackknifed estimator of the variance of the asymptotic distribution of boot-
strapped U-statistics. Athreya, Ghosh, Low and Sen (1983) showed that under
the assumption that E~bE(X1. . . . . Xm) < ~,
P
S~--~m2~1 as m i n ( n , N ) ~ .
Accordingly, the bootstrapped pivot
L
~ / - N ( U } - Un)/(mSN)-*N(O, 1) as min(n, N ) - ~ .
Also, a strong law of large numbers for bootstrapped U-statistics was proved by
Athreya et al. (1983) under the assumption that EFI~b(X1. . . . , Xm)l1+8 < ~ for
some G > 0.
m-1 k(Uk - O)
Y,(t) = 0, 0 ~ < t ~< - Y . ( k / n ) = (m2(O1/2nm,
n
z.(0) = 0;
z.(t.k) = (uk - o)/v'/2(u.), k >1 n, t.k = ; (5.2)
Z , ( t ) = Z.(t.k), t.,k+l < t < t.k .
For each n, t,., t,.,+l . . . . . form a sequence tending to 0 and Z,(-) is a step
function continuous from the left.
Before stating the limit laws for the processes Y,(.) and Z,(.), we need
certain preliminaries. Consider a collection of random variables 7"1, Tz . . . . .
and T taking values in an arbitrary metric space S, and having respective
probability measures P1, Pz . . . . . and P defined on the Borel sets in S (i.e. on
the ~r-field generated by the open sets with respect to the metric associated
with S). We say that P, ~ P if lim,_~o~P , ( A ) = P ( A ) for every Borel set A is S
satisfying P ( O A ) = 0, where OA = boundary of A = closure of A - interior of
A. In particular if S is a metrizable function space, then P.--* P denotes
'convergence in distribution' of a sequence of stochastic processes to a limit
stochastic process.
In the Miller and Sen (1972) situation, we take S = C[0, 1], the collection of
all continuous functions on the unit interval [0, 1]. The space C[0, 1] is metrized
by
for x = x(.) and y = y(.) in C[0, 1]. Denote by ~ the class of Boret sets in
C[0, 1] relative to p. Denote by (2), the probability distribution of Y,(.) in
C[0, 1], i.e. the probability measure on (C, ~ ) induced by the measure P
through the relation
k
= 1-10({x(.): x(t,)- x(t,-3 a,}).
i=1
For Loynes (1970), S = D[0, 1], the class of all functions which are right
continuous and for which left hand limit exists. The D[0, 1] space is endowed
with the following topology.
Let A denote the class of all strictly increasing continuous mappings of [0, 1]
onto itself. The Skorohod distance between x and y (both belonging to the
D[0, 1] space) is defined by
nondegenerate case, and Neuhaus (1977) in the degenerate case. Hall (1979)
obtained a single result from which invariance principles in both the degenerate
and nondegenerate cases followed as immediate corollaries.
6. Miscellaneous r e m a r k s
Closely related to the U-statistics are the von Mises V-statistics defined by
It can be shown that (see, for example, Ghosh and Sen, 1970) if
E[~b2(X~1. . . . . X~.)] < ~ for all 1 ~< c~1. . . . . am ~< n, then E ( U , - V , ) 2 = O(n-2).
This implies that,
P
n r ( U , - V,)--->O for any r < l .
There are many linear rank statistics which are not U-statistics, and the
development of limit distributions for such statistics requires a different analy-
sis. Notable among such statistics are the so-called normal scores statistics
introduced in Section 1. In this section, we present certain results concerning
null and nonnull distributions of linear rank statistics.
Recall from Section 1 that a linear rank statistic is expressible in the form
N
SN = ~ cN(i)aN(RNI) (7.1)
i=1
Assume also that the scores an(i) are generated by a s ( i ) = ch(i/(N + 1)), i =
1. . . . . N, where ~b is a squared integrable score function on the unit interval [0,1].
Then,
L
(SN - P~N)/O'N~ N(0, 1).
REMARK. In the above theorem, one could also take an(i) = E4~(UNi), i =
1 . . . . . N, where UN1 ~< ~< UNN are the order statistics in a random sample of
size N from the uniform [0, 1] distribution.
In the case an(i) = E~b(UNi) where f01 kb(u)[ du < o0, there is an alternate way
of proving (7.3). Let o%N denote the o.-algebra generated by R N I , . . . , Run.
Then, Sen and Ghosh (1972) have shown that {(SN, fin); N / > 1} is a martingale
sequence. The result can now be proved by appealing to some suitable
martingale central limit theorem (see, for example, Mcleish, 1974). The mar-
tingale idea has been fruitfully exploited in obtaining functional central limit
theorems for linear rank statistics (see Section 9).
In the two-sample case, Chernoff and Savage (1958) obtained in their
pioneering paper the asymptotic nonnull distribution of SN under nonlocal
alternatives. For local alternatives, Hfijek (1962) employed the idea of 'con-
tinguity' of probability measures, and obtained some useful limit theorems.
Finally, Hfijek (1968) obtained the limit distribution of SN under nonlocal
alternatives. His main results are given below.
THEOREM 11. Consider the statistic SN given in (7.1) where the score function
an(i) is given either by q~(i/(N + 1)) or Erp(UNI) where qb has a bounded second
derivative. Assume that
Then
L
(Su - ESu )/V'/2(SN ) --+ N (0, 1). (7.5)
The assertion remains true, if in (7.4) and (7.5), we replace V(SN) by o'~ =
E/N=1 V[lNi(Xi)], where
N
INi(X) = N -1 Z (CN(j)- Clv(i)) f [Ib,>.xI - F~Cv)14,'[~(y)ld F j ( y ) ,
j=l
i=1 ..... N,
The next theorem of Hfijek (1968) does not require the assumption of
boundedness of the second derivative of ~b.
THEOREM 12. Let ~(u)---&l(u)-~b2(u), 0 < u < 1, where the ~bi(u)'s are non-
decreasing square integrable and absolutely continuous inside (0, 1), i = 1, 2.
Assume that
Then (7.5) holds. Also, in (7.5) and (7.6), V ( Su ) can be replaced by o'2Nas defined in
Theorem 11.
Then, if aN(i) = Eqb(UNi), 1 <- i <- N, the conclusion of Theorem 12 remains valid
with E(SN) replaced by tz~. If aN(i) = dp(i/(N + 1)), 1 <- i <~N, then the conclusion
Rank statistics and limit theorems 163
For one sample signed rank statistics STy,analogues of Theorems 11 and 12 have
been obtained by Huskovfi (1970). Pyke and Shorack (1968) used an alternative
approach in obtaining the limit distributions of linear rank statistics in the two
sample case.
Next, in this section, we consider rates of convergence to normality for the
simple linear rank statistics Ss. Results in this direction were obtained by
Jureckovfi and Puri (1975), and later by Huskovfi (1977) and Bergstrom and
Purl (1977). The method of proof consists in approximating the simple linear
rank statistic by a sum of independent random varialzles, and establishing for
arbitrary r, a suitable bound on the rth moment of the error of approximation.
The following assumptions are made.
I. The scores are generated by a function ~b(u), 0 ~ u ~< 1, in either of the
following two ways:
/ SN - ESN
sup P [ Vff~(~N) ~< x ] - qb(x) = O(N-1/2+c). (7.9)
x
The assertion (7.9) remains true if V(SN) is replaced by try, where tr2N is defined
in Theorem 12. Both the assertions remain true with E(SN) replaced by tXl~, where
IZlV is defined in Theorem 13.
REMARK. Edgeworth expansions for linear rank statistics have been obtained
by Albers, Bickel and van Zwet (1976), and Bickel and van Zwet (1978).
164 Malay Ghosh
Sen (1970) obtained strong law of large numbers for statistics of the form S~
(introduced in (1.2)) when the ci's are all equal to 1 and an(i) = ( i / ( N + 1)),
1 ~< i ~< N. The ideas of Sen can be extended to more general regression rank
statistics of the type S~, provided the c/s satisfy certain uniform asymptotic
negligibility conditions. Sen and Ghosh (1972) have a result to this effect.
To state the result of Sen and Ghosh (1972), let
N
Hi(x) = F i ( x ) - F / ( - x ) and /-~(x) = N -1 ~ Hi(x).
i=1
Define
C20 = 2 c2(i) and A20 = n -1 2 {*(i/(n + 1))}2 ,
i=l i=l
N
SN = N - m A ~ 1 ~ C-Nisgn X i * ( R ~ i / ( N + 1)),
i=1
THEOREM 15. Suppose that fd [(u)[' du <o~ for some r > 2 . A s s u m e that
maxi~i~N[~Ni[ = Then, ~ - ~ N - > 0 a . s . a s S - ~ ~.
O(N-1/2).
THEOREM 16. Suppose that the score function aN(i) is the same as in Theorem
15 with f~ [(u)l r du < ~ for some r > 2. Assume that maxl~i~N]C~i[ = O(N-m),
where c ~,i = (ci - eN)/{E?=~ (c~- eN)2}~/2. D e f i n e
Consider once again the statistics S~ = E~=l c(i)E[ch(UNRN)] where UN1 << -
"'" <~ UNN are the order statistics in a random sample of size N from the
uniform (0, 1) distribution. Assume that the score function ~b is squa~,e in-
tegrable, and define 4~ = fo1 qS(u) du and A 2 = fd 4~2(U)du - ,~2. Defining ~ 2 =
/~rl /N=I [as(i) - aN] 2, where aN(i) = E d p ( U s i ) , ! <~ i <- N, one gets the inequ'ality
A 2 <~ A 2. In fact (see Sen, 1981, pp. 92-93), using the dominated convergence
theorem one can show that A 2 ~ A 2 as N---~ o0. Write C 2 = E~V=l( c N ( i ) - i t s ) 2.
For every N 1> 1, consider the stochastic process
YN (t) : S%(t)/(CNAN), T0(t) = max{k: C2k <~ tC2N}, t E (0.1).
(9.1)
(Conventionally, let So = S1 -- 0 so that YN(0) = 0). Then YN belongs to D[0, 1]
for every N >~ 1. The following theorem, proved in Sen (1981) is an improved
version of a corresponding result of Sen and Ghosh (1972).
THEOREM 17. Suppose fd (a2(u) du < ~. Then, under F1 - " =- F s and under
(7.3), d
YN --> W as N---~ ~
Theorems 17 and 18 relate to the null situation, namely, when F1 -=" ~ Fs.
For Contiguous alternatives, functional central limit theorems of the above
type were obtained by Sen and are reported very extensively in his recent
book (see Sen, 1981, pp. 98-104). Functional central limit theorems for linear
rank statistics under nonlocal alternatives is an important open question.
Next, consider the one sample case where Z1 . . . . . Z s are lid with a common
continuous distribution function F. Consider signed rank statistics of the form
(1.2) with c(i) = 1 for all i, a s ( i ) = E[th(UN,)], 1 <~ i ~<N. Consider the hypothesis
of sign invariance, namely, F is symmetric about zero, i.e. F ( x ) + F ( - x ) = 1 for
all x > 0 . Let R~v,= rank of [Xll . . . . . IXs], l~<i~<N. Let ~ s denote the
o'-algebra generated by (sgnX1 . . . . . s g n X s ) and (R~vl. . . . . R~s). Sen and
166 Malay Ghosh
Ghosh (1971) have shown that if f l I (u)l du < oo, then {(S~v, ~N), N / > 1} is a
martingale sequence. Using this martingale structure, weak and strong in-
variance principles (functional central limit theorems) for one sample signed
rank statistics can be established. The results are discussed in detail in Sen
(1981). In what follows, we present a few of the important results.
Once again, let A 2 = N -1 E/u=1 [ a n ( i ) - aN] 2. Define ~'0(t) = max{k: k / N <~ t},
0 < t < l. Let Y ~ ( t ) = S~,(o/(N1/2Au), Y~(0) = 0. The following theorem is
proved in Sen and Ghosh (1973).
An analogous result for the tail sequence Z N = {ZN(t), 0 < t < 1} is proved in
Sen (1981). Define To(t)=-min{k:N/k<~t}, and Z N ( t ) = ~ / N - -S ~ +o ( j ( A N~'0(t)).
Then the following theorem is true.
d
THEOREM 20. Under the same assumptions as in Theorem 19, Zu--> W as
N--+oo in the Ja-topology on D[0, 1].
The above theorem provides a functional central limit theorem for signed
rank statistics under the null situation. Results similar to T h e o r e m s 19 and 20
are proved in Sen (1981) under contiguous alternatives.
I. There exist score functions ffj(u) (19< u < 1), j = 1 . . . . , p such that
~bj(u)= ~bjl(u)-~bjz(U), where ~bjk(u) is nondecreasing, absolutely continuous
and square integrable inside (0, 1), k = 1, 2.
II. For the aj(u) defined in (10.8), maxa~<j<~pf l {aOj(u)_ ~bj(u)}2 du--> 0 as
n ----->oo.
III. so, -- maxl~<i~<n(ci - c,)'Cnl(ci - ?,,)"-->0 as n--> ~.
The main theorem of Sen (1983) is as follows.
Let (X1, Y1). . . . , (X~, Y,) be a random sample from a continuous bivariate
distribution with distribution function F ( x , y ) . The problem is to construct
nonparametric distribution-free tests of Ho: F(x, y) = Fl(X)F2(y), for every pair
(x, y) against Hi: F(x, y) # FI(x)F2(y) for at least one pair (x, y), where F1 and
F2 denote the marginal distribution functions of the X / s and the Y/s respec-
tively. Thus, the null hypothesis of interest is that the X and Y variables are
independent.
Let Ri denote the rank of X/ among X1 . . . . . X,, and let Q / d e n o t e the rank
of Y/ among Y1. . . . . Y~. In addition, let a(1) ~<-.. ~< a(n) with a(1) # a(n),
and c(1)<~... ~< c(n) with c ( 1 ) # c(n) be two sets of scores. Let
(11.1)
with E n - 1 ~in_-i Ci and a n -1 ~=1 al denote the correlation coefficient for the
= =
group of scored rank pairs (c(Ri), a(Q1)) . . . . . (c(R,), a(Q~)). The special case
a(i) = i, i = 1 . . . . . n corresponds to the Spearman rank correlation coefficient.
Note that 7". has the alternate representation
n tl
= O(n-1), (11.3)
References
Albers, W., Bickel, P. J. and van Zwet, W. R. (1976). Asymptotic expansions for the power of
distribution free tests in the one-sample problem. Ann. Statist. 4, 108-156.
Arvesen, J. N. (1969). Jackknifing U-statistics. Ann. Math. Statist. 40, 2076-2100.
Athreya, K. B., (3hosh, M., Low, L. and Sen, P. K. (1984). Laws of large numbers for bootstrapped
means and U-statistics. J. Stat. Planning Inference 9, 185-194.
Berk, R. H. (1966). Limiting behavior of posterior distributions when the model is incorrect. Ann.
Math. Statist. 37, 51-58.
Bergstrfm, H. and Puri, M. L. (1977). Convergence and remainder terms in linear rank statistics.
Ann. Statist. 5, 671-680.
Bickel, P. J. (1974). Edgeworth expansions in nonparametric statistics. Ann. Statist. 2, 1-20.
Bickel, P. J. and Freedman, D. A. (1981). Some asymptotic theory for the bootstrap. Ann. Statist.
9, 1196-1217.
Bickel, P. J. and van Zwet, W. R. (1978). Asymptotic expansions for the power of distribution free
tests in the two sample problem. Ann. Statist. 6, 937-1004.
Billingsley, P. (1968). Convergence of Probability Measures. Wiley, New York.
Bhuchongkul, S. (1964). A class of nonparametric tests for independence in bivariate populations.
Ann. Math. Statist. 35, 138--149.
Borovskikh, Yu. V. (1979). Approximation of U-statistics distribution (in Ukrainian). Proc.
Ukrainian Acad~ Sci. A 9, 695-698..
Callaert, H. and Janssen, P. (1978). 1.,)e Berry-Esseen theorem for U-statistics. Ann. Statist. 6,
417-421.
Callaert, H., Janssen, P. and Veraverbeke, N. (1980). An Edgeworth expansion for U-statistics.
Ann. Statist. 8, 299-312.
Callaert, H. and Veraverbeke, N. (1981). The order of the normal approximation for a studentized
U-statistic. Ann. Statist. 9, 194-200.
Chan, Y. K. and Wierman, J. (1977). On the Berry-Esseen Theorem for U-statistics. Ann. Prob. 5,
136-139.
Chatterjee, S. K. and Sen, P. K. (1964). Nonparametric tests for bivariate two sample location
problem. Cal. Statist. Assoc. Bull. 13, 18-58.
170 Malay Ghosh
Chernoff, H. and Savage, I. R. (1958). Asymptotic normality and efficiency of certain non-
parametric test statistics. Ann. Math. Statist. 29, 972-994.
Dwass, M. (1956). The large sample power of rank tests in two sample problems. Ann. Math.
Statist. 27, 352-374.
Efron, B. (1979). Bootstrap methods: another look at the jackknife. Ann. Statist. 7, 1-26.
Efron, B. and Stein, C. (1981) The jackknife estimate of variance. Ann. Statist. 9, 586-596.
Ghosh, M. and Dasgupta, R. (1982). Berry-Esseen theorems for U-statistics in the non iid case. In:
Proc. of the Conf. on Nonparametric Inference, organized by the Janos Bolyai Math. Soc., held in
Budapest, Hungary, pp. 293-313.
Ghosh, M. and Sen, P. K. (1970). On the almost sure convergence of von Mises' differentiable
statistical functions. Cal. Statist. Assoc. Bull. 19, 41--44.
Grams, W. F. and Serfling, R. J. (1973). Convergence rates for U-statistics and related statistics.
Ann. Statist. 1, 153-160.
Gregory, G. G. (1977). Large sample theory for U-statistics and tests of fit. Ann. Statist. 5, 110-123.
H~ijek, J. (1961). Some extensions of the Wald-Wolfowitz-Noether Theorem. Ann. Math. Statist.
32, 506-523.
H~ijek, J. (1962). Asymptotically most powerful rank tests. Ann. Math. Statist. 33, 1124-1147.
H~ijek, J. (1974). Asymptotic sufficiency of the vector of ranks in the Bahadur sense. Ann. Statist. 2,
75--83.
l-I~ijek, J. (1968). Asymptotic normality of simple linear rank statistics under alternatives. Ann.
Math. Statist. 39, 325-346.
H~ijek, J. and Sidak, Z. (1967). Theory of Rank Tests. Academic Press, New York.
Hall, P. (1979). On the invariance principle for U-statistics. Stoch. Proc. and Appl. 9, 163-174.
Hoeffding, W. (1948). A class of statistics with asymptotically normal distribution. Ann. Math.
Statist. 19, 293-325.
Hoeffding, W. (1951). A combinatorial central limit theorem. Ann. Math. Statist. 22, 558-566.
Hoeffding, W. (1952). The large sample power of tests based on permutations of observations.
Ann. Math. Statist. 23, 169-192.
Hoeffding, W. (1961). The strong law of large numbers for U-statistics. Inst. Stat. Univer. North
Carolina Mimeo Series No. 302.
Hoeffding, W. (1973). On the centering of a simple linear rank statistic. Ann. Statist. 1, 54-66.
Huskov~, M. (1970). Asymptotic distribution of simple linear rank statistic for testing symmetry. Z.
Wahrsch. Verw. Geb. 12, 308-322.
Huskov~i, M. (1977). The rate of convergence of simple linear rank statistics under hypothesis and
alternatives. Ann. Statist. 5, 658--670.
Janssen, P. (1978). De Berry-Esseen stellingen: Een asymptotische ontwikkeling voor U-statis-
tieken. Unpublished Ph.D. dissertation, Belgium.
Jureckov~i, J. and Puri, M. L. (1975). Order of normal approximation for rank test statistic
distribution. Ann. Probab. 3, 526-533.
Karlin, S. and Rinott, Y. (1982). Applications of ANOVA type decompositions for comparisons of
conditional variance statistics including jackknife estimates. Ann. Statist. 10, 485-501.
Lehmann, E. L. (1951). Consistency and unbiasedness of certain nonparametric tests. Ann. Statist.
22, 165-179.
Lehmann, E. L. (1963). Robust estimation in analysis of variance. Ann. Math. Statist. 34, 957-966.
Loynes, R. M. (1970). An invariance principle for reversed martingales. Proc. Amer. Math. Soc. 25,
56--64.
Loynes, R. M. (1978). On the weak convergence of U-statistics processes and of the empirical
processes. Proc. Camb. Phil. Soc. (Math.) 83, 269-272.
Mann, H. B. and Whitney, D. R. (1947). On a test of whether one of two random variables is
stochastically larger than the other. Ann. Math. Statist. 18, 50-60.
McLeish, D. (1974). Dependent central limit theorems and invariance principles. Ann. Probab. 2,
620--628.
Miller, R. G., Jr. and Sen, P. K. (1972). Weak convergence of U-statistics and von Mises'
differentiable statistical functions. Ann. Math. Statist. 43, 31--41.
Nandi, H. K. and Sen, P. K. (1963). On the properties of U-statistics when the observations are not
Rank statistics and limit theorems 171
independent. Part II: Unbiased estimation of the parameters of a finite population. Cal. Statist.
Assoc. Bull. 12, 125-143.
Neuhaus, G. (1977). Functional limit theorems for U-statistics in the degenerate case. J. Mult. Anal.
7, 424-439.
Noether, G. E. (1949). On a theorem of Wald and Wolfowitz. Ann. Math. Statist. 20, 455-458.
Puri, M. L. and Sen, P. K. (1966). On a class of multivariate multisample rank order tests. Sankhygt
Ser. A 28, 353-376.
Puri, M. L. and Sen, P. K. (1969). A class of rank order tests for a general linear hypothesis. Ann.
Math. Statist. 40, 1325-1343.
Puri, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis. Wiley, New
York.
Puri, M. L., Sen, P. K. and Gokhale, D. V. (1970). On a class of rank order tests for independence
in multivariate distributions. Sankhy~ Set. A 32, 271-297.
Pyke, R. and Shorack, G. R. (1968). Weak convergence of a two-sample empirical process, and a
new approach to Chernoff-Savage theorems. Ann. Math. Statist. 39, 755-771.
Quenouille, M. H. (1956). Note on bias in estimation. Biometrika 43, 353-360.
Randles, R. H. and Wolfe, D. A. (1979). Introduction to the Theory of Nonparametric Statistics.
Wiley, New York.
Sen, P. K. (1960). On some convergence properties of U-statistics. Cal. Statist. Assoc. Bull. 10,
1-18.
Sen, P. K. (1963). On the properties of U-statistics when the observations are not independent. Part
I: Estimation of the non-serial parameters of a stationary properties. Cal. Statist. Assoc. Bull. 12,
69-92.
Sen, P. K. (1965). Some nonparametric tests for m-dependent time series. J. Amer. Statist. Assoc.
(ill, 134-147.
Sen, P. K. (1970). On some convergence properties of one-sample rank order statistics. Ann. Math.
Statist. 41, 2206-2209.
Sen, P. K. (1974). Weak convergence of generalized U-statistics. Ann. Probab. 2, 90-102.
Sen, P. K. (1977a). Some invariance principles relating to jackknifing, and their role in sequential
analysis. Ann. Statist. 5, 315-329.
Sen, P. K. (1977b). Almost sure convergence of generalized U-statistics. Ann. Prob. 5, 287-290.
Sen, P. K. (1981). Sequential Nonparametrics: Invariance Principles and Statistical Inference. Wiley,
New York.
Sen, P. K. (1983). On permutational central limit theorems for general multivariate linear rank
statistics. Sankhyft Ser. A 45, 141-149.
Sen, P. K. and Bhattacharyya, B. (1977). Weak convergence of the Rao-Blackwell estimator of a
distribution function. Ann. Probab. 5, 500-510.
Sen, P. K. and Ghosh, M. (1971). On bounded length sequential confidence intervals based on
one-sample rank order statistics. Ann. Math. Statist. 42, 189-203.
Sen, P. K. and Ghosh, M. (1972). On strong convergence of regression rank statistics. Sankhygt Set.
A 34, 335-348.
Sen, P. K. and Ghosh, M. (1973). A law of iterated logarithm for one-sample rank order statistics
and some applications. Ann. Statist. 1, 568-576.
Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics. Wiley, New York.
Sproule, R. N. (1969). A sequential fixed width confidence interval for the mean of a U-statistic.
Unpublished Ph.D. dissertation. UNC, Chapel Hill.
Sproule, R. N. (1974). Asymptotic properties of U-statistics. Trans. Amer. Math. Soc. 199, 55~o4.
Tukey, J. W. (1949). The simplest signed rank tests. Princeton Univ. Star. Res. Group Memo. Report
No. 17.
Tukey, J. W. (1957). Variances of variance components II: The unbalanced single classification.
Ann. Math. Statist. 28, 43-56.
Wald, A. and Wolfowitz, J. (1944). Statistical tests based on the permutations of the observations. ~.
Ann. Math. Statist. 15, 358-372.
Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics 1, 80-83.
P. R. Krishnaiah and P.K. Sen, eds., Handbook of Statistics, Vol. 4 (~
Elsevier Science Publishers (1984) 173-184 27
Kesar Singh
proaches are one and the same. A third approach to a nonlocal comparison,
which is almost always much harder to work with, is again based on N e y m a n -
Pearsonian view point (see Hodges and Lehmann (1956) for example). H e r e
one keeps the alternative 0 fixed and compares the convergence rates of
powers to 1. Yet another notion of nonlocal efficiency is due to Chernoff (1952)
which looks at the exponential decay rate of the minimum value of a linear
combination of the two types of ecrors.
One of the c o m m o n criticisms of P.E. is that to define it one has to work
with a specific kind of sequence of alternatives converging to the null. Indeed,
the requirement that the limiting power is not 1 dictates the choice of
alternative and this is a pure mathematical fact. However, as we will see in the
next section the current definitions of P.E. do not m a k e use of any specific
alternative sequence. A significant advantage in working with P.E. is that
almost always it is computationally more elementary as one usually requires
some weak limit to compute it. It seems worthwhile to make a c o m m e n t here
on nonlocal comparisons. Typically, the performance of a test when the
alternative is far away from the null approaches to the highest limit very
rapidly as the sample size gets big. As a result, a large sample comparison of
two tests for a distant alternative seems like a comparison of two practically
perfect objects. However, this c o m m e n t does not discredit the nonlocal
efficiencies very much since a nonlocal efficiency is defined for every point in
the nonnull space including the ones which are close to the null. It probably
does suggest that while using a nonlocal efficiency one should attach more
importance to the alternatives which are closer to the null.
2. Pitman efficiency
We present in this section a modern definition of P.E. and some very general
tools which can help one investigate its existence and evaluate it in a given
instance. The development is mainly due to Rothe (1981) though the definition
in the present form appears in Wiend (1976) too.
Assume that X1, X 2 , . . . are i.i.d, observations from a probability space
(O, S, P0) where 0 is an unknown p a r a m e t e r taking values in a metric space A.
Consider the testing problem 0 = 00 vs. 0 # 00 where 00 is an accumulation
point in the metric space. Let {T1,} and {T2,} be two competing test statistics
for this problem. Let us fix the sizes of all the tests at a level a E (0, 1). We
define, for a 0 # O0 and/3 E (0, 1),
NI(O, a, fl) = min{m : Power of T1, at 0 is I>/3 for all n/> m}.
N2(O, 8)
e12(ol,/3) = lim NI(0, o~,/3)
0-*0 0
THEOREM (Rothe). Suppose that 7"1, and Tzn are both standard. If
limo-,oogl(O)/g2(O) exists, then e12(a,/3) exists and is given by
More generally,
-1
= l msup
large sample efficiency. Also, one may reasonably argue that an asymptotic
efficiency shouldn't be based on a mediocre performance. Regarding the
second point, it seems easy to extend the definition in the case of a nonsimple
null by defining appropriate lower and upper efficiencies. But the question
remains: When do they coincide? Apparently there hasn't been a satisfactory
study on P.E. in nonnull situations.
Some of the earlier contributions on P.E. often referred to are Pitman
(1949), Noether (1955), Fraser (1957) and Olsen (1967). The reader may see
Noether (1955) for a generalized version of the original definition due to
Pitman.
There are numerous interesting results on various computations and bounds
on P.E. For instance, Hodges and Lehman (1956) showed, among other things,
that the P.E. of signed rank test relative to t-test in the one-sample location
problem is/>0.864 within the class of all symmetric continuous populations. A
similar bound for sign test is ~. Details on P.E. for various nonparametric
statistics can be found in almost any text on nonparametric inference.
3. Bahadur's slopes
The level attained L(_T.) of a test statistic T,, whose large values are
significant, is defined as F,(T,) where
A0 being the null space. The level attained is also popularly known as the
p-value. A p-value is stochastically at least as large as a U[0, 1] random
variable under the null. Thus the size of the test which reject itt L(T,)<~ a is
bounded by a. If T. has a continuous null distribution, as is the case in many
practical examples, the distribution of L(T,) is exactly U[0, 1]. It is desirable to
have a low L(T,) if a nonnull 0 prevails. Since the exact distribution of L(T,),
under a nonnull 0 is often intractable, though there are a couple of papers
having to do with the exact distribution, one has to resort to an asymptotic
measure of its smallness. Under a nonnull 0, L(T,) can be typically expressed
as follows:
For two test statistics T1, and Tz, with slopes c1(0) and c2(0), if
K(O, 00) =
f [log r(x)l dPe; r(O) = dPoo
de0
is assumed to exist for all 0 E A0. In its present form, the result is due to
Raghawachari (1970), though various weaker versions of it appeared in the
earlier papers of Bahadur. Under conditions, likelihood ratio tests are known
to be Bahadur optimal (see Bahadur, 1965; Bahadur and Raghawachari, 1970;
Bahadur, 1971). In many cases it is much simpler to check directly that the
slope of a certain statistic equals the minimum Kullback Leibler distance, than
going through the regularity conditions of a general theorem. In particular this
comment applies to the t-statistic and the )(2 statistic in the N(O, 0 "2) situation.
Another theorem of Raghawachari (1970) states that n-llogL(T,)--->-c(O) in
probability under a nonnull 0 iff n-llog a, ~ -c(O) where a , is the size of a test
based on T, having a fixed power in (0, 1). This theorem unifies Bahadur's and
Cochran's efficiencies. A weaker version of this result is contained in Bahadur
(1967). Kalenberg (1980) has extended Raghawachari's equivalence result to a
'second order' level.
178 Kesar Singh
ax 2 .. ]
1 - F ( x ) = exp - ~ (1 + o(1)) as x ~ ~ .
(2) There is a continuous function b(O) such that, under 0, the sequence
X/n[T, - b(0)] is tight uniformly in 0 belonging to a neighborhood of 00, i.e. for
a given e > 0, there exist positive K and No such that
Pe(X/n IT, - b(0)l > K ) < e for all 0 ~ (00- e, 00+ e) and N ~> No.
THEOREM (Wiend). If T1, and T2, both satisfy condition A and P.E. e12(a,/3)
exists then
where E12(0 ) is the relative (approx.) Bahadur efficiency which exists under A.
Wiend used this relationship for computing some previously unknown P.E.
Asymptotic comparison of tests- A review 179
There do not seem to be any further results available on this line. A more
general study on this phenomenon is felt desirable.
by many (though not all) LR tests in the nonsimple null cases including the
t-test f o r / z and the xZ-test for o- of the N ~ , o-2) population.
Thus we already have quite many notions of deficiency. It is felt desirable to
unify various apparently different notions of deficiencies and examine
numerically how closely the limiting deficiencies approximate the finite sample
exact values.
Let 7"1,1, T2,~. . . . . Tk. k be k test statistics available for the same testing
problem, based on k independent samples of sizes nl, n2 . . . . . nk. Assume that
large values are significant for each of the statistics. Let g g ( T l n I . . . . . Tknk) be =
k
L(g) ~> H L,(T/~,)
i=1
and this implies that the slope of g is ~<E] h;ci(0) where hi = lim,_,=nJE~ni is
assumed to exist; c~(O) is the slope of T/,i at 0. The above simple but important
facts were noticed first by Littell and Folks (1973) The bound on the slope of g
gives rise to a notion of first order optimality (f.o.o.) of combined tests In
particular, Fisher's combination
k
T ( F ) = - 2 ~ log L(T~.,)
1
is f.o.o. Later, it was pointed out by Berk and Cohen (1979) that in fact
k
TG(~, 01, 0 / 2 , ' ' ,
ak) = E F -f l1, o t i (1 - L(T/,))
1
for any Ogl,o~2,. , oLk > 0 is f.o.o, where Ft3,~ denotes the d.f. of the gamma
distribution with parameters/3, a. A higher order asymptotic study on combin-
ing independent tests was carried out in Cohen, Marden and Singh (1982)
(CMS). It turned out that the log p-values of To(fl, al, a2 . . . . . ak) agree up to
three leading asymptotic terms and the comparable terms, which are Op(1),
depends upon a ' s as well as ci(0)'s. Surprisingly enough, it was found that the
normal score combination
k k
TN = ~ (nJn)'/2cI)-l(1 - L(T~.)), n = ~'~ n~,
l 1
182 Kesar Singh
beats any member of TG at the third asymptotic term if the slopes ci's are
equal. The combination TN is not even f.o.o, if the slopes are not all equal.
Thus an asymptotic discrimination between the members of T6 is still an open
problem. CMS also contains an unresolved conjecture regarding lower bounds
for L(g).
How well do Pitman and Bahadur efficiencies approximate the actual relative
efficiency given as N2(O, a, fl)/Nx(O, a,/3) ? Presently, there does not seem to
exist adequate literature on this question. Recently, there has been an interes-
ting study on this issue by G r o n e n b o o m and Oosterhoff (1981) (GO), which
reported some numerical findings demanding general mathematical explana-
tions. It seems worthwhile to include here a few highlights of this paper.
This paper looked at Bahadur efficiency as lim~_~N2(0, a,/3)/N1(0, a,/3) (the
same definition of Bahadur efficiency is given in Kalenberg (1978) too). The
definition of N~(0, ct,/3) in G O is slightly different from the one given in this
review, though most of the numerical results reported in G O remain the same
under the two definitions. The paper considers few specific testing problems
and investigates numerically how well the asymptotic efficiencies and deficien-
cies (in the sense of Pitman and Bahadur) approximate the corresponding exact
quantities N2(0, a,/3)/Nl(O, ct,/3) and N2(0, a,/3)- NI(O, a,/3). The first exam-
ple is about the normal mean 0 when the dispersion matrix is known to be
I. The L R test for the hypothesis 0 <~ 0 vs. 0 < 0 has been compared with
the most powerful test for a fixed alternative. In this comparison, Bahadur
efficiency equals 1. The exact relative efficiency, which turns out to be the same
as Pitman efficiency in this example, was found to be much smaller than 1
(something close to ) for many reasonable (0, a,/3), showing the inadequacy of
Bahadur efficiency in this case. The deficiency expansions; as a ], 0, provided a
fair approximation to the actual deficiencies. The second example studied is the
comparison of the Z-test with the t-test for the univariate normal populations.
H e r e Pitman efficiency is 1 whereas Bahadur efficiency is 0-2log(1 + 02), 0 being
the alternative. According to Table 3 of GO, Pitman efficiency approximates
the actual relative efficiencies very well when 0 = 0.25 (the null 0 equals zero),
though it badly overestimates when 0 = 0.5 and 1.0. Bahadur efficiency is in
good agreement with reality when /3 = 0.5, though it is not so when /3 = 0.9.
One term Pitman deficiency expansion does an excellent job of estimating the
actual (moderate sample) deficiencies. The other two examples studied in G O
are (i) comparison of the sign test and the median test for the location
parameter of the double exponential family; (ii) comparison of the LR test,
Hotelling's test, Roy's test, and Pillai's test for testing multivariate linear
hypotheses on the mean of a multivariate normal population. In general,
Bahadur efficiency seems to display a somewhat haphazard behavior in terms
of approximating moderate sample realities. On the other hand, Pitman
Asymptotic comparison of tests - A review 183
References
Albers, W. (1974). Asymptotic expansions and the deficiency concept of statistics. Mathematical Centre
Tracts, 58, Amsterdam.
Albers, W., Bickel, P. J. and van Zwet, W. R. (1976). Asymptotic expansions for the power of
distribution free tests in the one sample problem. Ann. Statist. 4, 108-156.
Bahadur, R. R. (1960). Stochastic comparison of tests. Ann. Math. Stat. 31, 276-295.
Bahadur, R. R. (1965). An optimal property of the likelihood ratio statistic. In: Proc. Fifth Berkeley
Syrup.
Bahadur, R. R. (1967). Rates of convergence of estimates and test statistics. Ann. Math. Stat. 38,
303-324.
Bahadur, R. R. (1971). Some Limit Theorems in Statistics. SIAM, Philadelphia.
Bahadur, R. R. and Raghawachari, M. (1970). Some asymptotic properties of likelihood ratios on
general sample spaces. In: Proc. Sixth Berkeley Syrup.
Bahadur, R. R., Chandra, T. K. and Lambert, D. (1982). Some further properties of likelihood ratios on
general sample spaces. Preprint.
Berk, R. H. (1982). Stochastic bounds for attained levels. Preprint.
Berk, R. H. and Cohen, A. (1979). Asymptotically optimal methods of combining tests. Jr. Amer.
Statist. Assoc. 74, 812-814.
Bickel, P. J., Chibisov, D. M. and van Zwet, W. R. (1981). On efficiency of first and second order.
lntern. Statist, Rev. 49, 169-175.
Bickel, P. J. and van Zwet, W. R. (1978). Asymptotic expansions for the power of distribution free tests
in the two sample problem. Ann. Stat. 6, 937-1004.
Brown, L. D. (1971). Non-local asymptotic optimality of appropriate likelihood ratio tests. Ann. Math.
Star. 42, 1206-1240.
Chandra, T. K. and Ghosh, J. K. (1978). Comparison of tests with same Bahadur efficiency. Sankhya
Ser. A 40, 253-277.
Chandra, T. K. and Ghosh, J. K. (1980). Deficiency for multiparameter testing problem. Sixtieth
Birthday Volume of C. R. Rao.
Chernoff, H. (1952). A measure of asymptotic efficiency for tests of a hypothesis based on the sum of
observations. Ann. Math. Stat. 23, 493-507.
Cochran, W. G. (1952). The x2-goodness of fit. Ann. Math. Stat. 23, 493-507.
Cohen, A., Marden, J., and Singh, K. (1982). Second order asymptotic and nonasymptotic optimality
properties of combined tests. J. Statist. Planning Infer. 6, 253-276.
Fraser, D. A. S. (1957). Nonparametric Methods in Statistics. Wiley, New York.
Gronenboom, P. and Oosterhott, J. (1981). Bahadur efficiency and small sample efficiency. Intern.
Statist. Rev. 49, 127-141.
Hodges, J. L., Jr. and Lehmann, E. L. (1956). On efficiency of some nonparametric competitors of the
t-test. Ann. Math. Statist.
Hodges, J. L., Jr. and Lehmann, E. L. (1970). Deficiency. Ann. Math. Stat., 41, 783-801.
Hoeffding, W. (1965). Asymptotically optimal tests for multinomial distributions (with discussions).
Ann. Math. Stat. 36, 369-408.
Kalenberg, W. C. M. (1978). Asymptotic Optimality of Likelihood Ratio Tests in Exponential Families.
Mathematical Centre Tracts 77, Amsterdam.
184 Kesar Singh
Kalenberg, W. C. M. (1981). Bahadur deficiency of likelihood ratio tests in exponential families. Jour.
Multi. Analy. 11, 506--531.
Kalenberg, W. C. M. (1981). Relations between Bahadur deficiencies and attained levels. Preprint.
Lambert, D. and Hall, W. J. (1982). Asymptotic log normality of p-values. Ann. Statist. 10, 44-64.
Littell, R. C. and Folks, J. L. (1973). Asymptotic optimality of Fisher's method of combining tests II. J.
Amer. Statist. Assoc. 68, 193-194.
Noether, G. E. (1955). On a theorem by Pitman. Ann. Math. Stat. 26, 64-68.
Olshen, R. A. (1967). Sign and Wilcoxon tests for linearity. Ann. Math. Star. 38, 1763-1769.
Pfanzagl, J. (1979). First order efficiency implies second order efficiency. In: J. Jureckova, ed.,
Contributions to Statist. [J. Hajek Memorial Volume] Academia, Prague, pp. 167-196.
Pitman, E. J. G. (1949). Lecture notes on nonparametric statistical inference. Columbia University
Publ.
Raghawachari, M. (1970). On a theorem of Bahadur on the rate of convergence of test statistics. Ann.
Math. Statist. 41, 1695-1699.
Rothe, G. (1981). Some properties of the asymptotic relative Pitman efficiency. Ann. Statist. 9,
663-669.
Rubin, H. and Sethuraman, J. (1965). Bayes risk efficiency. Sankhya Ser. A 27, 325-346.
van Eeden, C. (1963). The relation between Pitman's asymptotic relative efficiency of two tests and the
correlation coefficient between their test statistics. Ann. Math. Star. 34, 1442-1451.
Wiend, H. S. (1976). A condition under which the Pitman and Bahadur approaches to efficiency
coincide. Ann. Statist. 4, 1003-1011.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1()
dl_~/
O Elsevier Science Publishers (1984) 185-228
Dana Quade
1. Introduction
This Chapter is concerned with the two-way layout, in which there are n >t 2
blocks of random observations, and each observation is given one of m / > 2
treatments. Let Y0k denote the k-th observation given the j-th treatment in the
i-th block, where k = 1. . . . . lij; write li = Ej l~j for the total n u m b e r of obser-
vations in the i-th block. All the analyses which we consider involve the
fundamental.
The null hypothesis of interest, stated somewhat vaguely, is that the treat-
ments m a k e no difference, or that each observation would have had exactly the
same value if given any other treatment. A more mathematical statement is
H0: Observations within a block are interchangeable (an equivalent term is
exchangeable), meaning that for each i the li! permutations of observations
within the i-th block are all equally likely.
This may be tested by straightforward application of Fisher's randomization
principle. Let T be any test criterion chosen (perhaps intuitively) so that large
values should lead to rejection. G e n e r a t e each of the N = lI(li! ) hypothetical
datasets which can be obtained by permuting observations within blocks,
calculate T from each such dataset, and count how many of them (say, N * )
yield values of T as large as that actually observed. Then the exact level of
significance corresponding to T is the P-value P = N * / N . It is obvious that a
randomization test such as just described is not generally feasible unless N is
very small, since the N values of T must be ~alculated anew for every observed
dataset. If an approximate P-value is sufficient, then an alternative approach
may be used: generate a random sample of M of the N datasets, count how
many of them (say, M * ) yield values of T as large as that actually observed,
and set /5 = M * / M . Nevertheless, extensive computations are still required to
obtain reasonable precision. A great simplification can be obtained, however,
185
186 Dana Ouade
li + 1 ~.~ t~j
Rqk = T + ~_~ ~'~ sgn(Yqk-- Yq'k')
j'=l k'=l
for all i, j and k. The ranks within the i-th block are of course some
permutation of the fixed set of integers 1. . . . . li (if there are no t i e s - s e e
discussion later). Thus there is only one family of hypothetical datasets cor-
responding to a given design, and the possibility exists of tabulating the
criterion once and for all. The test is then called a rank randomization test,
based on the method of n rankings. Randomization tests based on the raw data
will be mentioned only briefly in this Chapter.
A n o t h e r simplification involves restricting attention to the complete blocks
design in which each block has exactly m observations, of which exactly one is
given each treatment. Then each lij = 1, and the third subscript on Y and R
may be suppressed. Discussion of m o r e complicated designs will be limited to a
few remarks in Section 9.
For any ordered subset (jbj2 . . . . . fi) of the integers 1. . . . . m, let
P~(j~,j2 . . . . . fi) be the probability that within the i-th block the observation
given treatment jl has a smaller value than the observation given treatment j2,
which has a smaller value t h a n . . , the observation given treatment jr. Then H0
implies that Pi(jl, j2 . . . . . jr) = l/l! for all subsets of size l, l = 2 . . . . . m. This
situation is called random ranking. Many of the tests presented in this Chapter
require no more than Assumption I for validity in testing H0, i.e., for keeping
the Type I error probability under control. However, discussion of their
behavior under alternatives will be restricted to those which satisfy
ASSUMPTION IIa. Pi(jl ..... jl) ==-P ( j l . . . . . jl), i.e., that all blocks have the same
distribution of ranks.
(The 'a' distinguishes this assumption from m o r e restrictive versions IIb and IIc
which will be introduced later.)
Given a complete blocks design subject to Assumptions I and IIa, the most
general alternative which can be detected using ranks is that P(jl . . . . . fi) ~ l/l!
for at least one ordered subset (jl . . . . . fi). Let Ri = (Ril . . . . . Rim)' be the
ranking of treatments within the i-th block, for i = 1. . . . . n, and let nk be the
observed n u m b e r of times that R i = rk, where k = 1. . . . . m! indexes the
permutations of {1 . . . . . m}. Then a suitable test of H0 against this general
alternative may be based on a simple chi-squared, such as the Pearsonian
m!m~( n) 2
X ~ = - n- ~ l = n k - ~ . ,
S21 = sgn(Y/2- Y/l)= -Sl2 (against H21: P(1, 2) < P(2, 1)),
and
s = Is,21 = Is= [ (against Hi: P(1, 2) ~ P(2, 1)).
(It is easily verified that S 2= n X 2, where X 2 is the general chi-squared of the
preceding paragraph.) U n d e r H0, each difference (Y~I- Y~2) is symmetrically
distributed about 0, whence ($12+ n)/2 has the binomial distribution with
parameters (n, 1), and asymptotically (for large n) $12/~/n has the standard
normal distribution; $21 has of course the same distribution as $12.
To illustrate, consider the following Example A (n = 5):
Treatments
1 2
Blocks 1 5 4
2 7 5
3 9 6
4 6 8
5 14 l0
Then the three test criteria a r e S12 = 3, with P = 0.1875 (6/32); $21 = - 3 , with
P = 0.96875 (31/32); and S = 3, with P = 0.3750 (12/32).
There are several ways of generalizing the sign test to more than two
treatments. The earliest was proposed by Wormleighton (1959). Let the column
vector B contain the ('~) sign test statistics
Treatments 1 2 3 1 2 3
Blocks 1 35 29 20 3 2 1
2 37 34 19 3 2 1
3 26 28 21 2 3 1
4 38 25 36 3 1 2
5 41 32 30 3 2 1
For these data 812 = 3, 823 = 3 and $31 = - 5 . Hence X~, = 6.60, whence by
Wormleighton's table P = 0.1088 (846/7776) exactly; or, referring to X2(3),
P = 0.0858 approximately.
Other generalizations of the sign test are presented in Section 4. These tests
are consistent against any alternative for which P ( f i j ' ) ~ for some j ~ j'.
However, such alternatives may still be too broad to be interesting. The next
two Sections discuss tests against somewhat more narrowly focused alter-
natives.
T h e preceding discussion has ignored the difficulties caused by the occur-
rence of tied values within blocks. The definition given earlier then produces
average ranks, but these are no longer a permutation of {1 . . . . . m}, so standard
tables of the test statistic will not be valid. There is considerable controversy as
to what must be done in such a situation. If we suppose that the ties have
resulted from imprecise measurement, so that there exists some correct way to
break them, although it is unknown to us, then it seems reasonable to report
the smallest and especially the largest P-values attainable by breaking them; in
particular, if all ways of breaking the ties lead to the same P-value, then that
P-value must be correct. Alternatively, one may break the ties at random, thus
producing a randomized test, but then the results may be ambiguous, since
different investigators might get different ranks. Some argue, however, that the
only proper basis for inference concerning/4o is the principle of randomization,
which may be applied in particular to the average ranks, even though it yields
Nonparametric methods in two-way layouts 189
little computational advantage over using the raw data. Another approach is
presented in Section 7, in the context of a model for the alternative to H0. In
large samples, of course, where asymptotic approximations are used, one may
as well adopt whatever method is simplest, since there is generally no practical
difference in the results. Further discussion of ties may be found in such texts
as Bradley (1968), Hfijek (1969), or Pratt and Gibbons (1981); for the most part
the problem is ignored in the remainder of this Chapter.
A final point worth noting is that the two-way layout may arise in several
ways. If observations are assigned to treatments within each block entirely at
random, producing by definition a randomized blocks design, then inter-
changeability is automatically assured when the treatments have no effect. But
blocks may represent subjects, and treatments times, in a repeated measure-
ments design; or blocks and treatments together may constitute a factorial
experiment. Analyses suitable for the latter two situations are considered briefly
in Section 9.
It seems intuitively reasonable that the treatments might greatly increase the
likelihood of some particular within-block ranking, with lesser increases for
similar rankings, and corresponding decreases in likelihood for the opposite
rankings. Let P = (P1. . . . . Pro)' be the supposedly favored, or predicted, rank-
ing; then similarity to it may be measured by the average external rank
correlation
n
is the rank sum of the j-th t r e a t m e n t . This statistic was suggested by Lyerly
(1952). L a t e r Page (1963) p r o p o s e d the integer-valued statistic
and t a b u l a t e d its critical values (a = 0.001, 0.01, 0.05) for m = 3(1)10 with
n = 2(1)50; m o r e detailed exact tables for m = 3(1)8 and n = 2(1)10 were
given by O d e h (1977a). U n d e r H0, E[Si] = 0 and V[Si] = 1 / ( m - 1), w h e n c e
E [ S ] = 0 and V[S] = 1 / n ( m - 1), or E[L] = n m ( m + 1)2/4 and V[L] =
nm2(m + 1)2(rn - 1)/144; and either statistic is asymptotically normal for large n
and/or rn. In fact, those parts of Page's tables for which m > 8 or n > 12 are
based on this a p p r o x i m a t i o n . A continuity correction m a y be effected by
subtracting f r o m L. Thus the test is m o r e convenient for c o m p u t a t i o n in t e r m s
of L, although the interpretation as a v e r a g e correlation m a y be useful.
J o n c k h e e r e (1954) suggested a similar criterion using Kendall's rank cor-
relation coefficient
T h e n the a v e r a g e correlation is
P H = Z E PjEN(RI~)
i ]
where EN(j) is the expected value of the j-th order statistic of a random sample
of size m from a standard normal distribution. They provide critical values
(a = 0.01, 0.05, 0.10) with exact probabilities, for m = 4 with n -- 1(1)10 and
m = 5 with n = 1(1)5. (For m = 3 the test is equivalent to using Spearman
correlation.) For large n, P H is asymptotically normal with mean 0 and
variance {nm(m + 1)/12} E E~(j).
The P i r i e - H o l l a n d e r test might be considered as equivalent to an
u n s y m m e t r i c a l - n o t e P/ rather than E N ( P / ) - a v e r a g e rank correlation.
However, alternatively P H (or S) may be thought of as a special case of the
general form E~ EjPyrRi? where ( r b . . . , rm) are general scores such that r ~<
"" ~ rm ; without loss of generality, we may take E rj = 0. This idea is developed
by Berenson (1982a).
Finally, for situations in which it is anticipated that the observed rankings
may tend to agree either with the specified ranking or with its opposite,
Hutchinson (1976) proposed taking the absolute values of the correlations
before averaging: let ffI=E[Si[/n. For m = 3 the quantity 2 n ( 1 - / ~ r ) is
binomial (n, ~) under H0. For m > 3 Hutchinson suggested the integral-valued
test criterion (m 3 - r e ) n ( 1 - / - t ) / 6 , and tabulated its exact critical values (a =
0.001, 0.01, 0.05, 0.10) for m + n ~< 11. The asymptotic distribution is normal
with mean n m ( m + 1){m - 1 - V 2 ( m - 1)/w}/6 and variance nm2(m + 1)2('rr -
2)/36~r. In Example B, /4 = S = 0.8; since m = 3, we calculate 2 n ( 1 - / - ~ ) = 2,
and the exact P = 0.2058 (1600/7776).
= Y~ c.,16),
i<i'
where C~i,= y(Ri, R~,). Such a test criterion seems intuitively reasonable, since if
the treatments tend to produce rankings similar to some P, then these rankings
will ipso facto be similar to each other. Once the measure y of rank correlation
has been chosen, the null distribution of ~ is determined, and it can be tabulated
at least for small m and n. T h e test is equivalent to the simple rank correlation y if
n = 2, of course, and to the sign test if m = 2.
192 Dana Ouade
R = Z { R j - n(m + 1)/2}2 ,
Thus clearly the minimum value is - 1 / ( n - 1), which occurs if the rank sums
are all equal, and corresponds to minimal agreement among the rankings. T h e
m a x i m u m is of course 1, when all Ri are equal. A rescaling of S to lie between
0 and l is the coefficient of concordance
W = [1 + (n - 1)gl/n = 12R/n2(m 3 - m)
+ 1 m
m
L:j = E[R~j] - + ~'~ [P(J', J) - P (1,"l")]
2 j'=l
m 3- m j=L 2 "
Nonparametric methods in two-way layouts 193
Thus clearly the test based on S (or its variants) is consistent against any
alternative for which the expected ranks are not all equal.
T h e following s u m m a r y indicates the most notable original tabulations of the
null distribution for m, n > 2; m a n y of these have been variously reprinted, but
apparently no single source covers the full range. E a c h gives u p p e r tail
probabilities (i.e., P-values), in terms of X } as the a r g u m e n t (except that
Kendall and Smith use R). T h e notation ' x D P ' indicates 'at least x decimal
places' and ' x S F ' indicates 'at least x significant figures'.
aOwen (1%2) provides tables for m = 3 with n = 3(1)15 and m = 4 with n = 3(1)8 which
unfortunately are erroneous.
4(m-l)~(Rj n 2 if m is e v e n ,
proper subset those against which the standard test based on S is consistent.
Ehrenberg provided exact tables for m = 3 with n = 4 and 5 and for m = 5 with
n = 3; these were extended by Quade (1972a) to m = 3 with n = 3(1)10, m = 4
with n = 3(1)6, m = 5 with n = 3 and 4, and m = 6 with n = 3. Van Elteren
(1957) showed that for large n the quantity 3(~)[1 + (n - 1)/~] has asymptotic-
ally the same distribution as (m + 1)X1 + X2, where X1 and X2 are independent
chi-squareds with ( m - 1) and (m~l) degrees of freedom, respectively. Ehren-
berg had already derived the three-moment chi-squared approximation; it is
obtained using -q = 2(2m + 5)/9m (m - 1) and th = 4(2m 2 + 6m + 7)/
2 7 m 2 ( m - 1)2. Correction for continuity is ettected by subtracting 1 from the
numerator of K if n is even, or 2 if n is odd.
A final test criterion based on average internal rank correlation was sug-
gested by Anderson (1959; see also Schach, 1979). His statistic is related to the
correlation measure
A(R.R~,)= [ ~ . A j ( i , i ' ) - I ] / ( m - 1 )
where Ai(i, i ' ) = 1 if treatment j has the same rank in blocks i and i', and
otherwise = 0. Let Ajk be the number of blocks in which treatment j has rank k
(note that EjAjk = EkAjk = m); then
= 2 E A j 2 - n(m + n - 1)
I n ( n - 1 ) ( m - 1)
Write Pjk for the probability that in any block the observation which is given
treatment j will have rank k (note that Ej Pjk = Ek Pjk ----1); then
E[;]=(~P}k--1)/(m--1).
1,2 1 1 1 1
1, 3 i1 -21 ~1 0
1 1
1,4 2 1 3 0
1, 5 1 1 1 1
1 1 1
2, 3 ~ -~ 3 0
2, 4 ~1 1 51 0
2,5 1 1 1 1
1 1 l 1
3, 4 -~ -~ -5 -~
1 1
3, 5 -2 5 0
4, 5 ~1 1 31 0
4. Multiple comparisons
s * = m a x JS.,I
j<j'
where the Sij, are the sign test statistics defined in Section 1. O n e m a y then
declare any two t r e a t m e n t s j and j' significantly different if IS#,] exceeds the
critical value for S*. N e m e n y i p r o v i d e d exact critical values (a = 0.01, 0.05) for
m = 3 with n = 7(1)16. R h y n e and Steel (1967) t a b u l a t e d the exact distribution of
S* - actually ( n - S * ) / 2 - f o r m = 3 with n = 2 ( 1 ) 2 4 ; they also tabulated a
one-sided variant equivalent to
S = m a x Sjj,,
j<j,
which is suitable for use against the o r d e r e d alternative that P(j, j') ~ for j < j '
(with at least one strict inequality). Miller (1966), using an a p p r o x i m a t i o n to the
asymptotic distribution, p r o v i d e d a table of critical values (a = 0.01, 0.05) for
m -- 2(1)10 with n -- 5(1)20(5)50, 100. His table turns out to be slightly anticon-
servative, h o w e v e r , when c h e c k e d against the exact tabulations for m = 3.
Suppose one t r e a t m e n t - w i t h o u t loss of generality let it be the l a s t - i s
actually a 'control', and the only c o m p a r i s o n s of interest are b e t w e e n it and the
others. T h e n suitable test criteria are the many-one sign statistics
then declare treatment j significantly better than (or different from) the control
if ( R j - R i m ) (or ] R j - R j , , ] ) exceeds the critical value for R+m (or R*). These
procedures were first proposed by Nemenyi (1963). Wilcoxon and Wilcox
(1964) provide approximate critical values (c~ = 0.01, 0.05) of both R + and R'm,
for m = 3(1)10 with n = 1(1)25, based on the asymptotic distribution: for large
n, ~ / - 6 R + / ~ n m ( m + 1) is distributed as the maximum, and V ' 6 R * / ~ n m ( m + 1)
as the m a x i m u m of the absolute values, of (m - 1) N(0, 1) random variables
with c o m m o n correlation . Hollander and Wolfe (1973) tabulate the exact
distributions of R+m and R * for m = 3 with n = 2(1)18 and m = 4 with n = 2(1)5.
Youden (1963) suggested declaring a treatment significantly different from
the others if its rank sum comes too close to the minimum possible value (m)
or to the m a x i m u m possible value (mn). Accordingly, T h o m p s o n and Willke
(1963) tabulate approximate critical values of the extreme rank-sum statistic
ASSUMPTION lib. There exist quantities f l l . . . . , ft, (block effects) such that the
random vectors (Y/1- fl~. . . . . Y~m- fl~)' are all identically distributed.
Note that Wjj, is the signed-rank statistic for comparing treatments j and j', and
Sj/ the sign statistic. (Djj,, which might be called the diminished signed-rank
statistic, is of interest in its own right for the case m = 2. It can alternatively be
obtained by calculating W/j, except ranking from 0 to n - 1 instead of from 1 to
n; and its null hypothesis distribution is the same as that of Wj~,but for sample
size n - 1.) Then under H0
where Qi123is the rank of [Y/x+ Y/2-2Y/31 among its values in the n different
blocks. Kepner and Robinson show that under Ho the special signed-rank
statistic W123 is independent of the ordinary signed-rank statistic W12, which
ensures that their test statistic
2
X,2/34 = 6(W]2+ W 2 + W~234)/n(n + 1)(2n + 1)
which has the same distribution; this is easily seen to be invariant under
permutation of the treatments. Unfortunately, it does not seem possible to
extend the K e p n e r - R o b i n s o n approach to m > 4 ; but (as will be seen in
Section 7) this is perhaps not too important since there may be little interblock
information to recover given larger blocks.
For Example B, an ordinary two-way analysis of variance produces VR =
4.47, corresponding to P = 0.0498 using the F-distribution, but P = 0.0586
(456/7776) by randomization. Subtracting the block means (28, 30, 25, 33, 34)
produces aligned observations and corresponding ranks as follows (again using
average ranks for ties):
7 1 -8 14 7.5 2.5
7 4 -11 14 11 1
(aligned observations) 1 3 -4 (aligned ranks) 7.5 9.5 4.5
5 -8 3 12 2.5 9.5
7 -3 -4 14 6 4.5
W#, 13 15 7
S#, 3 5 3
D#, 10 10 4
Nonparametric methods in two-way layouts 203
Thence Y.j (Ej, Vjj,) 2 = 632 and D = 7.24 if the upper bound is used for h, with
P = 0.0261 as the conservative approximate level. Finally, for the K e p n e r -
Robinson tests, we have:
(In all three cases there were ties, for which average ranks were used with no
adjustment for the variance.)
There are also several procedures available for making use of interblock
information in testing against an ordered a l t e r n a t i v e - t h e predicted ranking
(P1. . . . . P,,)', say. One approach, based on simultaneous signed-rank statistics,
was taken in two consecutive papers in the Annals by Hollander (1967) and
Doksum (1967). Their criteria, in standardized form, are
n +=
3 X E sgn(P t - Pi,)Wii,
{n(n + 1)(2n + 1)m(m - 1)[3+ 2(m - 2)p.]} m
and
721mZ(m 2- 1 ) n ( n - 1) "~1/2
D+ = 2n[1 + (m - 2 ) ( - ~ - - - ~)])]+[ - - ~ Z 2 ~ 1 3 - 4 8 A ) - 1]J ~'~ j P / j' Djj,,
where h and the Pn are unknown parameters, h the same as in Doksum's test
against the unordered alternative. Both H + and D + are asymptotically N(0, 1)
under H0. Thus asymptotically distribution-free tests are obtained if the un-
known parameters are suitably estimated, or conservative tests if upper bounds
are substituted for them. Hollander shows that
n 2 + 2 n ( X / 2 - 1) + (3 - 2X/2)
(n + 1)(2n + 1)
PS#, = ~ q % , s g n ( Y 0 - Y),
i
where the @ are block scores; then their criteria are based on E E s g n ( P j -
Pj,)PSjj, (generalizing Hollander's test) and E E PjPSj~, (generalizing Doksum's
test). In order to treat the asymptotic situation as n ~ o% Purl and Sen let ql be
the expected value of the i-th order statistic of a sample of size n from a
distribution function gt*(x)= ~ ( x ) - aF(-x), where ~ ( x ) is symmetric about 0
and satisfies certain Chernoff-Savage regularity conditions. Then their criteria
are asymptotically normal under H0, each with m e a n 0 and standard deviation
estimable from the data. Consider Example B, with (3, 2, 1) as the predicted
204 Dana Ouade
is asymptotically N(0, 1). The simplest special case occurs when JN is constant,
so that the wij are equivalent to the rij; in Example B this produces R A + = 2.39
and P = 0.0084 approximately. De (1976) and Boyd and Sen (1984) extend
these ideas further by employing the union-intersection principle. Their tests
are also related to a procedure of Shorack (1967) which does not utilize
interblock information: he suggested forming 'amalgamated' means
/~(1). . . . . /~(k) from the treatment rank means /~j = Rj/n according to the
process derived by Bartholomew (1959) for testing against an ordered alter-
native in one-way analysis of variance. Then under H0 the distribution of
12n mj[~(i)_m21] 2
2~-m(m+l) j~.
is asymptotically the same as Bartholomew's, where mr is the number of
treatments included in / ~ ) ; in particular, for m = 3 the approximate P-value
given X 2 = x > 0 is P = @ ( - ~ / x ) + e-x/E/6, where @ is the standard normal
distribution function. Note that 2 2 reduces to Friedman's X 2 if the /~j are
exactly in the predicted ordering. This is the case in Example B, whence
. ~ = X 2 = 6.40, and P = 0.0125 approximately.
In concluding this Section, we may mention that Nemenyi (1963) suggested
multiple comparison procedures based on signed ranks, both for comparing all
pairs of treatments and for comparing treatments with a control. These were
further developed by Miller (1966) and Hollander (1966). Other multiple
comparison methods which incorporate interblock information are due to Sen
(1969) and Wei (1982).
6. Weightedrankings
Given Assumption IIb, under which the blocks are comparable, suppose the
observations on different treatments are more distinct in some blocks than in
the others; then it seems intuitively reasonable that the ordering of the
treatments which these blocks suggest is more likely to reflect any underlying
true ordering. These same blocks might more or less equivalently be described
Nonparametric methods in two-way layouts 205
fixed block scores (or block weights), and let rl . . . . . rm be fixed treatment scores
as in Section 3. Then the general weighted-rankings statistic is
of credibility. A simple and intuitive special case arises from choosing linear
block weights qg = i, with treatment scores r) = j. This yields a linearly weighted
average internal Spearman correlation
~L=(n_l)(3n+2) (m 3 - m ) n ( n + l ) ( 2 n + l ) 1 ,
where
L= Qi Rij 2 '
j=l -
and hence
72L
x ~ - m(m + 1)n(n + 1)(2n + 1)"
a = (m - 1)~/3/3,~,
G = ~_~qo,Ci/X~ qi.
Clearly any such statistic is distribution-free under Ho, and its sampling ~-
distribution could be tabulated for small m and n. For larger experiments one
may use a normal approximation: If the block weights ql . . . . . qn are so chosen
that max E q2/q2~O as n ~ , then Cw is asymptotically normally distributed
under Ho, with mean equal to E[C] and variance V[C] E q2/('Z qi)2, where
E[C] and V[C] are the null-hypothesis mean and variance of the correlation
statistic C.
Nonparametric methods in two-way layouts 207
2 ~," 12Is
n(n + 1--)~ QiSi = n(n + 1)(m 3 - m)
where
Is : 2 ~ Oi ~ PjRij- m(m + 1)2n(n + 1)]4,
i=l ]=1
and
gL n(n 2+ 1~)~~-," QiKi = n(n + a)m(m
4IK
- 1)
where
IK = ~'~ Oi ~'~ s g n ( P / - Pf) sgn(Rij - Re).
i= 1 j<j'
The quantities Is and IK are convenient integers; Salama and Quade tabu-
lated exact one-sided critical values in terms of them at a = 0.1000, 0.0500,
0.0250, 0.0100, 0.0050, 0.0025, 0.0010 and 0.0005 for m = 3 with n = 2(1)15,
m = 4 with n = 2(1)10, m = 5 with n = 2(1)9, m = 6 with n = 2(1)7, and m = 7
with n = 2(1)5. Both SL a n d / r have mean 0 under H0;
For large n the asymptotic normality may be used to approximate the dis-
tribution; a continuity correction of 1 to Is or to Ir may be incorporated. Some
other special cases are considered by Salama and Q u a d e (1984).
In Example B, using either range or variance as the measure of credibility,
the block ranks are (4, 5,_.1, 3, 2). Considering first the unordered alternative,
we have L - - 3 4 4 , where SL = 0.688 and X~, = 6.25 and P = 0,0438. Using the
t h r e e - m o m e n t approximation, including continuity correction, we calculate
X 2 = 18.53 with 8.14 degrees of freedom, corresponding to P = 0.0190; from
the table in Q u a d e (1972b) we find P = 0.0193 (150/7776) exactly. The weighted
average external Spearman correlation is SL = 0.867 (Is = 52), with P = 0.0026
(20/7776) exactly; the normal deviate (with continuity correction) is Z = 2.43,
with P = 0.0075 approximately. Similarly, the weighted average external Ken-
dall correlation is/~L = 0.822 (It = 37), with P = 0.0033 (26/7776) exactly; the
normal deviate is Z = 2.54 with P = 0.0056 approximately.
7. E s t i m a t i o n of t r e a t m e n t effects
T o this point attention has been focused on testing the null hypothesis of
random ranking, with the alternative expressed only in terms of the distribution
208 Dana Quade
of the ranks. To progress further we shall require a suitable model for the
alternative, expressed in terms of the original observations. For example, we
may suppose that
Yq = t(Xq; ~))
where the Xij are interchangeable within blocks, the Tj are fixed but unknown
parameters called treatment effects, and t is a known treatment transformation
function such that t(x; O) = x. Note that this otherwise general model does not
allow for any interaction between blocks and treatments, in that the same
transformation applies to every block. However, it is totally nonrestrictive with
respect to the nature of block-to-block differences under H0.
Given such a model, one may test the hypothesis that the effects have any
specified values 7 = (zl . . . . . zm)'; this hypothesis may be denoted H(~-), and the
null hypothesis is then H(0). The test is accomplished by inverting the
treatment transformation function to obtain values Xij = t-l(Yq; zj) and then
testing the interchangeability of the Xij using any of the procedures presented
earlier. In addition, of course, the confidence region for the parameter ~-
consists of those values ~" which, if hypothesized, would be accepted by the test,
where if the test is at level (at most) a, then the confidence coefficient is (at
least) 1 - a. Note how such an approach avoids the difficulties caused by ties:
Since the values of ~" which lead to ties generally form only a lower-dimen-
sional subset of the parameter space, we may declare such values acceptable if
and only if they are limit points of unambiguously acceptable values. This
principle amounts to taking all confidence regions as closed sets, which makes
them and the corresponding tests conservative. Note also that letting the
confidence coefficient decrease toward zero, and taking the limit of the cor-
respondingly shrinking confidence region, may produce a reasonable point
estimate of ~'. This idea was developed by Hodges and Lehmann (1963).
From now on we specialize the model of the previous paragraph to additive
treatment effects, as follows.
Since this assumption does not completely determine the zj, we shall impose
the restriction that % = 0, without loss of generality. (Other restrictions are
equally possible, of course; indeed, to make E r i = 0 is more common.)
Consider first the special case of matched pairs (m = 2). H e r e we are
assuming for each i that Xa = V / 1 - T1 and Xi2 = V / 2 - "/2 = V/2 are interchange-
able. We may test the hypothesis H ( 6 ) that the difference between the
treatment effects is z l - z 2 - - z l = 6, against the alternative that zl:~ 6, by
rejecting for large values of the sign statistic
The corresponding confidence set, with confidence coefficient ( 1 - a), is the set
of values 6 for which H ( 6 ) is accepted at level a ; this is an interval extended
from the k-th smallest to the k-th largest of the n differences d~ = Y~I-Yi2,
where (n + 2 - 2 k ) is the critical value for S(6). As the confidence coefficient
decreases, k increases until k = n/2 for n even or k = (n - 1)/2 for n odd; thus
a reasonable point estimate of z~ is the median of the differences. Confidence
intervals may also be one-sided, of course, based on the one-sided test criteria
Sn and 821.
If we now impose Assumption IIb, which was not required for point and
interval estimates based on the sign test, then we may base estimates on the
signed-rank tests also. Consider the n(n + 1)/2 averages of two (not necessarily
distinct) within-block differences, i.e.,
= Z Z sgn(d,,,)
i<<i'
At a = 0.50 (say), the critical value for S(B) is 3, whence k = 2, so that the 50%
confidence interval for Zl extends from the 2nd smallest to the 2nd largest of
the di, i.e., [1.0, 3.0]. The corresponding point estimate is 2.0, the median of the
di. For W the critical value at a = 0 . 5 0 is 7, whence k = 5 , so the 50%
confidence interval extends from the 5th smallest to the 5th largest of the d~r, or
from 1.0 to 2.5; the point estimate is 2.0 again.
A confidence procedure for the case m = 3 was suggested by Kraft and Van
210 Dana Quade
Eeden (1968). Let T(rb r2) be the value of some test criterion T calculated
from the within-block ranks of the values X~j = Y o - rj, for i = 1. . . . , n, j =
1, 2, 3. Then a confidence region for (rl, r2) may be constructed by taking all
points in the (rl, r2) plane such that T(rl, rE) falls short of the critical value. But
consider how T(rl, r2) may vary as the point (rl, r2) moves along some
continuous path. A small change in (rh r2) may not change the within-block
ranks of the Xij at all, and hence may leave T unaffected. The within-block
ranks can change only at a point (rl, r2) where Y n - rl = Y~3, o r Y i 2 - "/'2 = Y/3,
or Y n - rl = Y~2- r2, for some i. But the set of all such points form 3n lines in
the (r,, r2) plane: one passing vertically, one horizontally, and one at a 45
angle, through each of the points ( Y n - Y~3, Y/2- Y~3). Within any one of the
subregions bounded by these lines, the ranks remain constant, and hence the
test criterion also; in different subregions the ranks are different, and the test
criterion may (but may not) be different also. Thus to construct the confidence
region it is necessary at worst to calculate the value of T(ri, rE) for one point in
each subregion; acceptance or rejection at that point determines acceptance or
rejection of the whole subregion.
All this is quite feasible if n is small, and the shapes of the confidence regions
produced may lend insight to the data, or to the nature of the test. The reader
is invited to draw the three lines through each of the five points (15, 9), (18, 15),
(5,7), ( 2 , - 1 1 ) and (11, 1) determined by Example B, say, and thence con-
struct the confidence region for (r~, r2) corresponding to Friedman's chi-
squared or any other test based on the within-block ranks. (Kraft and Van
Eeden applied their procedure only to Friedman's chi-squared, i.e., average
internal rho; the confidence regions are easier to construct, however, using
average internal or external tau.)
Extension of the Kraft and Van Eeden procedure to tests which make use of
interblock information is possible in principle, but seems not generally feasible
in practice. For example, to construct a confidence region based on ranking after
alignment would require drawing 3n(3n - 1)/2 lines. A slightly simpler situation
is presented by the weighted-ranking methods, at least if block variance is used
as the measure of credibility; then, since ties in the block ranking can also be
shown to correspond to lines in the (r~, r2) plane, there are in all n(n + 5)/2
lines to be drawn. Of course, for all these tests, and for any n, an approximate
confidence region may be conducted by dividing the plane into small subre-
gions arbitrarily, calculating the test statistic at one point in each subregion,
and declaring the whole subregion to lie inside the confidence region if and
only if the one point chosen to represent it does so. The confidence boundaries
may then be determined as closely as desired by making the subregions
sufficiently small. Finally, extension to m > 3 obviously corresponds to plotting
in higher-dimensional spaces.
Let us now consider the point estimation of arbitrary contrasts in the
treatment effects. Let to = E cjrj, where E ci = 0; then it seems reasonable to
estimate to by o3, the median of the n estimates wi = E ciYo; or, if interblock
information is available, by the H o d g e s - L e h m a n n estimate a3, the median of
Nonparametric methods in two-way layouts 211
the n(n + 1)/2 averages wii,= E c/(Y 0 + Y~,j)/2 for 1 ~< i ~< i' ~< n. But a drawback
of this approach is that the estimates of different contrasts are incompatible, in
that, for example, w ~- to1 + o~2 does not imply o5 = o31 + 03z or o3 = 031+ 032. T o
avoid this problem, L e h m a n n (1964) showed how to obtain a set of compatible
estimates to replace the 03; D o k s u m (1967) adopted the same idea for the 03,
and Puri and Sen (1967) generalized the approach to estimates based on
general scores. For each pair of treatments 1 ~<j, j' <~ m, write
= -~1 ~j, [/(j, j')-/(m, j')] and ~. = 1m ~j, [/(j; j') t(m,j')]
8. Efficiency
In this Section we present results concerning the relative efficiency of one test
of H0 with respect to another. Let nA be the number of blocks required by
some test A to detect a specified alternative, and nB the number required by
test B. Then the relative efficiency of A with respect to B is nB/na, and the
asymptotic relative efficiency (ARE) in the sense of Pitman (1948) may be
interpreted as the limit of ndnA as the alternative approaches the null hypo-
thesis (in which case nA and nB both increase to infinity). Although defined in
terms of large samples, Pitmaa A R E has been found a generally reliable guide
for comparing tests even when the samples are quite small.
The basic situation which we consider is that Assumptions I, IIb and IIIb all
hold, where Assumption IIIb is the strengthening of Assumption IIIa in which
'interchangeable' becomes 'independent and identically distributed'. We then
have the linear model
tr2(F) = f x 2 d F ( x )
is the error variance. This test will be used as the standard against which to
compare the others in the unordered case. Sen(1968a) showed that any test
based on an average internal score correlation O has corresponding
ARE(S, VR) = m
provided the density F' exists and the indicated integral converges. Bhapkar
(1963) had considered the B r o w n - M o o d test similarly. Note that for m = 2 all
these tests reduce to the sign test, with A R E equal to (2/3)o'2(F)qt2(F).
For ranking after alignment (RA) on block means, using scores based on the
general function. J, Sen (1968b) found
3m
ARE(X2s, VR) =
4(m - 1){w- 3 tan-l~/(2m - 3)/(2m - 1)"
Doksum (1967) showed that the A R E for his test statistic D (with consistent
estimator A rather than upper bound 7) is
~o < ,
,~ ,~ Z
I ~ +I
t'q
r'- ('q
D'- ~
D'- ('q
c~c~ c5 Z
+I
e~
+I
.=_
~. ~ <. ~.
c~ Z
c5c5c5 ~c5c5~c5
+I ~ +I
0
e~
ccc5 ~c5c5~c
+I ~ +I
~ . ~
II
e~
"5
e~
II
Z
<
Nonpararnetricmethods in two-way layouts 215
analysis of variance of the nonparametric tests listed above, for three error
distributions: uniform, normal and Laplace (double-exponential). The values of
m included are m = 2 (in which case Friedman's X 2 is equivalent to the
two-sided sign test, and the others to the signed-rank test), m = 3, m = 4 and
'm = ~ ' (i.e., the limit as m ~ m). The A R E s shown for weighted rankings are
estimates (+--standard errors) obtained by Silva and Q u a d e (1983) for the
statistic X 2 with linear block weights, using the range as the measure of
credibility; their estimates are also shown for X 2 s in the uniform and Laplace
cases, for which the exact expressions have not been evaluated. (It might be
noted that the S i l v a - Q u a d e method produced an underestimate of the A R E in
17 of 18 cases where an exact value was available for comparison.) For
D o k s u m ' s test the A R E s were calculated using a = 0.2909 for the uniform
distribution and a = 0.2902 for the normal, as given by Lehmann (1964); for the
Laplace distribution, a = 0.2894 (3001/10368). The dual A R E s shown for the
K e p n e r - R o b i n s o n tests represent the minimum and m a x i m u m obtainable by
permutation of the treatments; the starred values at m - - 4 apply to the
invariant statistic X~234.
Turning now to the efficiencies of tests against ordered alternatives, suppose
that for some constant r we have rj = ( P j - P m ) r in the linear model Y~j =
~i + rj + Eij, so that the hypothesis H0: r = 0 may be tested against the alter-
native Hi: r > 0. Given normal errors, the m a x i m u m likelihood estimate of r is
12 (pj m + 1)
?=n(m3-m)~ 2 ~i Yi~'
and the likelihood ratio test, as given by Hollander (1967), rejects for large
values of
Since this test criterion is asymptotically N(0, 1) under H0, as are all the others
we have presented which are aimed at the ordered alternative (except
Shorack's )~2), we are able to compare them using Pitman A R E .
Hollander (1967) found the efficiencies of Jonckheere's test (average external
Kendall correlation) and Page's test (average external Spearman correlation) to
be
where as before F is the distribution function of any error Eij. It may be noted
that ARE(S, T ) is identical with ARE(~, V R ) as given earlier in this Section.
This relationship holds for the whole family of tests based on scores of which S
216 Dana Ouade
and PH are special cases. Thus A R E ( P H , T+), as given (implicitly) by Pirie and
Hollander (1972),can be obtained by substituting EN(j) for rj in the expression
given for A R E ( O , VR). For m = 2, of course, all these tests reduce to the
one-sided sign test.
Hollander (1967) found the efficiency of the test he proposed to be
A R E ( H + ' T+ ) _ (m + 1)~rZ(Fz)'/t2(F2)
3 + 2(m - 2)(12A - 3)'
where F2 and A are as defined earlier in this Section, and Doksum (1967) found
A R E ( D +, T +) ~ - A R E ( D , VR). For m = 2, of course, these results reduce to
those for a one-sided signed-rank test. Puri and Sen (1968) extended these
efficiency results to their generalizations of these tests, and Sen (1968b) showed
that A R E ( R A +, T +) = A R E ( R A , VR).
Similar efficiency results pertain to the contrast estimators presented at the
end of Section 7; note that the parameter ~- we have been considering is itself a
contrast. In particular, Doksum (1967) showed that the A R E of his compatible
median estimator with respect to the least squares estimator of any contrast is
the same as ARE(S, VR) = ARE(S, T+); and Lehmann (1964) has shown that
the A R E of his compatible estimator is
(m/2)o-2(F*) gt2(F *)
A R E ( H E , LS) - T 7 7
where F * is the distribution of the contrast estimator from any single block.
The multiplier of o-2(F*)~tt2(F *) in this expression represents the A R E of the
compatible estimators with respect to the incompatible; it equals 1 at m = 2,
when the two estimators are identical, and increases with m to 1/2(12A- 3),
between 1 and 1.5, as m ~ ~. All these results were extended to general scores
by Puri and Sen (1967).
The preceding presentation using Pitman A R E had to omit from con-
sideration those statistics which are not asymptotically x 2 ( m - 1) (when con-
sidering unordered alternatives), or asymptotically normal (ordered). However,
there have of course been numerous empirical studies to compare the various
tests for small values of m and n. These studies, which began as early as the
original paper of Friedman (1937), are often published only as theses or
technical reports; or if more formally, they tend to be outside the mainstream
of statistical literature. We cite here only a few of them, emphasizing dis-
cussions of techniques for which the Pitman A R E has not been tractable.
Gilbert (1972) compared the powers of the classical variance ratio (VR),
Friedman's XZv, the ranking-after-alignment statistic X~s, and the K o c h - S e n
statistic W*, for m = 3 with n = 3(2)9, given normal errors. Silva (1977) and
Silva and Quade (1980) compared the expected significance levels of VR, X 2,
X ~ s and 10 weighted-ranking statistics including X~. (using 5 measures of
credibility times 2 block scoring schemes), for m = 3, 4, 5 with n = 3, 4~ 5, 6,
given normal, uniform and Laplace errors. Conover and Iman (1980) compared
Nonparametric methods in two-way layouts 217
the powers of VR, X~, X~, and the rank transform, for m = 2, 3, 4, 5, 10 with
n = 10, 20, 30 (and 40, 50 if m = 2; recall that X 2 is equivalent to the sign test,
and X~ to the signed-rank test, in that case), given normal, uniform, Laplace,
Cauchy and lognormal errors. Kepner and Robinson (1982) compared the
powers of their statistics with X 2, for m = 3 with n = 5 and 10, and for m = 4
with n = 4 and 8, given normal, uniform and Laplace errors. Some studies
which consider the ordered alternative are as follows: Salama and Quade
(1981) compared the expected significance levels of S, K, SL, KL and D (using
a bound on ~ rather than estimating it), for m = 3, 4, 5 with n = 3(1)7, given
normal, uniform and Laplace errors. Berenson (1982b) compared the powers of
TT, AT, S, PH, H +, R A + and j~2 for m = 4, 5, 6 with n = 5 and 10, for 11
different error distributions; here TT is a variant of T*, A T is the parametric
'maximin contrast' of Abelson and Tukey (1963), and H was calculated using
the upper bound on On. Boyd and Sen (1984) compared the powers of S and
four statistics derived from the union-intersection principle (one based on
within-block ranks, two based on weighted ranks, and one on aligned ranks),
for rn = 3, 4, 5 with n = 10(5)25, given normal errors.
Let us now attempt to use these various efficiency results, both theoretical
and empirical, to choose among nonparametric tests of the hypothesis of no
treatment effects within the context of a linear model for the complete blocks
design (Assumptions I, lib and IIIb). Such tests will be considered in three
stages: first, comparisons among those which ignore interblock information by
relying strictly on within-block ranks; second, comparisons among procedures
which utilize the interblock information; and finally, comparisons between
these two major classes of tests.
For m = 2, all nonparametric procedures using only the within-block ranks
reduce to the sign test. For m > 2, average Spearman c o r r e l a t i o n - F r i e d m a n ' s
X~ (or ~) for the unordered alternative and Page's L (or S) for the o r d e r e d -
are close to being standard. With respect to the classical procedures based on
assumptions of normality, they have
ARE- m cr2(F)~2(F )
m+l
in the Pitman sense, where F is the error distribution. Hodges and Lehmann
(1956) showed that o-2(F)gt2(F) cannot fall below 0.864 if F is symmetric. Sen
(1968a) showed how to choose the best test against the unordered alternative
from among those based on score correlations. The choice depends on F, but
Spearman correlation is optimal for m = 3 if F is symmetric, and for all m if F
is logistic. _Schach (1979) and Al_vo et al. (1982) indicate that Anderson's test
(based on fi,) and Ehrenberg's (/(), respectively- recall that these are not score
correlation t e s t s - a r e superior to Friedman's on the basis of approximate
Bahadur (1960, 1967) slope. For the ordered alternative, however, a direct
comparison of Spearman with Kendall correlation (Hollander, 1967) shows
m x3+2(m-2)(121-3)
ARE(D+'H+)=m+I 2+2(m-2)(12t 3)'
Consider now the comparison between those tests which do and those which
do not utilize interblock information. For m = 2 this means comparing the
Wilcoxon signed-ranks test to the sign test, yielding
which cannot exceed 3 if F2 is unimodal: see Pratt and Gibbons (1981) for an
excellent discussion of bounds on this and similar A R E expressions. Given
some specific error distributions F we have
Exponential
Distribution: Normal Uniform Laplace or Cauchy
For m > 2 we may compare Doksum's tests, whose overall efficiency properties
seem as good as those of any tests which utilize interblock information, to
Friedman's test or Page's. This amounts to multiplying A R E ( W , S) by the
factor
v(m, A) = (m + 1)/3[1 + (m - 2)(12A - 3)1,
A R E ( e , VR) = 12mo-2
m + l {NI____~ f F~(x) dFi(x) }2 .
220 Dana Ouade
This is minimized if o-1 . . . . . on, and can easily exceed 1 for only moderately
heteroscedastic errors.
Furthermore, in conformity with most authors, Assumption Ilia (requiring
interchangeability of errors within blocks) was strengthened in the foregoing
discussion to Assumption IIIb (requiring that the errors be mutually in-
dependent). However, the tests are all valid under the weaker assumption, and
the relative efficiency results apply with at most slight modifications: see Sen
(1968b, 1968c, 1972) for details.
9. Miscellaneous extensions
where Rj is the sum of the within-block ranks of all observations which receive
the j-th treatment. (Note that if k = m, then X ~ is Friedman's statistic X2v.)
Van Der Laan and Prakken (1972) tabulated the exact null-hypothesis dis-
tribution of XZD for 15 small designs, and discussed asymptotic approximations:
the simplest is that, for large n, X o ' v x 2 ( m - 1). Skillings and Mack (1981)
2
is the corrected rank sum corresponding to the j-th treatment. Define also the
m m matrix V whose (j, j') element is
The quantity Fi incorporates an adjustment for ties; it equals (I~- li)/12 if there
are no ties in the i-th block. Then the Benard-Van Elteren criterion for testing
the hypothesis of interchangeability within blocks against the unordered alter-
native is
m ( m - 1) ~ ( T t - ~)~
Oo = E Bi(m - Bi) '
222 Dana Quade
where I1/= (Yn,. , Yi,,)' and y = (Yl . . . . . Ym)' is any vector of 0s and ls. The
same test was discovered independently by Van Elteren (1963), who showed
that it is equivalent to an average Spearman or Kendall correlation among the
Y~. Van Elteren also tabulated the exact null hypothesis distribution of Q0 for
34 cases where m and n are very small and B i = B is constant for all
i = 1 , . . . , n. Patil (1975) presented a more convenient algorithm for calculating
the exact distribution, and tabulated critical values at a = 0.10, 0.05, 0.01 for
m = 3 with n = 4(1)20. As n ~ ~, Q0 is asymptotically distributed as gZ(m - 1).
Madansky (1963) extended the test to nominal responses with more than 2
categories.
Madansky also suggested a test for a hypothesis of homogeneity, specifically
where qj, is in (j, j') element of the inverse of the matrix whose (j, j') element is
(Ei YqYq,- T/Tfln). Note that H , is implied by the narrower hypothesis H0, but
does not imply it. It can be shown that Q0 is consistent for testing H0 against
HI: (not H , ) , but it is not valid for testing H , . However, Q , is asymptotically
distributed as x 2 ( m - 1) under H , (under the added assumption of identical
blocks, which Q0 does not require) and is also consistent against HI. Bhapkar
and Somes (1976) and Wackerly and Dietrich (1976). developed multiple
comparisons procedures for the probabilities involved in H , .
Let us now return from the special case of a dichotomy to the more general
response variable. As noted at the end of Section 1, interchangeability of
Y~I. . . . . Y~,, is the natural hypothesis expressing absence of treatment effects in
a true randomized-blocks situation; but there may also be interest in a broader
hypothesis, such as,
geneity for a dichotomy. Stuart (1951), Linhart (1960) and Quade (1972a) have
proposed tests for H , . Quade's method actually applies to hypotheses of the
form
E[(] = 0
Yq = / z + ~ + Eij fori=l,...,nandj=l . . . . ,m
where (without loss of generality) rm = 0 as before and the Eij have median 0.
We retain Assumption I (independence of blocks), strength Assumption II to
This last assumption implies, in particular, that any linear combination of the
Eij is distributed symmetrically about 0.
Given this model, the natural hypothesis is
and the Os are as defined in Section 5. Assumption IIIc implies that under H .
the two vectors V~ = (V~I. . . . . Vi,,)' and -V~ are equally likely a priori, for
i = 1. . . . . n, so that W* has 2" (conditionally) equally likely realizations, and
an exact P-value can be calculated; as n ~ ~, W* is asymptotically distributed
as x2(m - 1). Koch and Sen suggested another test criterion (W) for Hx, which
is obtained if V0 is replaced by (R~;- R~,,) when calculating W* as explained
above; this is appropriate without requiring any version of Assumption II, and
the same remarks about its exact and asymptotic distributions apply. They
derived expressions for the noncentrality parameters of both W* and W
under Pitman alternatives, but explicit evaluation would be complicated and
they did not carry it out. Gilbert (1972) simulated the distribution of W* under
Hx for m = 3, given normal errors with several different variance matrices, and
found that the asymptotic approximation provides a conservative test, but with
reasonable accuracy for n as small as 9. Simulations under shift alternatives,
however, suggested that X~, R A and V R may be farily robust under the
diagonal symmetry assumption, and more powerful that W*. T ~ and W were
not considered in his study. Note, by the way, that if m = 2 then T 2, W* and W
reduce to the t, signed-ranks and sign tests for matched pairs, respectively.
In Section 1 we mentioned one further source of two-way layouts: the true
factorial, in which the observations are completely randomized over the
treatment combinations, or there is an independent sample from each. Many of
the procedures presented above can be applied to such data, but we shall not
provide any explicit discussion of factorials.
References
Bennett, B. M. (1967). Tests of hypotheses concerning matched samples. Journal of the Royal
Statistical Society B 29, 468-474.
Bernard, A. and Van Elteren, P. (1953). A generalization of the method of m rankings.
Indagationes Mathematicae 15, 358-369.
Berenson, M. L. (1982a). Some useful nonparametric tests for ordered alternatives in randomized
block experiments. Communications in Statistics - Theory and Methods 11, 1681-1693.
Berenson, M. L. (1982b). A study of several useful tests for ordered alternatives in the randomized
block design. Communications in Statistics- Simulation and Computation 11, 563-581.
Bhapkar, V. P. (1961). Some nonparametric median procedures. Annals of Mathematical Statistics
32, 846-863.
Bhapkar, V. P. (1963). The asymptotic power and efficiency of Mood's test for two-way
classification. Journal of the Indian Statistical Association 1, 24-31.
Bhapkar, V. P. (1970). On Cochran's Q-test and its modification. In: G. P. Patil, ed., Random
Counts in Scientific Work Vol. 2. Penn. State., University Park and London.
Bhapkar, V. P. and Somes, G. W. (1976). Multiple comparisons of matched proportions. Com-
munications in Statistics - Theory and Methods 5, 17-25.
Blomqvist, N. (1950). On a measure of dependence between two random variables. Annals of
Mathematical Statistics 21, 593-600.
Blomqvist, N. (1951). Some tests based on dichotomization. Annals of Mathematical Statistics 22,
362-371.
Boyd, M. N. and Sen, P. K. (1984). Union-intersection rank tests for ordered alternatives in a
complete block design. To appear in Communications in Statistics.
Bradley, J. V. (1968). Distribution-free Statistical Tests. Prentice-Hall, Englewood Cliffs, N.J.
Brown, G. W. and Mood, A. M. (1948). Homogeneity of several samples. The American
Statistician 2 (3) 22.
Brown, G. W. and Mood, A. M. (1951). On median tests for linear hypotheses. In: J. Neyman, ed.,
Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability.
University of California Press, Berkeley, pp. 159-166.
Brunden, M. N. and Mohberg, N. R. (1976). The Bernard-Van Elteren statistic and nonparametric
computation. Communications in Statistics - Simulation and Computation 5, 155-162.
Cochran, W. G. (1950). The comparison of percentages in matched samples. Biometrika 37,
256-266.
Conover, W. J. and Iman, R. L. (1980). A comparison of distribution-free procedures for the
analysis of complete blocks. Unpublished manuscript presented at the annual meeting of the
American Institute of Decision Sciences, Las Vegas.
Conover, W. J. and Iman, R. L. (1981). Rank transformations as a bridge between parametric and
nonparametric statistics. The American Statistician 35, 124-133.
De, N. (1976). Rank tests for randomized blocks against ordered alternatives. Calcutta Statistical
Association Bulletin 25, 1-27.
Doksum, K. A. (1967). Robust procedures for some linear models with one observation per cell.
Annals of Mathematical Statistics 38, 878-883.
Durbin, J. (1951). Incomplete blocks in ranking experiments. British Journal of Psychology
(Statistical Section) 4, 85-90.
Ehrenberg, A. S. C. (1952). On sampling from a population of rankers. Biometrika 39, 82-87.
Friedman, M. (1937). The use of ranks to avoid the assumptions of normality implicit in the
analysis of variance. Journal of the American Statistical Association 32, 675-701.
Gilbert, R. O. (1972). A Monte-Carlo study of analysis of variance and competing rank tests for
Scheffe's mixed model. Journal of the American Statistical Association 67, 71-75.
Hajek, J. (1969). A Course in Nonparametric Statistics. Holden-Day, San Francisco.
Hannah, E. J. (1956). The asymptotic powers of certain tests based on multiple correlations.
Journal of the Royal Statistical Society B 18, 227-233,
Hays, W. L. (1960). A note on average tau as a measure of concordance. Journal of the American
Statistical Association 55, 331-341.
Hettmansperger, T. P. (1975). Non-parametric inference for ordered alternatives in a randomized
block design. Psychometrika 40, 53-62.
228 Dana Quade
Shorack, G. L. (1967). Testing against ordered alternatives in Model I analysis of variance; normal
theory and nonparametric. Annals of Mathematical Statistics 38, 1740-1752.
Silva, C. (1977). Analysis of randomized blocks designs based on weighted rankings. North
Carolina Institute of Statistics Mimeo Series No. 1137.
Silva, C. and Quade, D. (1980). Evaluation of weighted rankings using expected significance level.
Communications in Statistics - Theory and Methods 9, 1087-1096.
Silva, C. and Quade, D. (1983). Estimating the asymptotic relative efficiency of weighted rankings.
Communications in Statistics - Simulation and Computation 12, 511-521.
Skillings, J. H. (1980). On the null distribution of Jonckheere's statistic used in two-way models for
ordered alternatives. Technometrics 22, 431-436.
Skillings, J. H. and Mack, G. A. (1981). On the use of a Friedman-type statistic in balanced and
unbalanced designs. Technometrics 23, 171-177.
Skillings, J. H. and Wolfe, D. A. (1977). Testing ordered alternatives by combining independent
distribution-free block statistics. Communications in Statistics- Theory and Methods 6, 1453-
1463.
Spearman, C. (1904). The proof and measurement of association between two things. American
Journal of Psychology 15, 72-101.
Steel, R. G. D. (1959). A multiple comparison sign test: treatments versus control. Journal of the
American Statistical Association 54, 767-775.
Stuart, A. (1951). An application of the distribution of the ranking concordance coefficient.
Biometrika 38, 33-42.
Thompson, W. A., Jr. and Willke, T. A. (1963). On an extreme rank sum test for outliers.
Biometrika 50, 375-383.
Tukey, J: W. (1949). The simplest signed-rank tests. Mem. Report 17, Statistical Research Group,
Princeton University.
Tukey, J. W. (1957). Sums of random partitions of ranks. Annals of Mathematical Statistics 28,
987-992.
Van Der Laan, P. and Prakken, J. (1972). Exact distribution of Durbin's distribution-free test
statistic for balanced incomplete block designs, and comparison with the chi-square and F
approximation. Statistica Neerlandica 26, 155-164.
Van Elteren, P. (1957). The asymptotic distribution for large m of Terpstra's statistic for the
problem of m rankings. Proceedings Koningklijke Nederlandse Akademie van Wetenschappen 60,
522-534.
Van Elteren, P. (1960). On the combination of independent two-sample tests of Wilcoxon. Bulletin
of the International Statistical Institute 37, 351-361.
Van Elteren, P. (1963). Een permutatietoets voor alternatief verdeelde grootheden. Statistica
Neerlandica 17, 487-505.
Van Elteren, P. and Noether, G. E. (1959). The asymptotic efficiency of the X2-test for a balanced
incomplete block design. Biometrika 46, 475-477.
Wackerly, D. D. and Dietrich, F. H. (1976). Pairwise comparison of matched proportions.
Communications in Statistics - Theory and Methods 5, 1455-1467.
Wei, L. J. (1982). Asymptotically distribution-free simultaneous confidence region of treatment
differences in a randomized complete block design. Journal of the Royal Statistical Society B 44,
201-208.
Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics 1, 80-83.
Wilcoxon, F., Katti, S. K. and Wilcox, R. A. (1964). Critical values and probability levels for the
Wilcoxon rank sum test and the Wilcoxon signed rank test. In: H. L. Harter and D. B. Owen,
eds., Selected Tables in Mathematical Statistics Vol. 1. Markham, Chicago.
Wilcoxon, F. and Wilcox, R. A. (1964). Some Rapid Approximate Statistical Procedures. Lederle
Laboratories, Pearl River, N.Y.
Wormleighton, R. (1959). Some tests of permutation symmetry. Annals of Mathematical Statistics
30, 1005-1017.
Youden, W. J. (1963). Ranking laboratories by round-robin tests. Materials Research and Stan-.
dards 3, 9-13.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1 1
Elsevier Science Publishers (1984) 229-257 dE dE
J. N . A d i c h i e
1. Introduction
Y = a +X'fl + e (1.1)
229
230 J. N. Adichie
authors, see e.g. Pitman (1948), H o d g e s and Lehmann (1956), Chernoff and
Savage (1958).
In this chapter, we discuss the rank tests for the various hypotheses that are
usually tested with respect to the parameters of a linear model. Section 2 deals
with tests of hypotheses involving all fl in (2.1), while in section 3, we discuss
tests of sub-hypotheses. Section 4 treats tests involving comparisons of several
regression lines while Section 5 discusses tests for the multivariate linear
model. In all the discussions, we shall restrict attention to cases where the
design matrix is of full rank.
Ho: fl = 0 (2.2)
where zii = (xji - ~j), with j = n -~ E i xji. This has the effect of reparametrizing
(2.1) to F o ( y ) = F ( y - a - fl'Y~ - / 3 ' z / ) .
T o obtain the required test statistic, define
where Ri is the rank of Y/, while 0,(i) = O(i/(n + 1)) are scores generated by a
given function O(u), 0 < u < 1. Writing S, for ( S , b . . . , S,~), the rank order
statistic for testing (2.2) is given by
For large values of n therefore, an approximate level a test for (2.2) may be
obtained if H0 is rejected for M~ > X 2 ( 1 - a ) , the ( l - a ) fractile of the
chi-square distribution with q degrees of freedom.
REMARK. Observe that Condition B(ii) may not be satisfied for all cases for
which the original design matrix 32, is of rank q. A particular class of X, for
which B(ii) holds, is any orthogonal design matrix with Xl . . . . . Xq-
Y: 14 20 10 16 33 27
XI: 1 5 3 0 8 7
X2: 21 15 13 4 5 8
Y: 14 20 10 16 33 27
zl: -3 1 -1 -4 4 3
z2: 10 4 2 -7 -6 -3
Similarly S2 = -4.86. Also (Z,Z',)= (_~2 233) which satisfies Condition B(ii)
giving M, = 3.156.
Y: 1 5 0 4 4 +1
xl: 1 2 1 3 3 3
x2: 1 1 2 1 2 3
We note that the raw ranks of Y~ are 2, 6, 1, 4, 4, 2, and the average scores are
For the Wilcoxon scores, the average scores are the same as the midranks 2.5,
6, 1, 4.5, 4.5, 2.5. Rewriting X in terms of Z as in (2.3) we obtain $1 = 0.286,
$2 = - 3 . 5 with Z,Z', = u.33r4s33.33)1"33agiving M , ( 0 ) = 4.338. For the Normal scores,
the values of ~,(i) are -0.37, 1.07, -1.07, 0.37, 0.37, -0.37, giving $1 = 0.918,
$ 2 - 1.44 and M , ( ~ ) = 3.7321.
EXAMPLE 3. Consider a sample of size 30 of real life data in Table 1 taken from
page 282 of Steel and Torrie (1960). We assume the model, in (2.1) with q = 3,
where Y is the log of leaf burn in seconds, X a is the percentage of nitrogen, X 2
the percentage of chlorine and X3 the percentage of potassium. Our interest is
to test H0:/~ = 0. The ranks of 3I/are as follows:
Table 1
Percentages of nitrogen Xt, chlorine X2, potassium X3, and log of leaf burn in seconds
Y, in samples of tobacco from farmers' fields
A M = {b'(ZZ')b}B2(F)IA 2 (2.8)
with
B2(F) = f q~'(F(y)) d F ( y )
where 6' denotes derivative with respect to y. We note that for testing H0 in
(2.2) after reparametrization as in (2.3) the test statistic O based on the
likelihood ratio has the form ( n - q)Ol/qOo, where O1 = Y',Z'(Z,Z')-IZ, Y,
and as n ~ 0% Oo/(n - q) tends to 0-2 = o-Z(F), the variance of Y. Under (2.7) it
turns out that qO has an asymptotic noncentral chi-square distribution with q
degrees of freedom and noncentraliy parameter
Ao = {b'(ZZ')b}/0-2(F) (2.9)
where R + is the rank of IYil among IY1]. . . . . ]Y,[, sgn(y)= 1 or - 1 for y > 0 or
y < 0, while 4~,(i) = &((n + 1 + i)/2(n + 1)) are scores generated by a given
function ~b((1 + u)/2), 0 < u < 1. Writing S+ for ( S + b . . . , S~)' the signed rank
test statistic for testing (2.2) is
M +
= S ,+~ ( X , X , r) - I S ,+/ A 2
(2.12)
Y = a + X t l / 3 1 + X n 2 / 3 2 -}- e (3.1)
where Y and a are as defined in (1.1); Xnl , (q X n ) and X,2, (q2 n) form a
partition of X,, (ql + q2 = q) while fix and/32 are corresponding subvectors of/3.
The interest is to test
Although Koul (1970), Puri and Sen (1973) and Adichie (1978) among others
have suggested rank test statistics for (3.2), their work is nevertheless based on
the extra assumption that F, the distribution function of Y is symmetric. Since
we do not need that assumption, we first reparametrize (3.1) as follows:
Y~(fl2) = Yi = ( Y - Z ~ f l 2 ) i (3.4)
and/~i is the rank of ~ given in (3.4) and the scores are as defined in (2.4).
The proposed test statistic for testing H0 in (3.2) would be a quadratic form
in the q~ elements of T,~, with the discriminant being the inverse of the
covariance matrix. It has been shown in Adichie (1978) that subject to
conditions A - D given below, the null distribution of (n-mlb,1) tends to a
qrvariate normal with zero mean and covariance matrix A2(q0C *, where
C * = l i m n -1 C , , with
C*. = - (3.6)
and A2(~b) is as defined in (2.5). It follows that for testing H0 in (3.2), we may
use
It can be shown as in Sen and Puff (1977) that subject to the conditions A - D of
this section, the null distribution of (n-V2S,1) tends to a ql variate normal with
mean zero and the same covariance matrix Az(~b)C * given in (3.6). It follows
that
may also be used for testing H0 in (3.2) particularly if the alignment (3.4) is
done using 'rank' estimate. It is also clear that iV/, in (3.7) and M* in (3.9) have
the same limiting distribution.
The conditions required for the limiting distribution of A7/,(37/*) are as
follows:
Q1 = Y ' W , Y .
EXAMPLE 4. Consider the following artificial example adapted from page 142
of Graybill (1961), where the model is Y = a +/3~xl +/32X2 q- E'. We want to test
Ho: /31 = 0,/32 unspecified. Assume that after reparametrization, as in (3.3), the
adapted observations become:
Y: 6 13 13 29 33 23 46 117
z1: -5 -5 -3 -4 -1 0 3 15
z2: -5 -4 -3 -2 -1 0 2 13
Under H0, we use the LSE of /32 i.e. ( ~ i z 2 i Y i ) / E i z2i which gives /32 = 10.1.
Using Wilcoxon scores we find that, by (3.8), S, = (S,1, S,2)' = (1/9)(59, 59)' and,
by (3.5), 7",1= 1/9(59- (1.158)49) = 0.24 with
240 J. N. Adichie
= (310 264~
z~z"
\264 228 /
and C* = 310-(264)2/228-~ 4.32, giving ~/, = 0.16. It is observed that for this
~ o b l e m the variance ratio criterion gives O = 701/O0 = 0.05, implying as did the
M,-test, that it is an obvious case of nonrejection.
EXAMPLE 5. Let us consider again the data used in example 3 with the model
Y = fllZl+/32Z2+/33Z3+ e (observations are given in x's in the table), where
we now want to test H0:/~3 = 0, /31 and/32 unspecified. Using the least squares
estimate of/31 and/32 under H0, we get/31 = -0.592,/32 = -0.290. Ranking the
aligned observations I~/= (Yi - 131Zli - 132Z2i), we obtain the following:
7 17 3 16 6 1 11 2 29 30
24 15 4 28 25 26 20 22 18 12
27 10 8 14 23 19 21 5 13 9
With (Z,,Z') already given in Example 3, we find, using simply the Wilcoxon
scores that
thus/V/. (3.6) becomes 89.3928. This suggests that the hypothesis be rejected.
That the same result holds for/~/* in (3.9) follows from Sen and Puri (1977).
The normal theory test statistic for H0 in (3.2) can be written as On =
(Q1/ql)- (Qo/(n- q)). When F is not normal, but has a finite variance o-2(F),
R a n k tests in linear models 241
then it can be shown that under Hn in (3.11) and Condition B of Section 3.1, Qn
has asymptotically as n ~ co a noncentral chi-square distribution with ql degrees
of freedom and noncentrality parameter,
A O = (b~f*bl)/O'2(F) (3.14)
where for each j, eq has the same continuous distribution function, F(.) whose
functional form is not necessarily known. Statistical testing problems connected
with (4.1) are of two types: first, testing the parallelism (i.e./3j =/3) and secondly,
testing the coincidence ( a / = a,/3j =/3) of the regression lines.
The first step in constructing a rank test statistic for (4.2) is to align each of the
k samples on /3. But since the common value of/3 is not usually known, the
alignment is on a suitable estimate of/3. Sen (1969) used a rank estimate, but
the least squares estimate
= njlEx,
i y , i
~j=(Y~-/3xq), j = l . . . . . k, i = 1 . . . . . n i, (4.3)
and rank each of the k samples separately. Let Pij denote the rank of ~j in the
ranking of the j-th sample.
For each j - - 1 , . . . , k, let
4.1.2. Tests b a s e d on s i m u l t a n e o u s r a n k i n g
Although Adichie (1974) considered aligned observations (Yij-/3xii), where
/3 is rank estimate of/3, it is has been shown that his method is valid also for
the aligned observations given in (4.3). Let /~i be the rank of ~ in the
simultaneous ranking of all N (=Ej nj) observations. To simplify the notation,
let
c~ ) = yj(x,j - X,), s = 1. . . . . j - 1, j + 1 . . . . . k, (4.7)
= (Tj - 1)(x~ - xs), s = j,
so that
c(/) = N-1 E E c~) = 0, j = 1. . . . . k, (4.8)
s i
Rank tests in linear models 243
and
Z 2= j(a - Z
s i S
Now define
s i
where for the summations, i goes from 1 to nj; j (or s) goes from 1 to k. The
proposed test statistic is
2 = E (T2j[A(~0)G) 2- (4.9)
J
Adichie (1974) gives conditions under which L2 has asymptotically a chi square
distribution under (4.2) with aj = a. For large N, an approximate level a test is
obtained if the hypothesis is rejected for L2 > X~,-I(1- a).
The usual test statistic for (4.2) based on the least squares estimates has the
form
i j
and se2 is the mean square due to error. It is shown that under (4.10), (k - 1)O
has asymptotically a noncentral chi square distribution with (k - 1) degrees of
244 J. N. Adichie
An early attempt at providing a rank test statistic for (4.15) was made by Puri
and Sen (1969) who considered the case of only two regression lines,
where interest is on testing the composite hypothesis given in (4.15) against the
set of alternatives that violate (4.15). Consider the aligned observations ~j =
Yii- fi(xij-j), similar to those given in (4.3). Observe that because rank
(Y~j - a) is the same as rank ~j, it is not necessary to align the observations also on
a suitable estimate & of a. Let/~ii be the rank of ~j in the simultaneous ranking of
all the N (= Ej nj) aligned observations. Write
R a n k tests in linear m o d e l s 245
Aj = (niIN) (4.17)
and let
d,i = O, s# j , (4.18)
=1, s=j,
so that
dO) = N-1 E E d~ ) = At"
s i
Adichie (1975) gave conditions under which M asymptotically has a chi square
distribution with 2(k - 1) degrees of freedom under (4.15). For large N there-
fore an approximate level a test is obtained if the hypothesis (4.15) is rejected
for M > X2(k-1)(1 -- ~ ) .
Yq=aj+fljxq+eq, j = l . . . . . 4, i = 1 . . . . . 14,
and we want to test j~l = f12 = 3 = ]~4- The observations (adapted from the data
on page 395 of Brownlee's Statistical Theory and Methodology in Science and
Engirwering) are shown in Table 2.
Since we do not assume aj = a, we shall use i , of (4.6). With Wilcoxon
scores, T/i = (1/15) Ei fqxq - 7gj. The following quantities are calculated from the
given data: Least squares estimate f i = 0.56; ~1 = 12.5, 2 = 15, 3 = 11.64,
if4 = 16.21; C ] = 1461.5, 858, 1770.15, 5322.24. The ranks rii of the aligned
observations (Yq -/3xq) are shown on Table 3. With these, Tlj -- 26.43, 9.47, 27.72,
26.67 for j = 1 , . . . , 4, g i v i n g / ~ = 13.80, which suggests that the hypothesis of
equality of the regression parameter is rejected.
248 J. N. Adichie
Ho: = 0. (5.2)
The procedure described here is due to Puri and Sen (1969). Under (5.2), Y~,
i = 1. . . . . n, are independent identically distributed p-variate vectors. In
multivariate cases, we rank the marginals of the observations ( 1 , . . . , ,). So
let Rki be the rank of Y,k in the ranking of (Ykl . . . . . Yk,,) k = 1 . . . . . p. Let
be a set of scores generated by functions ~bk(U), 0 < u < 1. T o obtain the test
statistic, define a pq row vector
S~ = (S~I, ,
t
Snp) ~- (Sn.kj; k = 1 . . . . . p; j = 1 . . . . . q) (5.4)
where
Snk = Zngt.k(Rk), k = 1..... p, (5.5)
are q-vectors, and qt,,k(Rk) = (q~,,k(Rkl) . . . . . tP,,k(Rk,,))' with the scores as defined
in (5.3), and Z , is obtained from X, as in (2.3), after reparametrization.
As in the univariate case, the statistics required would be a quadratic form in
the pq elements of S~ where the discriminant of the quadratic form is the
inverse of the asymptotic covariance matrix G say of ,.q,. Unlike the situation in
the univariate case, G, even under (5.2) depends on the marginal distributions
F(k) and F(k,k') defined in (5.7) below. T o obtain the expression for G, first define
with F(k)(y) and F(k,k')(y, Z) being the marginal distribution functions of the
respective k-th and (k, k')th component of Y, and
where (~ denotes the Kronecker product, and Z Z ' = lim n-I(Z,,Z'). Because
we do not know the functional form of F, A ( F ) is not available for use in
constructing the test statistic. Consider a sample measure of A ( F ) given by
where
dt.,kk' = (n - 1)-1 "~ {gC.k(Rki)~b.k,(Rk,i)- $.k$.k'} (5.11)
i
and
~,k : n -1 ~'~ O,k(i), k = 1. . . . . p. (5.12)
i
Let
: A. z z": (5.13)
~n : SnGnlSn = ( L ~ 1. . . . . S..)G.
r A -1 (S.I,...,
t S%)'
P P q q
: E Z Z Z &kj, &k,j,, (5.14)
k k' i j'
where
((~k'))= A:' and ((z~')): ( Z . Z ' . ) - ' . (5.15)
The ~ , test in (5.14) rejects H0 in (5.2) if ~ , is large, where the cut off point is
obtained from the distribution of ~,. It turns out that in the multivariate case,
the exact distribution of the ranks of the observations, even under H0 in (5.2),
depends on F(y). Since the functional form of F is not assumed to be known, it
is not possible to obtain the exact distribution of ~,. However, a conditional
distribution is obtainable (see Puri and Sen (1966) for details), but even at that,
the enumeration of this permutation distribution of the ranks, and hence that
of ~,, can be tedious for a given n.
A way out is to resort to limiting distribution if one is available. It has been
shown in Puri and Sen (1969a) that under conditions A - C given below, ( n - m S , )
has asymptotically a pq variate normal distribution with zero mean and
covariance matrix G when (5.2) holds. Hence ~ in (5.14) has asymptotically,
as n becomes large, a chi-square distribution with pq degrees of freedom when
H0 in (5.2) holds.
It follows that for large n, and under conditions A - C below, an approximate
level a test for H0 in (5.2) may be obtained if H0 is rejected for ~?. > X~q(1 - a).
The conditions required for the limiting distribution of ~ . are as follows:
REMARKS. 1. For generating functions ~bk(U) like the Wilcoxon and the Nor-
mal Scores condition B(i) can be replaced by the milder Noether Condition,
i.e. maxi(z~/Ei z~i)--->O, j = 1 . . . . . q.
2. As remarked in the univariate case, Condition B(ii) may not be satisfied in all
cases where the original matrix X, of regression constants is of rank q.
P q q q
AL = I~'G-11~ = E E E E bkjbk'j', Zff, 7kk'(F) (5.19)
k k' j j'
where
((rkk'(F))) = (O'kk'(F)))-' (5.20)
with
Zkk,(F) = akk,(F)lBk(F)Bk,(F), k = 1. . . . . p .
In the normal theory, there are at least three well known methods that can
be used in .testing (5.2), namely the likelihood ratio test, the maximum
(minimum) characteristic criterion and the trace criteria (see e.g. Chapter 8 of
Anderson, 1958). In discussing the performance of the ~n test, we shall
compare it with the test based on the likelihood ratio or which is the same
thing, on the test based on the least squares estimates (LSE) of /3. The
likelihood ratio test statistic can be written as - 2 log A,, where
and fin is the LSE of/3, while HAll denotes the determinant of A. By expanding
(5.21) and writing the pq elements of fin as a row vector
we can write
-2logAn ( ~-'
nl .... , f -,
l n p ) ( $ n -1 Z n Z n ),( ~ n -,
l ..... -
fltnp)'
P P q q
z,~jj,o'n (5.22)
k k' j j'
where
= 2 1.
P P q q
= 2; ~, 2; 2; bkjbk'j', Zjj'o'kk'(F) (5.23)
k k' j j'
where Chk(A) denotes the k-th largest characteristic root of A, and r(F) is as
given in (5.20).
T h e efficiency in (5.24) including the bounds in (5.25) has been studied in Sen
and Purl (1967).
/3 = (,8o),/3(2)) (5.26)
{ gnl~
Z , = \Z,2/ Z,~ is of order q, n .
is the k-th component of the i-th aligned observation. Define a Pql row vector
by
At
= Tnp) = ( L , kj'~ k = 1, . .
. , p ; j = 1, . .
., ql) (5.31)
where
Tnk = Znl(In - Cn,2) 1/-[nk(/~l), k = 1 ..... p, (5.32)
and ql vectors g%(t~k)= (~nk(I~kl) ..... ~nk(l~kn))' while I~ denotes the identity
matrix of order n and C,~2 is as given in (3.5). As in Section 5.1, the statistics
required would be a quadratic form in the Pql elements of T(1),~ with the~
discriminant being the inverse of the asymptotic covariance matrix G* say Of
the distribution of T0),~. G* is a pq~ x Pql matrix given by
G* : A ( F ) @ C*
where
C * = l i m n -1 C , with C* = Znl(I n -- Cn,2)Z~l ,
G~*
. = A . @ c . *= ((a.kk,C.j,;)),
~ * k, k' = 1 .... . p , j, j' = 1,.. ., ql,
(5.33)
the proposed test statistic for testing H0 in (5.27) can be written as
~q~n = '-~"
l ( 1 ) , . U~ _n * - l ' f1(1),n=
" ( T^,. I , . . . , T~,' p ) G^*-1
, (T,1
^, . . . . . T^'
np) '
p p ql ql
= Z Z E E LkJ,
,
1 n,k,j,t* n t, n (5.34)
k k' ] j'
It can be shown in the manner of Sen and Puri (1977) that under H0 in (5.27)
and subject to the Conditions A - C of Section 5.1 and Condition D of Section
3.1, (n- 1/2T(n,, ) has asymptotically a Pql variate normal distribution with mean
254 J. N. Adichie
zero and covariance matrix G* given in (5.33). It follows then that under H0 in
(5.27) and subject to the Conditions mentioned above, ~7, given in (5.34) has
asymptotically a chi-square distribution with pqa degrees of freedom. Hence for
large values of n, an approximate level ot test of H0 in (5.27) may be obtained if
H0 is rejected for ~?, > X 2 ( 1 - a).
In the case where a 'rank' estimate /3c2), is used to align the observations
(5.30), Sen and Puri (1977) have proposed a rank order statistic Aq* for testing
H0 in (5.27). 5* is a simplified version of ~?, in (5.34). To obtain the test
statistic, define a pq~ row vector by
are ql vectors. The statistic proposed by Sen and Puri can be written as
p p ql ql
: ~ Z ~ ~ S,.kjS,,k'j', ~kk'c*Z' (5.37)
k k' j j'
and subject to the conditions of Section 5.1 and condition D of section 3.1,
n-1/2(Sncl)) is asymptotically distributed as a qPl variate normal with mean/x and
covariance, matrix G*, where
= bkjbk,j,c#,~" (F)k,
k k' j j'
where "ckk' is as given in (5.20), while cjj* is defined in (5.33). As for J?, of (3.4) it
is straightforward to show that, under H , of (5.38) and subject to the conditions
of Section 5.1 and Condition D of Section 3.1, n-1/2(T,(a)) is asymptotically
distributed as a pq~ variate normal with the same mean /~ given in (5.39). It
follows that the noncentrality parameter of ~ , is the same as that given in
(5.40). Observe that if p = 1 (univariate), G *-1 reduces to A-a(qJ)C *-~, while
(5.39) becomes b ' C * B ( F ) so that (5.40) reduces to
(b~C* bl)B2(F)/A2(~b)
given in (3.13).
The usual test statistic for the hypothesis (5.27) is based on the likelihood
ratio criterion, (see e.g. Chapter 8 of Anderson, 1958). The test statistic can be
written as - 2 log A, where
which is the same as (5.24), the efficiency of rank tests relative to the likelihood
256 J. N. Adichie
ratio tests in testing for regression in the multivariate case. If follows that the
remarks made about e.~,x in Section 5.1.1 including the bounds in (5.25) remain
valid for (5.44).
References
Puri, M. L. and Sen, P. K. (1973). A note on asymptotic distribution-free tests for subhypotheses in
multiple linear regression. Ann. Statist. 1, 553--556.
Sen, P. K. (1969). On a class of rank order tests for parallelism of several regression lines. Ann.
Math. Statist. 40, 1668-1683.
Sen, P. K. (1972). On a class of aligned rank order tests for the identity of intercepts of several
regression lines. Ann. Math. Statist. 43, 2004-2012.
Sen, P. K. and Puff, M. L. (1967). On the theory of rank order tests for location in the multivariate
one sample problem. Ann. Math. Statist. 38, 1216-1228.
Sen, P. K. and Puff, M. L. (1977). Asymptotically Distribution-Free aligned rank order tests for
composite hypotheses for general multivariate linear models. Zeit. Wahr. verw. Geb. 39, 175-186.
Sffvastava, M. S. (1970). On a class of nonparametffc tests for regression parameters. Statist. Res.
4, 117-132.
Steel, Robert G.D. and Torffe, J. H. (1960). Principles and Procedures of Statistics. McGraw-Hill,
New York.
van der Waerden, B. L. (1952, 1953). Order Tests for the Two-sample Problem and Their Power.
Indag. Math. 14, 453-458, 15, 303-316.
Vorlickova, Dana (1972). Asymptotic Properties of Rank Tests of Symmetry under Discrete
Distributions. Ann. Math. Statist. 43, 2013-2018.
Wilcoxon, Frank (1945). Individual Comparisons by Ranking Methods. Biometrics 1, 80-83.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1 ~')
Elsevier Science Publishers (1984) 259-274 _lk
1. Introduction
259
260 James C. Aubuchon and Thomas P. Hettmansperger
y = l a + X f l + e = l a * + X~fl + e, (])
second equation, we have the centered design matrix Xc = X - 1 2 ' where .~' is
a 1 x p vector of column means from X and a * = a + 2'/3. The details of an
example, with data, are given in Section 4.
Working assumptions will be listed as they are needed in the discussion. The
reader should consult the primary references for the regularity conditions
needed for the asymptotic theory.
H0: /~2= 0,
where d-z is the usual unbiased estimate of the error variance ~rz, assumed to be
finite.
The general F statistic for/40:f12 = 0 can be derived from (3) by removing
the effects of the nuisance parameters/31 from both y and X2~ before applying
(3). H e n c e in (3) replace y by y - Xlcfll, where fll = (XI~Xlc)
, , the reduced
-1XI~y,
model least squares estimate, and replace Xc by Z = X2~ - X l c ( X l c, ~ i c ) -1XlcX2c.
,
Now p is replaced by q, the dimension of f12 and (3) becomes the usual F
statistic, written in unusual form,
(y - x l c t h ) ' z ( z ' z ) - l z (y - x l d J 3
F p6 -2 (4)
These tests are easiest to implement when we use the Wilcoxon score
function; other score functions are discussed in the references. Let
and define a(i) = 4)(i/(n + 1)). Then a(1) ~<.-. <~ a(n) are called the Wilcoxon
scores. Note that fd 49(u) du = O, fd th2(u) du = 1, and El' a(i) = O.
We will let a(R(fl)) denote the n x 1 vector whose i-th componen t is
a ( R ( y i - x } c f l ) ) , where R ( y i - x ~ / 1 ) is the rank of yi-x~cfl among the n
uncentered residuals.
If the error distribution has finite variance and if the maximum leverage
tends to 0, then the results of H u b e r (1981, p. 157) show that the least-squares
estimate satisfies A3. For testing H0:132 = 0 we first align the observations and
construct a(R(fll)), the vector of rank-scored reduced-model residuals. The
aligned rank test statistic is constructed from the numerator of (4) by replacing
the residuals with a(R(fll)):
C3. Compute the F statistic on the values in C2; then A is the numerator
sum of squares.
Note that for each hypothesis of the form H0:132 = 0 which is to be tested a
new set of reduced model residuals must be computed. Minitab and SAS
programs are provided in Section 3.
Summary: We consider the three questions raised in the Introduction. (1)
There are no published studies of the small sample properties of A. We do not
know if the test, which is not nonparametric for finite sample size, maintains its
level near the nominal level. We do not know how large the sample size should
be before it is reasonable to use A. There are many possibilities for choice of
On the use of rank tests and estimates in the linear model 263
This test is constructed from a quadratic form in the full model rank estimate
of fl; see Rao (1973) for a general discussion of the Wald test statistic. The
computation of the rank estimate requires specific programs that are not
generally available in statistical packages. In Section 3 we discuss the com-
putational problems that must be overcome to be able to implement the Wald
test. By 1984 there will be a rank-regression command in the Minitab statistical
computing system which will provide all of the necessary computations. Until
that time special programs are required to implement the test.
We begin with Jaeckel's (1972) measure of dispersion of the residuals. Using
(6), define
the rank test statistic corresponding to the j-th component of ft. Jaeckel points
out that - S ( f l ) is essentially the gradient of D(~), so setting (9) to zero yields a
set of nonlinear normal equations derived from D(~).
A rank estimate /~ minimizes D(fl) or solves S ( f l ) - 0 . Jureckova (1971)
suggests the equivalent method of minimizing E ISi(fl)l.
w = ?2 (11)
1 1
~/ -- n312hn [- n ( n - 1)hn ~ ~" w , (12)
icj
! ^
The estimate fig) found by iterating (15) has the same asymptotic distribution
for any k = 1, 2 , . . . , as/~, the rank estimate that minimizes D(fl) in (8). Hence
ffk) could be used in W to construct a test. Generally, ffk) does not converge
to /~ as k increases. It is probably best to take around 4 or 5 steps and then
construct W.
Summary: (1) There are no published studies that show how the level of W
behaves for small samples. There is no indication of how large the sample size
should be before the asymptotic distribution provides a good approximation.
(2) Computation of W requires special programs and cannot be carried out using
existing statistical packages. In 1984 the Minitab statistical computing system
will contain a command that will produce in its output the rank estimate/3 and
the test statistic W. There is further discussion of computation in Section 3. (3)
With the exception of the small unpublished simulation by Hettmansperger and
McKean (1981) no study of the small sample power of W is available. In the
simulation just mentioned the estimate (14) was used after some small sample
adjustments. For example z* is multiplied by the bias correction ( n / ( n - p))1/2.
It was found in the parallelism designed that W was often liberal and needed
further correction to reduce the probability of a type I error. Its small sample
power was comparable to the power of the F, A and D* (in the next
subsection) tests.
2iii. Test based on reduction due to fitting the full and reduced models
This method in the rank case is analogous to the F statistic which can be
written as the reduction in sum of squares due to fitting the full and reduced
models. The aligned rank test and the Wald test are not directly based on the
comparison of a reduced and full model. The aligned test is constructed from
reduced-model residuals and the Wald test is a quadratic form in the full-model
estimates. It might seem most natural to combine estimation with fitting in a
robust fashion in order to have a set of strategies parallel to least squares. Then
data analytic methods such as plotting have direct counterparts based on ranks.
266 James C. Aubuchon and Thomas P. Hettmansperger
D* = D ( ~ I ) - D(~) (16)
/2
where/~1 and ~ are the reduced and full model rank estimates, respectively,
and ~ is a consistent estimate of r. Under regularity conditions, McKean and
Hettmansperger (1976) show that D* has a limiting chi-square distribution with
q degrees of freedom. Hence, the test based on (16) rejects H0: f12= 0 if
D * >1x2(q). Computation of D* requires special programs for ill, fl and ~. The
forthcoming Minitab rank-regression command will incorporate this test as part
of its output. See Section 3 for further aspects of the computational problems.
The test is illustrated on data in Section 4.
S u m m a r y : (1) Hettmansperger and McKean (1977) provide a small simula-
tion which indicates that D* along with z*, tuned for small samples, has a
significance level close to the nominal level. McKean and Hettmansperger
(1978) provide simulation results for the k-step estimate, D* and ~-*. Again,
the test seems to have a stable level. There are no simulation studies of D*
with ~. There is no indication of how large the sample size should be before the
asymptotic distribution of D* with provides a good approximation. (2)
Because of the computational problems involved in computing/~ and ? or z*
(see Section 3) special programs are required to compute D*. In 1984 the
Minitab statistical computing system will produce /~, D* and or z* in the
output of a rank-regression command. (3) In an unpublished simulation study
by Hettmansperger and McKean (1981) the test based on D* with z* had
power comparable to the F test and the test based on W.
Finally, it should be emphasized that the use of z* requires the assumption
of symmetry of the error distribution. The estimate ~ does not require
symmetry. It is not yet known how well will work in the asymmetric case and
it is not known if ~ will be a viable substitute for z* in the symmetric case.
Consistency of "~ has only been established for the Wilcoxon scores; see
Aubuchon (1982).
3. Computations
Recall that - S ( f l ) is the gradient of the dispersion, (9). Two considerations led
us to set C = (X'~X~)-1 in (17). First of all, since the asymptotic variance-
covariance structure of/~ is given by a constant times (X'cX~) -~, a natural norm
for fl is II/~11 = q J ' x ' = x J 3 ) 1/2. Results of Ortega and Rheinboldt (1970) show that
the direction of steepest descent with respect to this norm is precisely
(X~)-lS(ffr)). On the other hand, Jaeckel (1972) shows that the dispersion
function may be approximated asymptotically by a quadratic:
where fl0 is the vector of true regression parameters. The minimum of this
quadratic is attained for
Thus, if we substitute fir), our current estimate, for fl0 in (19) we are again led
to take a step in the direction (X~Xc)-IS(ffK)).
It remains to choose the step size, t ~K). We might search for the minimum of
D ( f f r+l)) as a function of t ~K)using any good linear search m e t h o d - the golden
section search or one of the other methods described in Kennedy and Gentle
(1980), for example. McKean suggests that this search might be conducted by
making use of the asymptotic linearity of the derivative of D [ f f r ) +
t(X'~X~)-IS(ffK))] with respect to t, given below in (20). (Compare Hett-
mansperger and McKean (1975).) Specifically, he suggests application of the
Illinois version of false position, as discussed by Dowell and Jarratt (1971), to
find the approximate root of this derivative:
which is easy to compute and which we would most likely desire for com-
parative purposes in any case. Another choice would be some more resistant
estimate, such as the LI estimate, which is, however, more expensive to
compute. It is not clear what the trade-offs in computational efficiency would
be in making such choices. For a convergence criterion it may be best to focus
on relative change in the dispersion, since the value of fl which minimizes the
dispersion is not generally unique. Criteria which check whether the gradient is
(approximately) zero will not be useful, since the gradient is a step function and
may step across zero.
If, in (17), we let C = (X~Xc)-1 as suggested and set t ~m = ~(x), an estimate of
~- computed on the residuals at the K-th step, we essentially have an iterative
scheme based on the K-step estimates discussed in Section 2ii. While such
estimates may be of interest in their own right, early experience of McKean
and others indicates that, taken as an algorithm for minimizing the dispersion,
this scheme can behave rather poorly for some data sets, failing to converge to,
and in fact moving away from, a minimizing point.
We should also mention that Osborne (1981) and others have developed
algorithms for minimizing the dispersion using methods of convex analysis.
Although iterative methods are not needed to compute the window estimate
of 7, a naive approach will not be very efficient. Schweder (1975) suggests an
interesting scheme for computing El Y,jI{]ri- rA < h,/2} but does not give
details. A time- and space-efficient algorithm based on Schweder's suggestion
may be found in Aubuchon (1982).
With the assumption that the error distribution is symmetric, McKean and
Hettmansperger (1976) show that a consistent estimate of ~"may be obtained by
applying a one-sample rank procedure to the uncentered residuals, ri =
Yi- J', using the one-sample score function corresponding to ~b: ~b+(u)--
~b((u + 1)/2). If (&L, &V) is the 100(1- a) % confidence interval obtained for the
center of symmetry in this fashion, then = X/n(&v - d~L)/(2Z~/2) is a consistent
estimate of r. This approach is an extension of the work of Sen (1966) to the
linear model.
If Wilcoxon scores are used, there are at least three approaches to obtaining
C~L and &u. If storage space and efficiency are not of critical importance, the
n(n + 1)/2 pairwise (Walsh) averages may be computed. Then &L and &v are
the (c + 1)st and (n(n + 1)/2- c)th order statistics from this set, where c is the
lower critical point of a two-sided, size-a Wilcoxon signed-rank test. This
critical point may be obtained from tables or from a normal approximation.
Any fast algorithm for selecting order statistics might then be used to find ~L
and &u; see, for example, Knuth (1973). An approach which is faster and which
requires much less storage is based on Johnson and Mizoguchi (1978), with
improvements discussed by Johnson and Ryan (1978). These papers actually
present the algorithm for the two-sample problem; but simple modifications
make it applicable to the present case as well. One advantage of this method is
that it still selects exact order statistics from the set of Walsh averages, without
computing and storing all of them. A third method, relying on the asymptotic
On the use of rank tests and estimates in the linear model 269
linearity of signed-rank statistics, does not guarantee exact results but is quite
fast and space-efficient. The Illinois version of false position is used to find
approximate solutions to the equations (21) defining c~L and &v in terms of a
signed-rank statistic:
4. Example
aCensored observations.
270 James C. Aubuchon and Thomas P. Hettmansperger
delay entry into a chamber. The rats were divided into three groups of ten, a
control group and two experimental groups. The experimental groups each
received some antidote to the treatment, while the control group received
none. The time taken by each rat to enter the chamber was recorded before the
treatment and again after the treatment and a n t i d o t e - i f any.
We consider the measurement before treatment as a covariate and test for
interaction between the grouping factor and the covariate; i.e., we test for
unequal slopes. The observations are strongly skewed; we applied the natural
log transformation to gain some degree of symmetry so that the estimate T* in
(14) may be applied to the data.
Computations for the aligned rank test, using least squares to fit the reduced
model, can be carried out in the SAS statistical computing system (see Helwig
and Council, 1979) using the following program:
DATA;
INPUT BEFORE AFTER ANTIDOTE;
L O G _ A F T - - L O G (AFTER);
CARDS;
{data goes here}
P R O C GLM;
CLASS A N T I D O T E ;
MODEL LOG_AFT = ANTIDOTE BEFORE;
O U T P U T O U T = R E S I D R E S I D = RESID;
PROC SORT D A T A = RESID;
BY R E S I D ;
DATA RSCORE;
SET RESID;
R S C O R E = SORT(12) * (_N_/31 - .5);
PROC GLM DATA = RSCORE;
CLASS A N T I D O T E ;
MODEL RSCORE = ANTIDOTE BEFORE ANTIDOTE*
BEFORE;
r
The desired test statistic will be the Type 4 sum of squares for
A N T I D O T E * B E F O R E in the second G L M output.
The same calculation can be made in the Minitab statistical computing
system (see Ryan, Joiner and Ryan, 1981). Some manipulation is necessary to
create the design matrix so that the R E G R E S S command can be used. Indicator
variables for the first two groups are put in ' A I ' and 'A2'; then two interaction
columns, ' I N T E R I ' and 'INTER2', are produced by multiplying each of these
by the covariate.
On the use of rank tests and estimates in the linear model 271
Table 2
Fitting full and reduced models a
Full Reduced
Table 3
Tests for equal slopes
Test statistic
Estimate of r W D*
1.12 1.38
r* 1.15 1.40
s q u a r e s F statistic. A l l of t h e s e statistics m a y b e c o m p a r e d t o t h e u p p e r
a - p o i n t of t h e c h i - s q u a r e d i s t r i b u t i o n w i t h t w o d e g r e e s of f r e e d o m . In p r a c t i c e ,
w e w o u l d m o s t l i k e l y a p p l y s o m e s m a l l - s a m p l e t u n i n g t o ~-* o r ~:. F u r t h e r , w e
w o u l d d i v i d e all o f t h e t e s t statistics e x c e p t , p e r h a p s , A d i c h i e ' s by t h e n u m e r a -
t o r d e g r e e s of f r e e d o m q (q = 2 f o r this e x a m p l e ) a n d c o m p a r e t h e r e s u l t s t o
t h e u p p e r a - p o i n t of t h e F d i s t r i b u t i o n w i t h q a n d n - p - 1 d e g r e e s of
freedom.
References
Adichie, J. N. (1978). Rank tests of sub-hypotheses in the general linear regression. The Annals of
Statistics 5, 1012-1026.
Aubuchon, J. C. (1982). Rank Tests in the Linear Model: Asymmetric Errors. Unpublished Ph.D.
Thesis. The Pennsylvania State University.
Bauer, D. F. (1972). Constructing confidence sets using rank statistics. Journal of the American
Statistical Association 38, 894-904.
Doweil, M. and Jarratt, P. (1971). A modified Regula Falsi method for computing the root of an
equation. B I T 11, 168-174.
Draper, N. R. and Smith, H. (1981). Applied Regression Analysis. Wiley, New York, 2nd ed.
On the use of rank tests and estimates in the linear model 273
Graybill, F. A. (1976). Theory and Applications of the Linear Model. Duxbury Press, North
Scituate, MA.
Helwig, J. T. and Council, K. A. (1979). S A S User's Guide. SAS Institute, 1979 edition.
Hettmansperger, T. P. and McKean, J. W. (1977). A robust alternative based on ranks to least
squares in analyzing linear methods. Technometrics 19, 275-284.
Hettmansperger, T. P. and McKean, J. W. (1981). A geometric interpretation of inferences based
on ranks in the linear model. Technical Report 36, Department of Statistics, The Pennsylvania
State University.
Hodges, J. L., Jr. and Lehmann, E. L. (1962). Rank methods for combination of independent
experiments in analysis of variance. The Annals of Mathematical Statistics 33, 482-497.
Hodges, J. L., Jr. and Lehmann, E. L. (1963). Estimation of location based on rank tests. The
Annals of Mathematical Statistics. 34, 598-611.
Huber, P. S. (1981). Robust Statistics. Wiley, New York.
Jaeckel, L. A. (1972). Estimating regression coefficients by minimizing the dispersion of the
residuals. The Annals of Mathematical Statistics 43, 1449-1458.
Johnson, D. B. and Mizoguchi, T. (1978). Selecting the Kth element in X + Y and XI + X2 + - +
X,,. S I A M Journal of Computing 7, 147-153.
Johnson, D. B. and Ryan, T. A., Jr. (1978). Fast computation of the Hodges Lehmann estimator-
theory and practice, in: Proceedings of the American Statistical Association Statistical Computing
Section.
Jureckova, J. (1971a). Nonparametric estimate of regression coefficients. The Annals of Mathema-
tical Statistics 42, 1328-1338.
Jureckova, J. (1971b). Asymptotic independence of rank test statistic for testing symmetry on
regression. Sankhya Ser. A 33, 1-18.
Kennedy, W. J., Jr. and Gentle, J. E. (1980). Statistical Computing. Marcel Dekker, Inc., New
York.
Knuth, D. E. (1973). The Art of Computer Programming, Sorting and Searching, Vol. 2. Addison-
Wesley, Reading, MA.
Koul, H. L. (1970). A class of ADF tests for subhypothesis in the multiple linear regression. The
A n n a l s of Mathematical Statistics 41, 1273-1281.
Kraft, C. H. and van Eeden, C. (1972). Linearized rank estimates and signed-rank estimates for the
general linear model. The Annals of Mathematical Statistics 43, 42-57.
Lehmann, E. L. (1963). Nonparametric confidence intervals for a shift parameter. The Annals of
Mathematical Statistics 34, 1507-1512.
McKean, J. W. and Hettmansperger, T. P. (1976). Tests of hypotheses based on ranks in the
general linear model. Communications in Statistics - Theory and Methods A5 (8), 693-709.
McKean, J. W. and Hettmansperger, T. P. (1978). A robust analysis of the general linear model
based on one step R-estimates. Biometrika 65, 571-579.
McKean, J. W. and Ryan, T. A., Jr. (1977). An algorithm for obtaining confidence intervals and
point estimates based on ranks in the two-sample location problem. Association for Computing
Machinery Transactions on Mathematical Software 3, 183-185.
McKean, J. W. and Schrader, R. M. (1980). The geometry of robust procedures in linear models.
Journal of the Royal Statistical Society Set. B 42, 366-371.
Neter, J. and Wasserman, W. (1974). Applied Linear Statistical Models. Richard D. Irwin, Inc.,
Homewood, IL.
Ortega, J. M. and Rheinboldt, W. C. (1970). Iterative Solution of Nonlinear Equations in Several
Variables. Academic Press, New York.
Osborne, N. R. (1981). A Finite Algorithm for the Rank Regression Problem. Unpublished
manuscript, Australian National University.
Policello, G. E., II and Hettmansperger, T. P. (1976). Adaptive robust procedures for the one
sample location problem. Journal Of the American Statistical Association 71, 624--633.
Puri, M. L. and Sen, P. K. (1973). A note on asymptotically distribution free tests for sub-
hypotheses in multiple linear regression. Annals of Statistics 1, 553-556.
Rao, C. R. (1973). Linear Statistical Inference and its Applications. Wiley, New York, 2nd ed.
274 James C. Aubuchon and Thomas P. Hettmansperger
Ryan, T. A., Jr., Joiner, B. L. and Ryan, B. F. (1981). Minitab Reference Manual. Minitab Project,
Pennsylvania State University, University Park, PA.
Schrader, R. M. and Hettmansperger, T. P. (1980). Robust analysis of variance based upon a
likelihood ratio criterion. Biometrika 67, 93-101.
Schuster, E. (1974). On the rate of convergence of an estimate of a functional of a probability
density. Scandinavian Acturial Journal 1, 103-107.
Schweder, T. (1975). Window estimation of the asymptotic variance of rank estimators of location.
Scandinavian Journal of Statistics 2, 113-126.
Sen, P. K. (1966). On a distribution-free method of estimating asymptotic efficiency of a class of
nonparametric tests. The Annals of Mathematical Statistics 37, 1759-1770.
Sen, P. K. (1980). On M tests in linear models. Biometrika 69, 245-248.
Sen, P. K. and Puff, M. L. (1977). Asymptotically distribution-free aligned rank order tests for
composite hypotheses for general multivariate linear models. Zeitschrift fuer Wahrscheinlich-
keitstheorie und Verwandte Gebiete 39, 175-186.
Shirley, E. A. (1981). A distribution-free method for analysis of covaffance based on ranked data.
Applied Statistics 30, 158-162.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1 "~
It , J
(~) Elsevier Science Publishers (1984)275-297
A . K. M d . E h s a n e s Saleh a n d P r a n a b K u m a r S e n
I. Introduction
275
276 A. K. Md. Ehsanes Saleh and Pranab Kumar Sen
Again, ~12 may be the unbiased and optimal (in some sense) for testing H(o12),
while El. may be so for testing H(o1"), ~:* may not be unbiased or optimal. On the
top of that if al2, av and a 2 be the size of the test ~12, ~1- and ~:2, then the size of
~:* need not be equal to (1 - a2)O/12d- a2av; indeed, it may depend on the joint
distribution of the test statistics in a more involved way and also on whether 02
is equal to 0 or not. Nevertheless, * is likely to be more robust than 12 against
02 away from 0 and more efficient than s~l. when 02 is close to 0. The effect of
preliminary test on the size, power and robustness of the ultimate test is the
main item of study.
In the subsequent sections, we specialize this model to various specific
models (where nonparametric estimators and tests work out well) and for-
mulate NPTI procedures, and present various available results on their per-
formance characteristics. Some general discussions are made in the concluding
section.
The two-sample location model is a special case where the ci may only be 0 or
1. We are primarily concerned with inference on 0, when it is suspected but not
evident that /3 = 0 (or some other specified value). As has been discussed in
(1.1)-(1.2), we incorporate a preliminary test on the hypothesis H~02):/3 = 0
(against /3 0 or /3 >0): If H~02) is tenable, (2.1) reduces to the classical
one-sample model, so that the usual signed-rank statistics (on I11. . . . . Y,) may
be employed to draw inference on 0, while if H~02)is not tenable, based on some
rank estimator (/3,) of/3, we define the residuals Y~ = Y~ -/3,ci, 1 ~< i ~< n and
use (aligned) signed-rank statistics on these residuals for drawing inference on
0. Our primary concern is to study the effect of the preliminary test on/3 on the
performance characteristics of the NPTI procedures for 0. For this, we intro-
duce, first, the preliminary notions and basic regularity conditions:
(i) F E ~, the class of all absolutely continuous, symmetric (about 0) d.f.
with (almost everywhere) absolutely continuous probability density function
(pdf) having finite Fisher information
? , = n - x ~n
ci and Q,= i (ci-6,) 2 . (2.4)
i= 1 i=1
(ii) T h e r e exists Q* (0 < O * < ~) and 6 (]cl < ~), such that
for i = 1 , . . . , n, w h e r e O < U , I < " " < U., < 1 are the o r d e r e d r.v.'s of a
s a m p l e of size n f r o m the uniform (0, 1) d.f. N o t e that by definition, d, =
n -~ ET--1 a , ( i ) = f01 q~(u) du = O, Vn i> 1. Let then
A 2 = (n - 1) -1
i=1
[a,(i) - K,]E, A2= f01
~2(u) d u , (2.8)
A ." 2 - n - 1
i=l
(a*(i)) 2 and A ~.2- _ f01
(~0*(u)) 2 d u . (2.9)
Finally, let Y, = ( Y 1 , . . . , Y.)', 1, = (1 . . . . . 1)' and for every real (a, b), let
Y , ( a , b) = I1, - a l , - bc,. Let R,a(a, b) ( = R , / ( b ) ) (or R+i(a, b)) be the rank of
Yi - a - bci (or IYi - a - b c i ] ) a m o n g Y1 - a - bcl . . . . . Y , - a - be, (or I Y2 -
a-bcd ..... ]Y,-a-bc,[), for i = l , . . . , n ; note that R , i ( a , b ) does not
d e p e n d on a. C o n s i d e r then the statistics
Theorems 4.5.1 and 5.5.1 of Sen, 1981). Also, if, in (2.1), 0 = 0 =/3, then both
S,(0, 0) and L,(0) are distributed symmetrically about 0. As in Adichie (1967),
we consider the following estimators of 0 and/3:
where o~2 (0 < O~2 < 1) is the level of significance and I.,.2 can be obtained (for
small n) by enumeration of the exact null-distribution of L. (over the n!
equally likely permutations of the ranks); for large n, I.,.2-> r~2, where r~ is the
upper 100e % point of the standard normal distribution (G), i.e., G ( r . ) = 1 - e,
0 < e < 1. For the two-sided case, in (2.15), we replace L. by ]L.] and 1.,~2 by
In*,a2, where I,~,~2 'T(1/2)a 2 as n -> oo.
The PTE O* of O, as in (1.1), is then defined by
It is clear from (2.15) and (2.17) that the size and the power of the PTT will not
only be dependent of al, a2, a3 but also on whether /3 = 0 or not. The test
function ~(S,) may not perform that well when /3 = 0, while ~:(S,) looses
efficiency when /3 = 0. Thus, the robustness of ~* against fluctuations of /3
around 0 remains to be explored.
We intend to throw light on the performance characteristics of 0* and ~:*,
mostly, in the asymptotic theory framework, developed by the current authors
(1978, 1982a). Note that ~:(L,) in (2.15) is a consistent test-function, so that for
a fixed /3, different from 0, (positive, for the one-sided test), ~(L,) will be
asymptotically equal to 1, with probability 1, so that 0* will be asymptotically
equivalent to 0, and ~:* to ~:(5~.). For 0, and ~(S,), detailed asymptotic theory
have been developed (see Adichie, Chapter 11), and hence, the same would be
applicable to 0* and ~*, respectively. The picture is, however, different when
/3 = 0 (or is very close to 0), so that ~(L,) does not converge to 0 or 1, in
probability, and 0* (or s~*,) does not share the common properties of either 0,
or 0, (or ~(S,) or ~:(S,)). For this reason and the basic fact that the PTI is
appropriate only when /3 is suspected to be close to 0, we conceive of the
following sequence {K,} of local alternatives
and study the asymptotic properties of {0"} along with that of {t~.} and {0.}
when {K.} holds. Similarly, for the PTT problem, we conceive of {K*} where
(),0, A fixed) and study the asymptotic properties of {sc*} along with those of
{~(S.)} and {s(S.)}, when {K*} holds. For this purpose, we define
1
A~ =
f0 qtZ(u)du, qJ(u) = -f'(F-l(u))]f(F-l(u)), 0<u<l,
(2.20)
')/(lit , ~0) = f01 ~0(U)II/(Ig) du. (2.21)
Note that A~ = l ( f ) < oo. Then, as in Saleh and Sen (1978), we have
At (1+e /0 *
\ (2.22)
yz(g0, g,) \ -e/O* 110* ]}
and when/3 = 0,
/
~ ( n 1/2(0 n -- 0)) -") "]~fll(0, A__bLA
~,2(~, q,)j. (2.23)
\
Nonparametric preliminary test inference 281
X2 = A~2/1/Y2(q~'
qJ)0 0*) " (2.27)
and G(x) is the standard normal d.f. For the two-sided preliminary test, the
corresponding expression is given by
where g(y) stands for the standard normal pdf. Similarly, the asymptotic mean
square errors, in the two cases, are respectively,
+ Av )I
+/-t22[(T(l/2)a 2 -- h v2)g(T(1/2)a 2 -- h/"2) "~ (T(1/2)a 2 + llkP2)g(T(1/2)a2)] }
(2.35)
(2.38)
g=O~ t o.T = o . ~ = o r . ore A / y ( G qJ),
so that all the three estimators behave similarly and the preliminary test does
not entail any difference in their (asymptotic) properties. Secondly, if ~ # 0, but
A = 0 (i.e., H0: fl = 0 holds), then/z~ =/z. =/zc = 0, but/z]' = -gu~lg(T~2 ) and is
< or > 0 according as ~ is >0 or not. Thus, for the null hypothesis case, the
two-sided PT has an advantage over the one-sided test so far as the asymptotic
bias is concerned. As regards their asymptotic mean squares errors, we have
the following result due to Saleh and Sen (1978): Under H0: fl = 0 and for
~#0,
0 < orc < o-~ < or~ < or. < oo, (2.39)
so that 0* performs better than 0,, but, 0, is better than 0"; one-sided P T E is
better than the two-sided case. The picture is different when A 0. For ~ > 0,
/z is negative at A = 0, is /~ in A E ( - ~ , A0) where A0>0, at A = A0, #T is
positive but <g'A0 and for A > A0, it is N in 3. and goes to 0 as A ~ ~. /x ~ is a
symmetric and nonnegative function of A which vanishes at A = 0 and as
A--> _+~, and on [0, ~) (or ( - ~ , 0]) it is bell shaped. Thus, the bias Ag for the
constrained estimator dominates the scene. A similar case holds when ~ < 0
(see Saleh and Sen, 1978). With respect to the asymptotic mean square errors,
there exists an interval J containing 0 as an inner point, such that O-c< orT <
o'~ < or. for all A E J, while outside this interval, 0* performs better than the
others. The picture, of course, depends on a2, and some discussion on the
appropriate choice of a2 will be made later on.
We proceed on to study the asymptotic properties of the PT-F so* in (2.17)
Nonparametricpreliminary test inference 283
under {K*~} in (2.19). First, we consider the case when ~(L,), ~(&) and ~(S.)
are all one-sided test functions. Then, by (2.15), and the discussion following
(2.16)-(2.17), we conclude that a* = size of the ~ = E{~* ]H0:0 = 0} is given
by
+ P{-~,~>~l,~2, A, n 1/210 ~- 0}
~--~n ]e2~ (2.4o)
which, in general, depends on 5~, a2, 53 as well as on/3. We would like to study
(2.40) when {K,} in (2.18) holds, and similarly, replacing {0 = 0} by {0 = n-V2h0}
in (2.40), the corresponding probability will give us the power of the PTT. We
denote the asymptotic size and power of the PTT by 5*(A) and zr*()t0, h),
respectively. Also, for every real (a, b) and p: - 1 < p < 1, let h(a, b; p) be the
quadrant probability P{zt >~a, z2 >1b}, where
z:{zi,
Note that 7r.(0, )t) = 1 - G ( % ) = 53, "q)t, while ~*(0, )t) and ~-c(0, A) may both
depend on )t. In particular, if we let a~ = 53 = a (0 < 5 < 1), then for g = 0,
so that all the three tests have the asymptotic s i z e a and are power equivalent
under {K*}, whatever be a2 (U(0, 1)). The picture is different when ~ ~ 0. First,
~-c(0, h) is > or <ax according as hg is > or <0, while ~-.(0, h) = a a (---%), VA.
Secondly, for A6X0, ~rc(0, h)Xzr*(0, A), where ~r*(0, h) is closer to a.
Thus, as regards the asymptotic size of the tests is concerned, ~* is more robust
284 A. K. Md. Ehsanes Saleh and Pranab Kumar Sen
than ~:(S.), though sc(S.) remains a better choice. On the other hand, for A0 ~ O,
the asymptotic power of the constrained test ~:(S.) is greater than that of the
unconstrained test ~:(S.) whenever
where for g ~ 0, the right hand side is smaller than 3.o whenever A0 > 0, so that
unless gA is largely negative, (2.44) holds. A similar case holds for the
two-sided tests. In any event, the asymptotic power of the PTI" lies in between
that of the constrained and the unconstrained tests, where the latter does not
depend on A while the former is grossly affected by A. The PTI" is not so
sensitive to extreme fluctuations of Z. This illustrates the efficiency-robustness
of the PT-F. The choice of a2 (for the PT) in the context of PTI will be
discussed later on.
f f
= ~ ( f ) = J " " J [(d/dx)f(x)][(d/dx)f(x)]' d F ( x ) , (3.1)
EP
where f ( x ) = (d/dx)F(x).
(iii) For each coordinate j (= 1 , . . . , p), we define the score functions ~pj =
{~oj(u), 0 < u < 1} and ~o~ = {q~~(u), 0 < u < 1} as in before (2.7) and the scores
a,~(i), a'j(2), 1 <~ i ~< n as in (2.7). The ~oi for different j need not be the same.
(iv) For each j (= 1. . . . . p), considering the Y0, 1 ~ < i ~< n, the ranks R,ii(b)
and R+ij(a, b), 1 ~ i <~ n are defined as in after (2.9). Also we let R,ii(O) = R,q,
l~<i~<n, for j = l , . . . , p , and denote the p n matrix ((R,ii)) by Rn, the
rank-collection matrix.
Nonparametric preliminary test inference 285
(v) Finally, for each j (=1 . . . . . p), replacing the Y / b y Yi/, a, (or a*) by a ,
(or a*/) and the Rni(b) (or R+,i(a, B)) by R,i/(b) or R+.i/(a, b)), as in (2.10)-(2.11),
we define S,j(a,b) and L,j(b). Let then L,j(O)=L,/, l<_j<~p and L , =
(Lnl . . . . , L,p)'.
To formulate the PT for H0:/~ = 0 against H~: j0 ~ 0, we define M, = ((m~)))
by letting for every 1 <~j, l <- p,
n
~ , = nO-,~{L'M-~L.}, (3.3)
{~ if ~ , >/~,~,, (3.4)
~:(~") = otherwise ',-
where Og2 (0 < O~2 < 1) is the level of significance of the PT. For small n, the
conditional (null) distribution of ~ , can be derived from the n! equally likely
column permutations of R,, so that ~,~2 can be obtained by direct enumera-
tion, while for large n,
and X~ has the central chi square d.f. with p degrees of freedom (D.F.).
For each j ( = 1 , . . . ,p),Awe proceed as in (2.12)-(2.14) and denote the
derived rank estimates by.0j,, fli, and /~j,, respectively. Let /J, = (01,, . . . , 0p,),^'
= . . , fit,.), O~ ( O i , , . . . , Op,)'. Then parallel to (2.16), the PTE 0* of 0
is defined by
62 ( 1 - Hp+2OG
T { I + O* A,.,)+F2T-1AA(2'H.p+2(X2~;A,)2 , -- Hp+4(Xp,} ct,.2A ) ) .
(3.18)
If we denote the dispersion matrices in (3.15), (3.16) and (3.18) by F*()t), F2()t)
and F*Ot), respectively, then we have for )t = 0, when ~ # 0, FifO)-FI(0),
F i f 0 ) - F * ( 0 ) and F * ( 0 ) - F I ( 0 ) are positive definite, when T is so. From this
point of view, employing the 'generalized variance' as the inverse of the
efficiency of the estimator, we conclude that under H0: )t = 0, 0*. performs
better than 0,, while 0, performs better than 0*. The picture is thus opposite to
that in the case of asymptotic bias. For )t 0, iV'l()t ) = T + {7"2)t)t ', where )t)t' is
the rank of 1, so that Trace(T-~)t)t ') = largest root of (T-a)t)t ') = A = )t'T 1)t >
0. Hence 0n performs (asymptotically) better or worse than 0, according as
) t ' T - 1 ) t is > o r < [ ( 1 + ~ . 2 / Q . ) p _ 1 ] / ( ~ . 2 / Q . ) , s o that for )t away from 0, /in may
not perform at all well. On the other hand, for )t in the neighbourhood of the
origin, 0* performs better than 0, and it performs even better than /}, as )t
moves away from the origin. Thus, for )t away from 0, 0* is preferred to both
0, and /},, while for )t close to 0, it is a good compromise. This explains the
robustness property of the PTE. For further details, we may refer to Sen and
Saleh (1979).
Let us now consider the case of the P I T in (3.13); this has been studied in
detail in Saleh and Sen (1983a). Like the univariate case, we consider here a
sequence {K*.} of local alternatives, where under K*,, (0, f l ) = n-1/2()t0, )t ).
Then, by (3.9) and (3.11), E{~:(Sf,)[)t0=0}~a3, irrespective of )t, while
E{~:(~,) I )to = 0} ~ P{X~(,~) > X2,,1} ( p a l ) , where z~ = ()to + ?a)'T-l()to + 6)t), so
that for ho = 0, ,~ = ?2)t'T-a)t = FZA (~>0). Thus, the asymptotic size of ~:(5~,) is
~>al, where the equality sign holds when )t = 0, or, in other words, ~:(coc~,)is not
robust against a # 0, and hence, the actual significance level of ~:(c~,) may be
higher than al when )t ~ 0 . In fact, this may be quite different from al,
depending on A. Further, E{~* [ )to = )t = 0}-> al(1 - ot2) + P{Z >1(Xp.,,3, 2 -~P,2 a2)}'
where Z has the same distribution as of two correlated chi-square variables
with the same D.F.p. Details of this are given in Saleh and Sen (1983a). On the
other hand, if A # 0, we have
E{sC*, [ )to = 0}-+/-/e (X2,~z; A)[1 - Hp(X2,,~; ,~)] + P{Z(A ) >t (X~,~3,X~.~2)},
/-Lp(Xv,~,A)[1
2 . - H.p(gp,~,,d)]+a3^
2 . [1 - H.u(Xp,~=,A)]
2 .
Nonparametric preliminary test inference 289
depend on A. On the other hand, for ~,, we have the asymptotic power
1 - H , v(Xe.%,zl)
2 . where ,~= (A0+6A)'T-I(A0+6A). Thus, ~(~,) may be more
powerful than sc(~,) (where we let al = a3 = a ) when Zi ~>A , and this depends
on A0 and A both (as well as ~). Thus, ~(~,) may not be efficiency-robust. Thus,
~(~*,) emerges as an over all robust competitor of the others.
o , = ((0,,3)= \ c ? V c?)').
We assume that
(i)
max c~D~lc~ ~ O, as n ~ (4.2)
l<.i<_n
The d.f. F is assumed to satisfy all the regularity conditions in Section 3. Also,
for every j (1 <~j ~<p), we define the score functions ~0j, q~~ and the scores a,j(i),
a*j(i), 1 <~ i <~ n, j = 1 , . . . , p as in Section 3. Further, replacing bci by Bci, for
each j, we define the L , j ( B ) as in Section 3. Note that since the c~ are q-vectors,
the L,j(B) are q-vectors- and are functions of the p q matrix B. For each j,
290 A. K. Md. Ehsanes Saleh and Pranab K u m a r Sen
R.~i(B ) is defined as in Section 3 with Y~(B) = Y~ - Bc~, 1 <- i <~n. T o estimate fl,
we p r o c e e d as in Jure~kovfi (1971) and Sen and Puri (1977), and define
p ql }
A* = B*: ~ ~'~ IL,k(B*)I = minimum . (4.5)
j=l k=l
& = (4.8)
j,j'=1,..., p; k,k'=ql + 1..... q
Then, for testing H(02): ~ 2 = 0 against /32 ~ 0, we proceed as in Sen and Puri
(1977) and Saleh and Sen (1982b) and consider the test statistic
U n d e r H~ 2), 5!,z) has asymptotically chi square distribution with Pq2 D.F., so
that its critical value ~'~21(z) converges to X~q2,~2
2 as n ~ ~. Then, the preliminary
test rank order estimator ( P T R O E ) of fll may be defined as
{
A** = B**: ~'~ ~ q [L,jk(B*)[ = minimum }. (4.12)
j=l k=ql+l
N o n p a r a m e t r i c preliminary test inference 291
W e denote by
/~o).
ml = R,a~(0,/3,2),/~(2).
no = R,~j(fl,,, 0), l~ni j = R,ij(fi.). (4.13)
Then, we let
and let
where 37/0) is defined as in (3.2) with R,ii replaced by/~(li~, for 1 ~< i ~< n, j t> 1.
Let then
..#o) t o ) ~
l~(n 1) = ~ L n j k l~nj'k'J)l<j,j<~p;l<k,k,<~q 1 ,
(4.18)
({/~(2)/~(2) "~'~
--'t)= ~-\ njk nj'k'"; . . . . " (4.19)
I<<.1,1<~p;ql+l<.k,k ~q
Finally, let Lnj k = Lnjk (0), 1 <~j ~<p, 1 ~< k, k' ~< ql and
H. = ((L.ikL.i,k,)) , (4.20)
l<-.j~p'<-.p;l<_k,k'<ql
Note that 5~. is the test statistic for testing H~ol):/31 = 0 when 2 is given as 0,
while ~ ) is the test statistic when /~2 is treated as the nuisance parameter.
Finally, the preliminary test function for testing H~01)112= 0 is based on
As in Sections 2 and 3, the main interest centers around the robustness study
292 A.K. Md. Ehsanes Saleh and Pranab Kumar Sen
of the NPTE and NPTT when H~2): flz = 0 may or may not be tenable, and, as
in earlier sections, we confine ourselves to some sequence of local alternatives,
for which meaningful asymptotic results are derivable. Against H(o2~, we con-
sider a sequence {K,} of alternative hypotheses, where
while in the context of the PTI', we consider the sequence {K*~} of alternative
hypotheses, where
@ 1 (4.28)
nm(/J,, - 181)~ N~o,(-,~2a21a il, T Q a ~1~).
For ~n, the asymptotic size of the test is equal to & when /31 = 0, /32 ~- 0, while
for/32 # 0, fll = 0, the asymptotic size may be larger than 6. Under {K,}, the
size of the test based on ~ , converges to P{g~o~(60)I>X~,~} where
where 6 may or may not be greater than 6 in (4.31) depending on A1, /~-2 and
a * As for the PTF ~ * , the asymptotic size is bounded from above by
(~(1 -- O~2) -[- O~2 A a 0, when ~.2 = 0, where for t~ and t~ when chosen close to each
other (as it should be the case), this upper bound provides a close ap-
proximation. For A2 # 0, like the multivariate simple regression model case in
Section 3, the size of the PTI" is asymptotically expressible in terms of d.f. of
correlated chi square variables, and through _it will depend on A2, it behaves
robustly in the sense that it does not go to +1 as A2 moves away from 0. The
picture with the asymptotic power is also very similar to that in these. For
details, we may refer to Saleh and Sen (1983b). This explains the robustness
properties of the P I T procedures.
294 A. K. Md. Ehsanes Saleh and Pranab Kumar Sen
Parametric procedures for PTE or PTT for the models in Sections 2, 3, and
4, when F is treated as a normal d.f., are based on the classical least squares
estimators for the constrained and unconstrained models. For example, in (4.1),
the unconstrained estimator of fl is
The normal theory likelihood ratio test may be used to test for HC02):2 = 0 and
with this the PTE fl*lCL)may be defined as in (4.10). The case of the F I T is also
similar to that in (4.23), where for the constrained and the unconstrained
likelihood ratio test, the chi square approximation works out well. For the PTE
based on these least squares estimators, (4.26)-(4.30) holds with the notable
change that T has to be replaced by ,Y, the covariance matrix Of Y (assu:l~i~d~o
be finite), and also the same change is needed in (4.31). As such, the results_run
parallel to the nonparametric setup and the same asymptotic properties hold
for the PTE and ~ procedure based on the least squares estimators.
However, here we need 2 to exist, while in the nonparametric case, this is not
needed.
Secondly, in the nonparametric case, one may use a quadratic form in the
rank based estimates for the testing purpose too. This will give some asymp-
totically equivalent tests. However, in that case, one would need to use some
consistent estimators of y~ (1 ~<j ~<p) in (3.14), which is not needed in our case;
such estimates are available in the literature (cf. Jure~kovfi, 1971; Sen and Puri,
1977, though are usually computationally quite tedious. Hence, we prefer not
to advocate this alternative procedure.
Finally, we should like to comment on t h e r o l e and choice of the significance
level of preliminary test in PTI.
For the PTE and PTI" procedure, a natural question arises as to the choice of
optimal level of significance of the preliminary test. For the PTE problem, this
can be achieved by drawing tables and graphs of the A R E ' s of 0T relative to 01
and 01 as a function of unknown parameters, which will dictate the choice of a2
for which 0T will be preferred to either 01 or 0~ at least for certain region of
values of the parameters involved. In the Pq'T problem, one constructs tables
and graphs for the power function of sc*, ~:12and ~:1 as a function of unknown
parameters, which will dictate the optimal choice of a2 for which ~:* will be
Nonparametric preliminary test inference 295
preferred over ~:12or ~1 at least for certain region of values of the parameters
involved. In such studies one sets a~2 = al = a to determine optimal a2 and it
will be necessary to make detailed study of each type F I T problem to ensure
proper choice of a2.
A demonstration of analytical approach to the P T E problem of a appropriate
choice of the level of significance in the univariate simple linear model
discussed in Section 2 is given here. Consider the two-sided preliminary test for
the estimation of the location parameter. The A R E of 0* relative to 0 as given
in Saleh and Sen (1978) is
where T~2/2> 0 for every a2 E (0, 1) and for every Ta2/2~ [0, 00) and ~ E ( - ~ , 0o)
and 6 = Av2,
Acknowledgement
This work has been supported by the NSERC grant No. A3088 (Canada) and
by the U.S. National Heart, Lung and Blood Institute. Contract NIH-NHLB1-
71-2243-L from National Health Institutes of Health.
References
[1] Adichie, J. N. (1967). Estimates of regression parameters based on rank tests. Ann. Math.
Statist. 38, 894-904.
[2] Ahsanullah, M. and Saleh, A. K. Md. Ehsanes (1972). Estimation of intercept in a linear
regression model with one dependent variable after a preliminary test of significance. Rev.
Inst. Internat. Statist. 40, 139-145.
[3] Bancroft, T. A. (1944). On biases in estimation due to use of preliminary test of significance.
Ann. Math. Statist. 15, 190-204.
[4] Bancroft, T. A. (1953). Certain approximate formulas for the power and size of a general
linear hypothesis incorporating preliminary test of significance. Unpublished prelim, report.
Stat-Lab. Iowa State Univ. Ames.
[5] Bancroft, T. A. (1964). Analysis and inference for incompletely specified models involving use
of preliminary tests of significance. Biometrics 20, 427-442.
[6] Bancroft, T. A. (1972). Some recent advances in inference procedures using preliminary
tests of significance. In: Statististical Papers in Honour of George W. Senedecor. Ames Iowa
State Univ. Press, Chapter 2, pp. 19-30.
[7] Bancroft, T. A. and Han, C. P. (1976). On pooling of means in multivariate normal
distributions. In: Ikeda et al., eds., Essays in Probability and Statistics: Ogawa Volume.
Tokyo, pp. 353-366.
[8] Bancroft, T. A. and Han, C. P. (1977). Inference based on conditional specification: A Note
and Bibliography. Int. Inst. Statist. Review 45, 117-127.
[9] Bancroft, T. A. and Han, C. P. (1980). Inference based on conditionally specified ANOVA
models incorporating preliminary testing. In: P. R. Krishnaiah, ed., Handbook of Statistics,
Vol. 1. North-Holland, Amsterdam.
[10] Hfijek, J. and ~id~ik, Z. (1967). Theory of Rank Tests. Academic Press, N.Y.
[11] Han, C. P. and Bancroft, T. A. (1968). On pooling of means when the variance is unknown. J.
Amer. Stat. Assoc. 62, 1333-1342.
[12] Jureckova, J. (1969). Asymptotic linearity of a rank statistic in regression parameter. Ann.
Math. Statist. 40, 1889-1900.
[13] Jure~kovfi, J. (1971). Asymptotic independence of a rank statistic for testing symmetry on
regression. Sankhyd Ser. A 33, 1-18.
[14] Jure~kovfi, J. (t971). Non parametric estimates of regression coefficients. Ann. Math. Statist.
42, 1328-1338.
[15] Kraft, C. H. and Van Eeden, C. (1972). Linearized rank estimates and signed rank estimates
for the general linear hypothesis. Ann. Math. Statist. 43, 42-57.
[16] Mosteller, F. (1948). On pooling data. J A S A 43, 231=242.
[17] Puff, M. L. and Sen, P. K. (1969). A class of rank order tests for general linear hypotheses.
Ann. Math. Statist. 40, 1325-1343.
[18] Puff, M. L. and Sen, P. K. (1971). Nonparametric methods in Multivariate Analysis. Wiley,
New York.
[19] Saleh, A. K. Md. Ehsanes (1973). Pretest estimators of means of multivariate normal
distribution. Carleton Math. Ser. 94.
[20] Saleh, A. K. Md. Ehsanes (1977). Pretest estimators of component means of multivariate
normal distribution. In: Proceedings of ISI meeting at Delhi, India, pp. 454-457.
Nonparametric preliminary test inference 297
[21] Saleh, A. K. Md. Ehsanes and Sen, P. K. (1978). Non parametric estimation of location
parameter after a preliminary test on regression. Ann. Statist. 6, 154-168.
[22] Saleh, A. K. Md. Ehsanes and Sen, P. K. (1982a). Nonparametric tests for location after
preliminary test on regression. Comm. Statist. 11(6), 639-652.
[23] Saleh, A. K. Md. Ehsanes and Sen, P. K. (1982b). Nonparametric estimation following a
preliminary test on regression (a collection of four papers). Carleton Mathematical Lecture
Notes No. 41.
[24] Saleh, A. K. Md. Ehsanes and Sen, P. K. (1983a). Nonparametric tests for location after a
preliminary test on regression in the multivariate case. Common. Statist. 12(16); 1855-1872.
[25] Saleh, A. K. Md. Ehsanes and Sen, P. K. (1983b). Asymptotic properties of tests of hypothesis
following a preliminary test. Statistics and Decisions 1, 455--477.
[26] Sen, P. K. (1982). Asymptotic properties of likelihood ratio tests based on conditional
specification. Statistics and Decisions 1, 81-106.
[27] Sen, P. K. and Puri, M. L. (1969). On robust nonparametric estimation in some multivariate
linear models. In: P. R. Krishnaiah, ed., Multivariate Analysis II. Academic Press, New York,
pp. 3-52.
[28] Sen, P. K. and Puri, M. L. (1977). Asymptotically distribution free aligned rank order tests for
composite hypotheses for general multivariate linear models. Z. Wahrsch. Verw. Gebiete 39,
175-186.
[29] Sen, P. K. and Saleh, A. K. Md. Ehsanes (1979). Nonparametric estimation of location
parameter after a preliminary test on regression in multivariate case. J. Multivariate Analysis
9, 322-331.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 la
O Elsevier Science Publishers (1984) 299-326
Ralph A. Bradley*
1. Introduction
*The work of the author was supported in part by the Office of Naval Research under Contract
N00014-80-C-0093 with the Florida State University. Reproduction in whole or in part is permitted
for any purpose of the United States Government.
299
300 Ralph A. Bradley
Table 1
Summary of results of a taste test
T1 T2 T3 7'4 ai
T1 -- 28 15 23 66
T2 112 -- 46 47 205
T3 39 17 -- -- 56
T4 34 11 -- -- 45
W e shall return to analysis of the data of Table 1, which gives values of a~j,
after discussion of models for paired c o m p a r i s o n s and establishment of basic
procedures.
This c h a p t e r is organized in such a way as to give initial attention to the
analysis of basic paired c o m p a r i s o n s data like those of Table 1. T h e n exten-
sions of the m e t h o d are d e v e l o p e d for factorial t r e a t m e n t c o m b i n a t i o n s and for
multivariate responses, responses on several attributes for each paired com-
parison. T h e emphasis is on the m e t h o d o l o g y and applications, although
properties of p r o c e d u r e s are n o t e d and references given. W e conclude with
c o m m e n t s on additional m e t h o d s of analysis.
some sense 7/1 and 7/"2 are measures of 'worth' of T 1 and T 2. Binomial theory
applies and the sign test may be used to test the hypothesis H0: rrl = 7r2.
Bradley and Terry (1952a) proposed a basic model for paired comparisons,
extended by Dykstra (1960) to include unequal values of the n~j. The approach
was a heuristic extension of the special binomial when t = 2. Treatment
parameters, ~-1,..., rr,, 7r~ t> 0, i = 1 , . . . , t, are associated with the t treatments,
T~. . . . . T,. It was postulated that these parameters represent relative selection
probabilities for the treatments so that the probability of selection of T~ when
compared with Tj is
The model proposed imposes structure in that the most general model might
postulate binomial parameters ~'~j and ~-# = 1 - ~r~jfor comparisons of T~ and Tj
so that the totality of functionally independent parameters is (I) rather than
( t - 1) as specified in (2.1) and (2.2).
The basic model (2.1) for paired comparisons has been discovered and
rediscovered by various authors. Zermelo (1929) seems to have proposed it first
in consideration of chess competition. Ford (1957) proposed the model in-
dependently. Both Zermelo and Ford concentrated on solution of normal
equations for parameter estimation and Ford proved convergence of the
iterative procedure for solution.
The model arises as one of the special simple realizations of more general
models developed from distributional or psychophysical approaches. Bradley
(1976) has reviewed various model formulations and discussed them under such
categories as linear models, the Lehmann model, psychophysical models, and
models of choice and worth.
David (1963, Section 1.3) supposes that T~ has 'merit' V~, i = 1 . . . . . t, when
judged on some characteristic, and that these merits may be represented on a
merit scale. H e defined 'linear' models to be such that
It is apparent from (2.4) and (2.5) that the two models are very similar. The
choice between the models is much like the choice between logits and probits
in biological assay. The use of log 7r~ as a measure of location for T~ in the first
model is suggested.
Models (2.4) and (2.5) give very similar results in applications. Comparisons
are made by Fleckenstein, Freund and Jackson (1958) with test data on
comparisons of typewriter carbon papers. In general, more extensions of model
(2.4) exist and we shall use that model in this chapter.
3. Basic procedures
The general approach to analysis of paired comparisons based on the model
(2.1) is through likelihood methods. On the assumption of independent res-
ponses for the n~j comparisons of T~ and T/, the binomial component of the
likelihood function for this pair of treatments is
?,,(
~ri + % / \~ri + ~rj/ = rrT'JTrTJ'/(Tri+ ~rJ)n'J'
It is seen that al, at constitute a set of sufficient statistics for the estimation
,
of 7rl. . . . . 7rt and that ai is the total number of preferences or selections of T~,
i -- 1. . . . . t, for the entire experiment.
Paired comparisons: Some basic procedures and examples 303
3.1. L i k e l i h o o d e s t i m a t i o n
ai ~ nq = 0 , i = l, . . . , t, (3.2)
Pi Pi + Pj
and
E P~ = 1. (3.3)
i
Solution of equations (3.2) and (3.3) is done iteratively. If p~k) is the k-th
approximation to p~,
p~k) = p * ( k ) / ~ p*(k) ,
where
p*(k)=ai/Z [nij/(p~k-,)+p}k-t))l ' k=l, 2. . . . .
1
j#i
The iteration is started with initial specification of the p!0); one may take
p~O) = 1/t, i = 1 . . . . . t, and this is adequate, although Dykstra (1956, 1960) has
suggested better initial values.
W e return to the example of Table 1. Values of ai are given in the table and
values of nij precede the table. Solution of equations (3.2) and (3.3) was begun
with p!0~= ~, i = 1 , . . . , 4. Results for initial iterations are summarized in Table
2 along with final values for p~; typically approximately 10 iterations are
sufficient for four-decimal accuracy in the final values. It is this iterative
procedure that Ford (1957) has shown to converge. T h e procedure is easy to
program on computers because of the symmetry of the equations to be solved.
Bradley and Terry (1952a) and Bradley (1954a) have provided tables giving
values of the p~ for equal values of the n~j = n, t = 3, n = 1 , . . . , 10; t = 4,
n = l . . . . , 8 ; t = 5 , n = 1 . . . . ,5.
In small experiments, small values of the n~j, perhaps with poorly selected
Table 2
Values of the estimators in the iterative solution
T/ p!0) p!l) p!2) p!3) p!,) p!S) Pi
0.2698, and P4 = 0. But there is also information on the relative values of 7ra
and ~4- We find Pl/P4 = 23/34 = 0.4035/0.5965 and can write Pl = 0.40358 and
p4 = 0.59658, 8 infinitesimal. A formal analysis may be conducted through
maximization of log L with respect to ~'T, ~'2, 7r3, 7r], 7r2+ 7r3 = 1, ~'T + 7r] = 1,
where 7r~ = 87rT, qT4 = 8"ff~ and 8 is small. Indeed, the maximum value of log L
may be found in this way and it is needed in the computation of likelihood
ratios as discussed below. Bradley (1954a) provides additional discussion of
these special boundary problems, problems not usually encountered in ap-
plications.
(i) The major test proposed by Bradley and Terry (1952a) was that of
treatment preference or selection equality. The null hypothesis is
1Common logarithms were used to compute BI in these tables. In this paper, natural logarithms
are used throughout.
Paired comparisons : Some basic procedures and examples 305
For the values of the a~ of Table 1, the noted values of the n~i above that
table, N = 372, and the values of the p~ in Table 2, we have B1 = 206.3214 and
- 2 log A1-- 103.06 with 3 degrees of freedom. There is a clear indication that
the ~ri are not equal and that treatment preferences differ.
(ii) It is always encumbent on statisticians to check the validity of models
used in statistical analyses when possible. We have noted above that a general
'multi-binomial' model with (~) functionally independent parameters ~-~jmay be
posed that ignores the structure of paired comparisons in the sense that the
same treatment is compared with more than one other treatment. The multi-
binomial model fits the data of tables like Table 1 perfectly. This permits a test
of the more restrictive model of (2.1).
The following likelihood ratio test was proposed by Bradley (1954b) and
extended by Dykstra (1960). Consider the null hypothesis,
Under Ha, the likelihood estimator of 1r0 is Po = ao/no when n~j > 0 and the
estimator is not needed when n 0 = 0. Under H0, p~ is the estimator of ~ from
equations (3.2) and (3.3). Designating A2 as the likelihood ratio statistic, we
have
For large n~, -21ogA2 is taken to have the chi-square distribution with
( I ) - (t - 1) = ~(t - 1)(t - 2) degrees of freedom under H0. An alternative statis-
tic, asymptotically equivalent to that of (3.9), is
Dykstra has noted that the test statistics may be distorted when some n 0 are
small. Since there is no basis for pooling terms in this case, he suggested
omitting terms in (3:11) with very small values of n 0 (and hence nji ) and
deleting one degree of freedom for each pair of terms so deleted.
For the data of Table 1, n34 = 0 and the tests for the fit of the model have
306 Ralph A. Bradley
(3)(2) - 1 = 2 degrees of freedom. From (3.9), - 2 log h2 = 2.02 and there seems
to be no reason to doubt the appropriateness of the model (2.1). The statistic in
(3.10) is evaluated also for illustrative purposes. Values of the a~j are given in
Table 3 and they may be compared directly with the values of a# in Table 1.
Computation yields X2 = 2.00; the close agreement of the two computations is
typical.
Table 3
E s t i m a t e d f r e q u e n c i e s f o r th,z d a t a o f T a b l e 1
Row
T1 TE T3 "/'4 Sums
- 2 l o g a3 = 2 ( B , - ~ BI,,),
U=I
where Blu is computed from (3.6) for the data within group u and B1 is
computed similarly for the pooled data from all of the groups. For large values
of the niju, the number of comparisons of T~ and Tj in group u, - 2 log A3 has the
central chi-square distribution with ( g - 1 ) ( t - 1) degrees of freedom under H0
of (3.12).
Paired comparisons: Some basic procedures and examples 307
A n o m n i b u s test of t r e a t m e n t e q u a l i t y m a y b e d e s c r i b e d :
b a s e d on t h e p o o l e d d a t a .
B r a d l e y a n d T e r r y (1952a) g a v e a small e x a m p l e for two tasters e v a l u a t i n g p o r k
roasts f r o m h o g s with differing diets, t = 3, g = 2, ni~u -- 5 for all i, j, u, i ~ j. T h e
d a t a a r e s u m m a r i z e d in T a b l e 4, a n d T a b l e 5 is t h e analysis of c h i - s q u a r e table.
T h e l a r g e t o t a l t r e a t m e n t effect is s e e n to b e d u e to d i s a g r e e m e n t of t h e t w o
j u d g e s on p r e f e r e n c e s .
Table 4
Roast pork preference data for two judges
Table 5
Analysis of chi-square, roast pork data
Large-sample theory may be used to obtain variances and covariances for the
estimators Pl . . . . . Pt or their logarithms in paired comparisons. Bradley (1955)
considered this theory with each nij = n and Davidson and Bradley (1970),
considering the multivariate model discussed in Section 6, obtained results for
general n~j as a special case.
Let /l/j = niJN, N = E~<~ nij. q h e n ~ / N ( p l - 7rl) . . . . . X/N(p, - 7r,) have the
singular multivariate normal distribution of dimensionality (t - 1) in a space of t
dimensions with zero mean vector and dispersion matrix X = [o'ij] such that
i=l,...,t,
ji
and (3.15)
a o = - / . d ( r r , + ~ r j ) 2, ij, i,j=l .... ,t.
In order to use these results in applications, o-ij must be estimated; this is done
through substitution of pi for 7r~ in (3.15) to obtain the h~j, and subsequent
substitution in (3.14) yields the d'i/s.
For the data of Table 1, values of Pl . . . . . P4 in Table 2 are used to obtain
A =
I104963
-0.9558 0.4304
-1240
-0.3022
-20.4259-]
- .3553
]
- 1.2740 -0.3022 0.7441
-2.4259 -0.3553 0 3.12371
_1
from whence
E -0.0695
-0.0314
0.0208
0.6644
-0.4689
-0.1260
-0.4689
0.6784
-0.1781
Note that ,~ is singular, the row and column sums being zero.
-0.1260
-0.1781
on ~rl i s developed from the fact that ~/N-(Pi - 7ri)/X/~ is standard normal for
large N. In the example, the 0.95-confidence interval for 7ra is (0.0795, 0.1369). Let
~r* be a vector containing any subset of t* distinct parameters of the set, t* < t.
The (1 - a)-confidence region for these t* parameters is that ellipsoidal region of
the parameter subspace for which
Paired comparisons: Some basic procedures and examples 309
With a = 0.01, t * = 2, ,)(2.01,2= 9.210, it may be verified that (3.17) yields the
0.99-confidence region,
Since it may be appropriate to regard log 7ri as the location parameter for T~,
i = 1 , . . . , t, in view of (2.4) and (2.5), confidence intervals or regions on the
log 7r~ may be desired. It follows that ~ / N ( l o g p x - l o g 7 q ) , . . . , ~/N-(log p , -
log 7rz) have the singular multivariate normal distribution with zero mean
vector and dispersion matrix D , ~ D where D is the diagonal matrix with typical
element 1/~ri. Estimated variances and covariances are as follows:
est.var.(~/N log Pi) = o'iijp2i, est.covar.(~/N log Pi, N/N log pj) = oij/PiPj, i # j.
Confidence intervals or regions on the log 7r~ may be obtained analogously to
those shown above for the 7r~.If a method of multiple comparisons is to be used,
the necessary variances and covariances may be obtained from the information
given.
in the very special case when each nij = n, approximate variances and
covariances may be obtained if the treatments are not too disparate. Then, on
the assumption that ~-~ = 1/t, i = 1. . . . . t, o-~ = 2 ( t - 1)2/t3 and o-~j= - 2 ( t - 1)/t 3,
i ~ j , while N = n(~). Like the binomial with its stable variance for its
parameter in a middle range, so are the variances and covariances stable in
paired comparisons when the 7ri are near 1/t and the n~j = n. This can reduce
computational effort for balanced experiments.
The basic paired comparisons experiment forces decision on the part of the
respondent and data like those of Table 1 result. Nevertheless, ties or 'non-
selection' judgments often arise, for example, in consumer testing.
The treatment of ties in the sign test has received considerable attention.
Hemelrijk (1952) demonstrated that the most powerful test of significance was
obtained by omission of ties and use of a conditional binomial test on the
sample results so reduced. But the treatment ties must depend on experimental
objectives, see Gridgeman (1959), and estimation of potential share of a
consumer market surely must require other considerations. Decisions for
paired comparisons must be similar to those for the sign test. Two formal
methods for the treatment of ties in paired comparisons are available.
Rao and Kupper (1967) introduced a parameter 0 t> 1 and adjusted prob-
abilities associated with the comparison of T~ and Tj to obtain
and
P(T~ = Tj) = (02- 1)~',Trj/(~', + 0~-j)(0~ri + ~,j)
where r / = log 0. It is seen that the model extends the linear model of (2.4) and
that log 0 is, in a sense, a threshold parameter associated with discriminatory
ability.
Rao and Kupper extended the theory in parallel with that given above.
Unfortunately, they assumed that n# = n, but the work is easily extended. We
summarize only the results leading to the test of treatment equality, although
they provide other asymptotic results including variances and covariances for
their estimators. We use our notation. Let N = Ei<~ n o and bij be the sum of the
number of ties and the number of preferences for T~ in the n# comparisons of
T~ and Tj. Let b~ = Ej.]~ bij and let b0 be the total number of ties in the
experiment. The likelihood equations are:
Paired comparisons: Some basic procedures and examples 311
bi ~, bq _ ~, bq = 0 , i = 1 . . . . . t,
Pi J' Pi+OP' J" Obi+Pi
j'i j#i (4.2)
X Pi = 1 , obO__l X bqp, = 0
i i#j Pi + Opj '
- 2 log A T = 2 N log 2 N - 2bo log 2b0 - 2(N - b0) log(N - bo) - 2 B ' f ,
(4.3)
where
B ~ = ~ bij log(p/+ Opj) - ~ bi log Pi - bo log(0 2 - 1). (4.4)
ij i
Again, for large N and under H0, - 2 1 o g A ~ has the central chi-square
distribution with ( t - 1) degrees of freedom. An iterative solution of equations
(4.2) is suggested by R a o and Kupper. They provided also a test of the
hypothesis, 0 = 00, against the alternative, 0 # 00.
Davidson (1970) proposed probabilities corresponding to those of (4.1) as
v >/O. This model preserves the odds ratio, P ( T i -~ Tj)/P(Tj ~ Ti) = 7ri/rrj, con-
sistent with the Luce (1959) choice axiom. In addition, the probability of a tie is
a m a x i m u m when ~-~ =~-j and diminishes as 7r~ and ~-j differ, an intuitively
desirable effect.
Let b~ be the sum of the n u m b e r of ties and twice the n u m b e r of preferences
for T/ in the nij comparisons of T/ and Tj and let b~' = Xj, j~,i bib Davidson's
likelihood equations are
Davidson also proposed an iterative solution for the equations (4.6) and
examined large-sample theory. H e showed that the R a o - K u p p e r test and the
Davidson test for treatment equality are asymptotically equivalent.
The choice between the two methods for extending the basic paired com-
parisons model to a model allowing for ties seems to be a matter of intuitive
appeal. Both give very similar results in applications.
The corresponding probabilities for the model with multiplicative order effects
are
The model given by (4.8) and (4.9) requires that 16d ~< min(Tri, ~ ) , an awkward
feature, while the model (4.10) only requires that 0ij > 0 . Advantages of the
multiplicative model (4.10) are:
(i) Preference probabilities depend on the worth parameters 7ri and 7rj only
through the ratio 7r-dcrj.
(ii) Model (4.10) admits a sufficient statistic whose dimension is that of the
parameter space.
(iii) Model (4.10) is a linear model and, for example,
Paired comparisons: Some basic procedures and examples 313
ai rlqO nq =
i=l,...,t,
P* ~ (Op* + p : ) - ~ (p* + Opt) O,
jei j~i (4.11)
p* = 1, f _ ~, nqp* = O,
i 0 ~j' ( @ * + P ~ )
where f is the total number of preferences for the first presented item of a pair,
p* is the estimator of ~'i and ~J of 0, while n~j is the number of judgments on the
ordered pair (Ti, Tj) and nji is the number of judgments on the ordered pair
(Tj, Ti). The likelihood ratio statistic for H0: 7r~ = 1/t, i= 1 , . . . , t, versus
Ha: rri ~ 1/t for some i in the presence of an order effect is
Again, under H0, - 2 log )t~/has the central chi-square distribution with (t - 1)
degrees of freedom. A test for the presence of a common order effect, H0:0 = 1
versus Ha: 0 ~ 1, follows immediately. For this test,
(4.16)
i<j
The basic model for paired comparisons can be extended to triple com-
Paired comparisons: Some basic procedures and examples 315
parisons in at least two ways. Bradley and Terry (1952b) proposed the model,
P(T~ --) T~--) Tk) = Irzicrj/[cr2(1rj+ rk) + 7r2(~'i + 7rk) + 7r~(~'i + ~'j)].
(4.19)
In both models, the 7r's may again be regarded as worth parameters with
Z/~-~ = 1. Both models have some desirable properties as discussed in the
second reference. Model (4.18) is consistent with the Luce choice axiom and
can be written as a Lehmann model (see Bradley (1976)). Model (4.19) has the
property that the set of treatment rank sums constitutes a set of sufficient
statistics for the estimation of 7rl. . . . . 7r,. Basic methodology for the second
model is well developed including estimation procedures, tests of hypotheses
including goodness of fit, and asymptotic theory.
We show only the estimating equations and the basic test for model (4.19). If
Pl Pt are the estimators of 7rl. . . . , ~'t, they result from solution of the
. . . . .
equations,
2_}_ 2
ai nijk[2p~(Pj + Pk) + P j
P~] = 0, i = 1 . . . . . t, ~pi= 1,
Pi j<k Dijk (P ) i
j. k~,i (4.20)
where
Oijk (P ) ~" P~(Pj
2 2
+ Pk) + Pj(P, + Pk) + P~(P~ + Pj) (4.21)
and nijk is the number of repetitions or rankings on the triplet (T~, Tj, Tk),
i < i < k. The quantity ai in (4.20) is such that
where Ri is the total sum of ranks for T~ in the experiment. Pendergrass and
Bradley suggest iterative means of solution of the equations (4.20) although they
held each nijk = n for all i < j < k.
The likelihood ratio test of H0: ~ri = 1/t, i = 1, . . . , t, versus Ha: ~r~:~ 1/t for
some i, is based on
where N = Ei<j<k nljk. Under H0, -21ogA5 has the central chi-square dis-
tribution with ( t - 1) degrees of freedom for large N.
Park (1961) applied the Pendergrass-Bradley procedures to experimental
316 Ralph A. Bradley
data and compared the results with those from companion experiments using
paired comparisons. He found good model fits and estimator agreement.
~ l o g 7ri = 0 . (5.1)
i=1
The change does not affect the estimation of probabilities of pairwise pref-
Table 6
P r e f e r e n c e d a t a in coffee t e s t i n g
T r e a t m e n t not p r e f e r r e d , T~
000 - - 15 15 16 19 14 19 16 114
001 11 -- 10 15 15 14 15 12 92
010 11 16 -- 15 15 14 18 15 104
Treatment
011 10 11 11 - - 14 11 15 13 85
preferred,
a 100 7 11 11 12 -- 9 14 13 77
T,
101 12 12 12 15 17 -- 16 18 102
110 7 11 8 11 12 10 -- 12 71
111 10 14 11 13 13 8 14 -- 83
Paired comparisons: Some basic procedures and examples 317
erences and the two sets of p a r a m e t e r s are related: G i v e n 7~"1. . . . . 3Tt subject to
(5.1) and 7r~ . . . . , 7r *t with E i 7r *i = 1, 7r *i = 7rl/E i 7ri and zri = ~" *i/(IIi 7r .i) 1/t L e t B.,
be an m t matrix consisting of m z e r o - s u m o r t h o n o r m a l rows, 0 ~< m ~< (t - 1). A
set of m o r t h o n o r m a l t r e a t m e n t contrasts is defined by
w h e r e log ~" is the t-element column vector with typical e l e m e n t log 7r~. Clearly
a set of contrast representing factor effects or their interactions can be
f o r m u l a t e d by choice of the e l e m e n t s of B,, exactly as d o n e for usual analysis
of variance for factorials.
Only o n e estimation p r o b l e m need be considered, the estimation of the
e l e m e n t s of ~" given constraints,
w h e r e l't is the t-element row vector of unit elements, ~ - ' = (7rl . . . . . 7rt), and
O,,1 is the (m + 1)-element column vector of zeros. E s t i m a t i o n is by the
m e t h o d of m a x i m u m likelihood and the resulting r e d u c e d estimating equations
are
a i / p i - chi(P) = 0, i = 1,..., t,
(5.4)
log Pi = 0, Bm log p = O,,,
i
w h e r e p = (Pl . . . . . P,),
E l ( p ) : al - ~ nqpi/(p, + p j ) , (5.6)
J
D = It - B ' B , . , (5.7)
Du > 0, It, the t-square identity matrix. N o t e the similar f o r m s in (5.6) and (3.2).
If m = 0, the estimation process involves solution of (3.2) with (3.3) replaced by
the second e q u a t i o n of (5.4).
Iterative solution of equations (5,4) is discussed briefly by Bradley and
E I - H e l b a w y (1976) and in detail by E1-Helbawy and Bradley (1977). In the
latter reference, it is shown that the p r o p o s e d iterative p r o c e d u r e converges
318 Ralph A. Bradley
= I-n.,,1
Bmo IBm1J
be matrices like Bin, 0 <- ma, ml <~mo <~( t - 1), mo = ma + m~. With the con-
dition that E~ log 7r~ = 0, we test
where B1 is defined in (3.6), and, for large N = Yi<j nij and under H0 in (5.8),
the statistic has the central chi-square distribution with ml degrees of freedom.
In (5.10), p0 is the solution of (5.4) where Bm= Bmo and Pa, the solution when
Bm= Bin. Basically, the test involves the assumption that
]
j log ~ = Om+l
8.o
and a test of the additional constraints,
Bm1log ~ = Oral ,
Bm~=7--~,
1[!1_11111!]
-1 1 -1 -1 1 -1 ,.
-1 .1 1 1 -1 -1
The statistic - 2 log Am0.m has the central chi-square distribution with 3 degrees
of freedom and is large. It is possible also to partition this chi-square into three
chi-squares, each with 1 degree of freedom, as is done in Table 7.
The general test procedure for hypothesis (5.8) versus (5.9) based on the
statistic (5.10) may be used repeatedly to produce an analysis of chi square
table. Two such analyses are given in Tables 7 and 8 for the data of Table 6.
Rows in these tables correspond to rows of the usual analysis of variance table
for a 23 factorial and similar descriptive terms have been used. In order to
preserve orthogonality of the various chi-squares, they must be sequenced
properly; each row requires that certain conditions be assumed, equivalent to
the specification of Bm. Both Tables 7 and 8 are shown to illustrate two
different sequencings of the rows and to suggest that the choice of sequencing
does not have substantial effects on the inferences that may be made. Ad-
ditional details on computations for Tables 7 and 8 are given by Bradley and
El-Helbawy (1976).
The analyses below were done through recognition of the factorial structure
of the treatments. Factorial parameters may be introduced formally, although it
is not necessary to do so. We illustrate with the 23 factorial. Let 7r~ replace ~'i
for the treatment T,, -- Ti, where a = ( a l , o/, a 3 ) , a r = 0 or 1, r = 1, 2, 3. We
reparameterize by writing
3
= l-I l-I "ll-(')ar% . ~l~la3
. (5.11)
r= ] r<s
The parameters on the right-hand side of (5.11) are new factorial parameters.
The transformation is linear if logarithms are taken; the logarithms of the new
factorial parameters are subject to the usual linear constraints for factorial
parameters in the analysis of variance in order to make the transformation
one-to-one. Estimators of the factorial parameters are functions of the estima-
tors p,,. A full explanation of these procedures is given by E1-Helbawy and
Bradley (1976).
Special treatment contrasts may be of interest in paired comparisons. Sup-
pose that, in a coffee taste test experiment with t = 4, T4 represents an
experimental coffee produced by a new process while the other treatments
came from a standard process. One may wish to compare 7"4 with the other
three treatments. Two approaches are possible. The first assumes nothing,
ma = 0, and takes
320 Ralph A. Bradley
E
O
"6
,D
O O
0
.=_ .=_
~
O
0
"~ O
.o
~a
~7
ZZZZZZZZZ O O O O O O O O
o O
o
Z Z Z Z Z Z Z Z Z
8
O o O o
'.r,
ca
.=-
o~ ~B
o oo ~ ~
~ ~ . ~
~ ~ ...... .ID
cn
.<
Z Z Z Z Z Z Z Z Z
Paired comparisons: Some basic procedures and examples 321
1
Bm~= ~ / ~ (1, 1, 1, - 3 ) .
[1/~/2 - 1/~/2 0 0]
B~. = Ll/~/6 1/~/6 -2/~/6 0
and retains the s a m e B~ 1. With these matrices defined, the general test
p r o c e d u r e of this section is used.
W e h a v e p r e s e n t e d a m e t h o d for the examination of specified t r e a t m e n t
contrasts and the analysis of factorial paired c o m p a r i s o n e x p e r i m e n t s t o g e t h e r
with examples. T h e s e m e t h o d s provide m u c h new flexibility.
where ~" has typical element ~-~ and ~-~ is the a-th row of ~'. The quantity
BI(~'~) is the function B1 of (3.6) with pg there replaced by 7r~ and ag replaced
by a~, the total number of preferences for T~ on attribute a. In addition,
f(sl i, j) is the number of times the preference vector s occurs among the n~j
responses to the pair (T~, T/). We may express the logarithm of the likelihood
function as
Consider first a test for independence:/40: p = 0 versus Ha: p~o # 0 for some
</3, a,/3 = 1. . . . , p. Under H0, the likelihood equations reduce to equations
(3.2) and (3.3) for each a --- 1. . . . . p. If p0 is the solution for the a-th set of
equations and becomes the a-th row of p0, p0 estimates rt under H0. Under Ha,
the equations to be solved are:
~ P ~ i = l, a = l . . . . . p,
i
where
and, under Ho, it has the central chi-square distribution with p(p - 1) degrees
of freedom.
If it is assumed that p = 0 , tests on the parameters ~'~ may be made
separately as in the univariate case for each a = 1 . . . . . p.
Paired comparisons: Some basic procedures and examples 323
with the central chi-square distribution with p(t- 1) degrees of freedom under
H0.
A likelihood ratio test of the fit of the model (6.1) is given by Davidson and
Bradley. An alternative test may be based on
Treatment
pair Cell frequencies f(s I i, j) Frequency
i, j Cells s no
(iii) (rio (iji) (/~/i) (iij) (]ij) (ijj) (l'lj)
1,2 8 1 1 1 0 2 0 9 22
(7.93) (1.09) (1.15) (1.69) (0.76) (0.97) (0.37) (8.03)
1, 3 6 0 1 1 1 0 1 9 19
(6.25) (0.60) (1.24) (0.92) (1.12) (0.62) (0.64) (7.61)
2, 3 7 1 1 1 3 1 1 6 21
(6.92) (0.37) (1.26) (0.60) (1.70) (0.75) (1.10) (8.31)
324 Ralph A. Bradley
Table 10
Test statistics for hypotheses for the chocolate pudding data
Tests are summarized in Table 10. It is seen that the major effects are the high
correlations among responses on attributes.
As a final comment on the example, cell frequencies are small and asymp-
totic theory must be regarded only as approximate. The tests do, however,
seem to work well and be adequately indicative.
References
Abelson, R. M. and Bradley, R. A. (1954). A 2 2 factorial with paired comparisons. Biometrics
10, 487-502.
Beaver, R. J. (1976). Discussion: Science, statistics and paired comparisons. Biometrics 32, 233-235.
Beaver, R. J. and Gokhale, D. V. (1975). A model to incorporate within-pair order effects in paired
comparisons. Comm. Statist.-Theor. Meth. 4, 923-939.
Bliss, C. I., Greenwood, M. L. and White, E. S. (1956). A rankit analysis of paired comparisons for
measuring the effects of sprays on flavor. Biometrics 12, 381-403.
Bradley, R. A. (1953). Some statistical methods in taste testing and quality evaluation. Biometrics
9, 22-38.
Bradley, R. A. (1954a). The rank analysis of incomplete block designs. II. Additional tables for the
method of paired comparisons. Biometrika 41, 502-537.
Bradley, R. A. (1954b). Incomplete block rank analysis: On the appropriateness of the model for a
method of paired comparisons. Biometrics 10, 375-390.
Bradley, R. A. (1955). Rank analysis of incomplete block designs. III. Some large-sample results on
estimation and power for a method of paired comparisons. Biometrika 42, 450-470.
Bradley, R. A. (1976). Science, statistics and paired comparisons. Biometrics 32, 213-232.
Bradley, R. A. and El-Helbawy, A. T. (1976). Treatment contrasts in paired comparisons: Basic
procedures with application to factorials. Biometrika 63, 255-262.
Bradley, R. A. and Terry, M. E. (1952a). The rank analysis of incomplete block designs. I. The
method of paired comparisons. Biometrika 39, 324-345.
Bradley, R. A. and Terry, M. E. (1952b). Statistical Methods for Sensory Difference Tests of Food
Quality, Appendix A. Biannual Rpt. No. 4, Virginia Agric. Exp. Sta., Blacksburg, Va.
Clatworthy, W. H. (1955). Partially balanced incomplete block designs with two associate classes
and two treatments per block. J. Res. Nat. Bur. Stand. 54, 177-190.
David, H. A. (1963). The Method of Paired Comparisons. Griffin, London.
Davidson, R. R. (1970). On extending the Bradley-Terry model to accommodate ties in paired
comparison experiments. J. Amer. Statist. Assoc. 65, 317-328.
Davidson, R. R. and Beaver, R. J. (1977). On extending the Bradley-Terry model to incorporate
within-pair order effects. Biometrics 33, 245-254.
Davidson, R. R. and Bradley, R. A. (1969). Multivariate paired comparisons: The extension of a
univariate model and associated estimation and test procedures. Biometrika 56, 81-95.
Davidson, R. R. and Bradley, R. A. (1970). Multivariate paired comparisons: Some large sample
results on estimation and tests of equality of preference. In: M. L. Puri, ed., Nonparametric
Techniques in Statistical Inference. Cambridge University Press, 111-125.
Davidson, R. R. and Bradley, R. A. (1971). A regression relationship for multivariate paired
comparisons. Biometrika 58, 555-560.
Davidson, R. R. and Farquhar, P. H. (1976). A bibliography on the method of paired comparisons,
Biometrics 32, 241-252.
Davidson, R. R. and Solomon, D. L. (1973). A Bayesian approach to paired comparison
experimentation. Biometrika 60, 477-487.
Dykstra, O. (1956). A note on the rank analysis of incomplete block designs - applications beyond
the scope of existing tables. Biometrics 12, 301-306.
Dykstra, O. (1960). Rank analysis of incomplete block designs: A method of paired comparisons
employing unequal repetitions on pairs. Biometrics 16, 176-188.
EI-Helbawy, A. T. and Bradley, R. A. (1976). Factorial treatment combinations in paired
comparisons. In: Proc. Int. Conf. on Statistics, Computer Science and Social Research 1,
121.1-144.1, Cairo University Press, Cairo.
EI-Helbawy, A. T. and Bradley, R. A. (1977). Treatment contrasts in paired comparisons:
Convergence of a basic iterative scheme for estimation. Commun. Statistic.- Theor. Meth. 6,
197-207.
EI-Helbawy, A. T. and Bradley, R. A. (1978). Treatment contrasts in paired comparisons:
Large-sample results, applications and some optimal designs. J. Amer. Statist. Assoc. 73,
831-839.
326 Ralph A. Bradley
Fienberg, S. E. and Larntz, K. (1976). Log linear representation for paired and multiple com-
parisons models. Biometrika 63, 245--262.
Fleckenstein, M., Freund, R. A. and Jackson, J. E. (1958). A paired comparison test of typewriter
carbon papers. Tappi 41, 128-130.
Ford, L. R. Jr. (1957). Solution of a ranking problem from binary comparisons. Amer. Math.
Monthly 64(8), 28-33.
Glenn, W. A. and David, H. A. (1960). Ties in paired-comparison experiments using a modified
Thurstone-Mosteller method. Biometrics 16, 86-109.
Gridgeman, N. T. (1959). Pair comparison, with and without ties. Biometrics 15, 382-388.
Grizzle, J. E., Starmer, C. F. and Koch, G. G. (1969). Analysis of categorical data by linear models.
Biometrics 25, 489-504.
Guttman, L. (1946). An approach for quantifying paired comparisons and rank order. Ann. Math.
Statist. 17, 144-163.
Harris, W. P. (1957). A revised law of comparative judgment. Psychometrika 22, 189-198.
Hemelrijk, J. (1952). A theorem on the sign test when ties are present, lndag. Math. 14, 322-326.
Kendall, M. G. (1955). Further contributions to the theory of paired comparisons. Biometrics 11,
43-62.
Kendall, M. G. and Babington Smith, B. (1940). On the method of paired comparisons. Biometrika
31, 324-345.
Luce, R. D. (1959). Individual Choice Behavior. Wiley, New York.
Mehra, K. L. (1964). Rank tests for paired-comparison experiments involving several treatments.
Ann. Math. Statist. 35, 122-137.
Mosteller, F. (1951). Remarks on the method of paired comparisons: I. The least squares solution
assuming equal standard deviations and equal correlations. Psychometrika 16, 3-9.
Park, G. T. (1961). Sensory testing by triple comparisons. Biometrics 17, 251-260.
Pendergrass, R. N. and Bradley, R. A. (1960). Ranking in triple comparisons. In: I. Olkin et al.,
eds., Contributions to Probability and Statistics. Stanford Univ. Press, 331-351.
Puri, M. L. and Sen, P. K. (1969). On the asymptotic theory of rank order tests for experiments
involving paired comparisons. Ann. Inst. Statist. Math. 21, 163-173.
Raghavarao, D. (1971). Construction and Combinatorial Problems in Design of Experiments. Wiley,
New York.
Rao, P. V. and Kupper, L. L. (1967). Ties in paired-comparison experiments: A generalization of
the Bradley-Terry model. J. Amer. Statist. Assoc. 62, 194-204, Corrigenda 63, 1550.
Saaty, T. L. (1977). A scaling method for priorities in hierarchical structures. J. Math. Psych. 15,
234-280.
Sadasivan, G. (1982). A Thurstone-type model for paired comparisons with unequal numbers of
repetitions. Comm. Statist. - Theor. Meth. 11, 821-833.
Scheffr, H. (1952). An analysis of variance for paired comparisons. J. Amer. Statist. Assoc. 47,
381-400.
Thurstone, L. L. (1927). Psychophysical analysis. Amer. J. Psych. 38, 368-389.
Wei, T. H. (1952). The Algebraic Foundations of Ranking Theory. Unpublished thesis, Cambridge
University.
Zermelo, E. (1929). Die Berechnung der Turuier-Ergebnisse als ein Maximumproblem der Wahr-
scheinlichkeitsrechnung. Math. Zeit. 29, 436-460.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1
O Elsevier Science Publishers (1984) 327-345 1_...)
Restricted Alternatives
I. Introduction
In many testing problems under both the parametric and the nonparametric
set-up the aspect of the underlying model which is of direct interest to the
experimenter may be described by a k-dimensional (k >/1) real vector
(01, 0 2 , . . . , Ok)'= 0 and the null hypothesis is Ho: 0 = 0. In such situations, so
far as the choice of the structure of the test is concerned, the most important
consideration should be which alternative values of 0 we are especially
interested in detecting.
Traditional tests for H0 have been developed keeping in view all possible
alternative values of 0. These are tests against unrestricted alternatives or
unrestricted tests (of Chapters 1, 5). However, in practice often experimental
conditions are such that 0, if it deviates from 0, can do so only in some but not
all directions. For example, in an agricultural trial 01, 02. . . . . Ok may represent
the effects (reparametrized so that E 0i = 0) of increasing doses of a manure,
and if the manure is effective at all, it may be expected that 01, 02. . . . . Ok would
have progressively increasing values in that order. Similarly, 01, 02. . . . . Ok may
represent the changes in the levels of some physiological character produced by
a drug, and it may be that, if the drug is effective, one or more of the 0i's would
be positive. While it would be perfectly legitimate to use the unrestricted tests
in such situations, it is obvious that it would be more paying if the test, instead
of being sensitive to deviations in all directions, concentrates on the relevant
directions only. In other words, in such situations, specific tests against restric-
ted alternatives or restricted tests are called for.
2. A brief review
327
328 Shoutir Kishore Chatterjee
when we have k > 1. In the parametric context solutions to such problems have
been derived mostly by the likelihood ratio technique (Bartholomew, 1959,
1961; Chacko, 1963; Kudo, 1963; Nueseh, 1966; Shorack, 1967; Perlman, 1969,
Pincus, 1975; a detailed account of many of these results is given by Barlow et
al., 1972). Krishnaiah (1965) derives what he calls finite intersection tests for
certain multiparameter linear hypotheses against restricted alternatives of
special types under the multinormal set-up. These are union-intersection tests
(Roy, 1957) based on a finite family of statistics corresponding to a represen-
tation of the alternative in terms of a finite number of sub-hypotheses.
Under the nonparametric set-up the tool of likelihood is no longer available.
There is also difficulty in extending the union-intersection technique in a
straightforward manner since, generally, no obvious choice for the optimal
tests against the sub-hypotheses is available; nor is the distribution problem
under the nonparametric set-up easily resolvable in terms of independent sets
of statistics as in Krishnaiah (1965). From intuitive considerations non-
parametric restricted tests for a number of testing problems have been pro-
posed and studied by several workers. Particular attention has been received by
the problem of testing homogeneity against ordered alternatives. For this
problem rank solutions have been considered in the case of one-way layout by
Jonekheere (1954), Chacko (1963) and Shorack (1967), and in the case of
two-day layout, by Jonckheere (1954), Page (1963), Hollander (1967), Doksum
(1967), Shorack (1967) and Sen (1968). All these solutions except those of
Chacko (1963) and Shorack (1967) are based on some intuitively suggested
function of rankscores which tends to be large when a trend alternative of the
appropriate type obtains. The solutions of Chacko and Shorack are derived by
using ranks in place of observations in the corresponding parametric likelihood
ratio tests. Among other nonparametric restricted testing problems considered,
we may mention that of comparing the locations of two bivariate populations
against directional alternatives. David and Fix (1961) studied a test for this
based on marginal Wilcoxon statistics whereas Bhattacharya and Johnson
(1970) proposed a test based on 'layer ranks'. Studies of performance of these
nonparametric tests indicate that against the restricted class of alternatives
most of them are fairly sensitive in certain directions and relatively insensitive
in others. This is obviously a consequence of the element of arbitrariness
present in their formulation. To devise a method of derivation of comprehen-
sive restricted tests for nonparametric problems, an adaptation of the union-
intersection technique was considered by Chatterjee and De (1972, 1974), De
(1976, 1977) and Chinchilli and Sen (1981a, 1981b, 1982). In the next section we
describe this approach in some detail.
where
ON = sup A'TN, (3.3)
ItEA l
as our critical region for testing H0 against H~. The cut-off point c would have
to be adjusted with reference to the asymptotic distribution of ON so that the
330 Shoutir Kishore Chatterjee
asymptotic size of the overall test has the stipulated value. In many problems it
would happen that O(~) and hence, A1 would depend on some unknown para-
meters of the parent distribution. Naturally then, ON given by (3.3), as also its
limiting distribution under H0, would involve the same. In such cases it would
generally be possible to find consistent estimates of the unknown parameters,
and substituting these in QN we would get a statistic ON determined solely by
the sample. Generally, continuity considerations will show that ON and ON
would be asymptotically equivalent. A similar substitution may have to be
made in the limiting distribution for determining the cut-off point up to an
asymptotic approximation.
The approach outlined above, may seem rather too narrow, the component
tests being chosen as right-tailed tests based on linear compounds of the form
~'TN. Nevertheless, for all the restricted testing problems that we have in mind
we get reasonable solutions even confining ourselves to such a special class. In
contrast with the classical union-intersection approach, the proposed adaptation
has one distinctive feature. In the classical approach we first identify the
subhypotheses HI~ comprising /-/1 and choose a good test against each HIA,
whereas here we start from a class of component tests and demarcate the
regions O(~) against which the component tests perform best. This reversal of
priorities helps us to utilize known results on the asymptotics of standard
statistics and simplifies the problem.
One crucial technical question in carrying through the above programme is
how to demarcate the region O(~) against which the right-tailed tests ,~'TN > c
perform best. In Chatterjee and De (1972) and De (1976,77) the performance
of a sequence of tests against any fixed alternative 0 ~ 0 was measured in terms
of the corresponding Bahaudur slope and O(~) was defined as consisting of
those points 0 for which this slope is maximum for the A considered. One
difficulty in this approach is that in it one often requires to have 0-free
estimates of certain correlation coefficients. As this creates difficulty in the
more complex restricted testing problems, in this article, we follow Chinchilli
and Sen (1981a) to judge performance of a sequence of tests in an entire
direction such as {0:0 = M~', M > 0} (M is a variable scalar; ~- ~ 0 determines
the direction) by the asymptotic power against contiguous alternatives ap-
proaching H0 along that direction. O(~) consists of the direction in which the
tests ),'TN > c maximize the asymptotic power (The assumption of consistency
of the tests ,~'TN > c against some 0 ensures that for each ~ the asymptotic
power can be meaningfully defined in some directions). Here the region 01 is
assumed to be positively homogeneous in the sense that it consists of a
collection of complete directions and the union-intersection technique is
applied on that basis. However, for two major types of restricted testing
problems namely, those involving orthant alternatives and order alternatives
(See Section 4), in single-sample and multi-sample situations, the Bahadur
slope approach and the asymptotic power approach lead to identical or almost
identical tests.
Restricted alternatives 331
aiu(a), a = 1. . . . , N , (4.2)
N
SiN = ~ aiN(Ria), i=1 ..... p. (4.3)
a=nl+l
n2/N ~ v . (4.5)
So that
332 Shoutir Kishore Chatterjee
We write
1 & 1 N
a,N = ~- ,,=~,a,t~(a), V~jN= ~ ~=~={a~N( R , . ) - ai.N}{ajN(R#,)-- gtj.N}
1 t"
~i =
f0 ~oi(u) du, vii = J {q~i(Ftil(Xi))- gS,}{q~j(Ft/l(xj))- qSj}f(x) dx,
where Ftil(x ) denotes the cdf of the i-th variable for the distribution f(x), dx
stands for 17dx~ and the integral in v# is taken over R p.
Let
It can be shown that under the assumptions made, under H0, N-~ ~,
.Le
o,, ~ NAo, hG), aN L G (4.10)
(see Hfijek and Si&ik, 1965, Chapter V; and Puri and Sen, 1971, Chapter V).
If the variables in the parent distribution are not a.s. functionally related, G
is p.d., and hence, GN is p.d. in probability, so that from (4.10) we get
Here the fixed vector z represents the direction along which 0 (N) approaches 0.
We now assume that the density f ( x ) has continuous first partial derivatives
and the Fisher information matrix for f ( x ) exists. Then (4.5) implies that the
Restricted alternatives 333
d x
fill(x) = ~x Fill(X)'/tO(x) = dxx f[il()
O,(u) : - / t , l ( F ~ ( u ) ) / f t , l ( F ~ ( u ) ) (4.13)
Yi = v,i l/2
I0' q~i(u)q'i(u) du, F = diag(yl . . . . . yp),
Therefore using q~(-) to denote the standard normal cdf, the asymptotic
power of the tests A'TN > c against 0 (N) comes out as
Thus, given ~', the asymptotic power would be maximum for I given by (4.16).
Reciprocally, given A, we can say the tests A'TN > c perform best in the
direction 7 given by (4.16) (in the sense that for that direction no other A*
gives a higher asymptotic power). Therefore, we can define
as the subclass of alternatives against which the sequence of tests )t'TN > c has
best performance.
We now assume that yi > 0, i = 1 , . . . , p. From (4.13) it may be shown that
this holds if, for each i, Oi(u) is increasing and/or limx-~*~~pi(F[il(X))f[il (X) = O.
334 Shoutir Kishore Chatterjee
Under this assumption (3.1) holds for O~a) given by (4.17) and 0 1 =
{0: 0/>0, 0 # 0 } is we take
A 1 = {A: G1/=lt/> 0, A'~t = 1}. (4.18)
where we write Gl/Z)t = u. (We replace 'sup' by 'max' since the function is
continuous and the domain compact.) As noted in the paragraph following
(3.3), in practice, G would have to be replaced by its estimate GN (cf. (4.10).
Then, by (4.11), the approximate union-intersection statistic so obtained would
be
-Qn-- hTq1/2 max u'GTvlDN. (4.19)
u~>0
u'G~1u= 1
ON = hg:/E(DhG?v~DN) m if DN > 0 ,
= hTvm max{DpN - g12.ND2N, DaN - g12.ND1u}/(1- g~2.N)1/z (4.20)
if DN :~0,
XZ~denoting a chi-square r.v. with d.f. 1:. As noted earlier, in practice, we have
to use g12.N in place of g12 while determining the cut off point c from (4.21).
For general p, the boundary of the domain of u in (4.19) is more complex
and as such the determination of ON is rather complicated. Viewing the
problem as one "of nonlinear programming, we can apply the K u h n - T u c k e r
theory (see Hadley, 1964) and observe the following.
(i) The maximum in (4.19) would be attained at a point
Restricted alternatives 335
U" /,/1 > 0 . . . . . /,/k > 0, Uk+l . . . . . I,/p = 0 ( l ~ < k <~p) (4.22)
ON = hN-1/2 [Dv2.NGw2.NDv2.N]
t -I 1/2 , (4.25)
where
-1
D1.2.N = D1.N -- G12.NG22.ND2-N,-1GI1.2.N = G11.N -- Glz.NG22.N GZl.N.
(4.26)
(ii) If for some 1 ~< il < " " " < ik <~p, the m a x i m u m is attained at a point
we can permute the elements of u so that uil, , uik occur in the first k positions.
If we now m a k e the same permutation of the rows and columns of GN and the
elements of DN (SO that the maximization problem remains same) conditions
(4.24) would hold .for the new GN and DN and ON would have the expression
(4.24) in terms of these.
(iii) If G?vlDN ~ 0 conditions (4.24) will hold for some k and some per-
mutation of the elements. If G?~IDN <~O, this is no longer true, and there,
writing G h 1 = (gij.N)
ON = h ) 1/2 max
.=
gq-UDj.N
/, g,.~)m, i = 1 , . . . , p
1 . (4.28)
where it is to be understood that for each choice of i l , . , ik, rows and columns
of G are to be permuted so that i r t h . . . . . ik-th rows (columns) occur in the
first k positions and from the transformed matrix G l 1 . 2 and G22 are to be
determined as in (4.23) and (4.26). By standard techniques we can then deduce
that, for c >/0,
p
The details we have worked out above for the location case hold with minor
modifications for other p-parameter problems against orthant alternative. For
the p-variate two-sample scale problem, the samples (4.1) are supposed to be
from two populations with density functions f(x~-lZx, . . . , X p - / X p ) and
e-Z' f(e-l(xl -/Zl) . . . . . e-p(Xp -- JA,p)), ].Z x ..... I,.p being nuisance parameters.
The hypotheses, with 0 as defined here, are same as before. The scores (4.2)
and ~i(u) in (4.4) are no longer nondecreasing as in the location case. One
should use here some system of scores appropriate for the scale problem, such
as at(a) = [(a/N + 1 ) - 1, ((a/N + 1 ) - ~)2 or Elfln:~ (see e.g. H~ijek and Sid~ik,
1965, Chapter III). The functions q~i(u) would now be expressible as the
difference of two functions each satisfying the conditions (a)-(c). The rest of
the development remains same except that the functions ~bi(u) in (4.13) have to
be replaced by their counterparts for the scale problem (Hftjek and Sid~tk,
1965, Chapter VI). The expressions for the test statistic and the tail probability
remain same.
For the p-variate regression problem we have N independent observations
X~, a = 1. . . . . N, X~ following the distribution f ( x - c,,nO) where c~N,a =
1 . . . . . N, are known numbers. We want to test H 0 : 0 = 0 against /-/1:0/> 0,
0 0 as before. Converting the observations into rank scores just as in the
location case, we now set up
N 1 N
~ (can - gN)a,N(R~),
Sis = ~=1 gN = Na~=l= C~l~, i = 1. . . . , p. (4.31)
1 1 N
max
( c , ~N , - ~'N)2 --->0, hN = -'~a=l(CaN= -- CN) 2 ' ' ) h , (4.32)
are set up. The scores satisfy (4.4) with respect to functions ~oi(u) which are
positive nondecreasing and satisfy the conditions (b)-(c) stated earlier. (Some
possible choices of the scores are aiN (a) = 1, od(N + 1), or E V ~ : ~, where Vfv:, is
the a-th smallest absolute value for N observations from N(O, 1)). Writing
1
~, aiN(R,~)a~N(R j~) sign(X/~) sign(Xj~),
vq.N N ot=l
and defining GN and G as in (4.7) and (4.8), we see that (4.10)-(4.11) hold with
h/v = h = 1. After modifying the definition of the functions ~bi(u) appropriately
(cf. Hfijek and Sidfik, 1965, Chapter VI), from contiguity (4.14) follows. The
subsequent development leading to the expressions for ON and its tail prob-
ability carries through as before.
In all the applications considered above we have assumed that under H1, 0
lies in the positive orthant. If H1 is represented by some other orthant the
definition of A1 given by (4.18) would have to be appropriately modified. This
would entail obvious modifications in the form of ON and its tail probability.
For the two-sample and single sample location problems and the regression
problem, an alternative approach would be to make appropriate initial changes
in the signs of the variables so that the problem is transformed into a positive
orthant problem. The earlier expressions can then be used for the transformed
set-up.
Then writing
(UN:, is the a-th smallest of N observations from the distribution rect. (0, 1))
an estimate of F can be derived from the results of Jure6kovfi (1969, 1971) (see
Chinchilli and Sen, 1981a). Thus in the two-sample location case
S (1)
iN = S i N ( X i l , " " " ~ Xi~; Xinl+ 1 - 1,. " ", X ~ - 1)
Restricted alternatives 339
where 0~'s are treatment effects, aj's are block effects, and e~i's are errors. The
vectors ei = (eli. . . . . epi)', i = 1 . . . . . n , are i.i.d, with a permutation-invariant
common distribution. The problem is to test H : 0 = 0 against a simple order
alternative HI: 01 <~ 02 ~< ~< Op, 01 < Op. We consider here an approach to the
problem based on ranking after alignment (cf. Puri and Sen, 1971, Chapter
VII). The approach is quite flexible and can be extended to less simple designs.
The first step is to align the observations to eliminate the block effects. For
simplicity, we consider only alignment based on block means. If X0j =
E~=I
P x o / p , we take Yii = X i i - X o i , i = 1 , . . . , p , ] = 1, . . . , n, and rank together
all the N = p n Y~i's. If R~i is the rank of Y0 so obtained, taking a set of N
nondecreasing scores a N ( a ) , a = 1 . . . . . N , satisfying (4.4) with respect to some
~0(u), we form
Writing
1 " P _ 1 ~ aN(Rij),
v, - n ( p - D Z Z { a u ( R o ) - tij..} z, aj.u : P i=1
(4.39)
J j = l i=1
340 Shoutir Kishore Chatterjee
we set up
Simple expressions similar to (4.20) and (4.21) would hold for a test derived
from two linear compounds. Some studies made by De (1977) indicate that
such compromise tests would generally have satisfactory performance and that
they would represent definite improvements over earlier tests based on single
linear compounds (see references given in Section 2).
To apply the restricted tests described in the preceding section one would
require tables for determining the critical value c corresponding to different
probability levels, or equivalently, the tail areas corresponding to different
values of the statistic. As has been seen, for the problems considered in
Subsections 4.1-4.3, the tail area can be expressed as the weighted sum of the
tail areas of certain x2-distributions. The tail probability of likelihood ratio
criteria for certain parametric restricted testing problems (Barlow et al., 1972,
Chapters III-IV) comes out in the same form. In both situations the problem of
determination of the tail probability would be solved if one can give workable
342 Shoutir Kishore Chanerjee
procedures and tables for computing the weights, which are in reality orthant
probabilities of a multinormal distribution.
In the case of bivariate two-parameter problems the weights in the expres-
sion (4.21) for the tail probability are simple trigonometric functions of the
single correlation coefficient g12 and hence, can be readily determined. A table
for c based on these is reproduced in Barlow et al. (1972, Table A.1) for
negative values of g12 (such values only are possible in the context of the
problem of homogeneity of three ordered means in connexion with which, the
table was prepared. Note, however, that the values tabulated in Barlow et al.
(1972) correspond to 0 ~ in our notation). An unpublished table covering both
negative and positive values of g12 is contained in De (1977).
Real difficulty in tabulation, however, arises in the case of problems involv-
ing more than two parameters. The weights in the expression (4.30) depend on
~ p ( p - 1) correlation coefficients represented by the p p matrix G and it would
be impracticable to attempt a table covering the ranges of so many parameters.
One needs here workable expressions for these weights in terms of G. Useful
expressions are available for p = 3. For 4 ~<p ~<6 one can follow a procedure for
computer evaluation of multinormal orthant probabilities described by Milton
(1972). In the general case a satisfactory solution to the problem valid for all G
remains to be evolved. An account of the available results and further references
are given in Barlow et al. (1972, Chapter III).
For the problem of testing homogeneity against ordered alternatives in
one-way or multi-way lay-outs, tabulation is relatively simple if all the treat-
ments are equally replicated. (This condition naturally obtains in the case of a
RBD.) The expression for the tail probability here depends only on p, the
number of treatments. In this case a table of critical values covering p ~< 12 is
reproduced by Barlow et al. (1972, Table A.3).
6. Gain in sensitivity
native described in Section 4. The unrestricted test statistic here is always given
by the first expression under (4.20). Asymptotically, it has a xz-distribution
under H0 and the distribution of X~.8 (noncentral chi with d.f. 2 and noncen-
trality parameter 8) where 6 = h~r'FG-1F~r. Thus the asymptotic power of the
size-a unrestricted test is given by
Asymptotic power, of the size-a restricted test based on ON, derived from
(4.14) and (4.20), depends not only on 6 but also on g12 and the ratio r, yl/r2"yz.
Using the properties of a M L R family of distributions, for r > 0 a lower bound
to this can be derived in the form
Now, using the notation fs(x) to denote the density function of X,.8
,2 and Z to
denote a r.v. following N(0, 1), from (6.1) and (6.2),
Table 1
M a x i m u m and m i n i m u m asymptotic power of the restricted test against quadrant alternative versus
the asymptotic power of the unrestricted test for bivariate two-parameter problems
Level of Noncentrality p a r a m e t e r 6
significance
= 0.05 1.00 4.00
(1981b). The extension of the result to the general case, however, is still an
open problem.
For bivariate two-parameter problems with quadrant alternative, the maxi-
mum and the minimum asymptotic power of the restricted test depend
on the noncentrality parameter 6 and the correlation coefficient g12. Table
1 reproduced from Chatterjee and De (1972) compares their values with
the asymptotic power of the unrestricted test at 5 per cent level of significance
for two values for 6. The figures show a considerable gain for the restricted test
the improvement being more marked with g12 becoming negatively large.
Numerical studies performed i n the trivariate case (see Chinchilli and Sen,
1918b) and in the case of the homogeneity problem against order alternative
with multiple treatments (Barlow et al., 1972, Chapter III) seem to indicate
even more remarkable gains for the restricted tests.
In the above we have compared the performances of tests in terms of
asymptotic power. Since the power functions of restricted and unrestricted tests
have different forms no single value of A R E can be given here. One can think
of comparison in terms of some other measures of test performance. It can be
shown that, for the type of problem we have considered here, the unrestricted
and restricted tests have identical Bahadur slopes (Bahadur, 1960) so that their
relative Bahadur efficiency is 1. Chandra and Ghosh (1978) discuss the com-
parison of tests with relative Bahadur efficiency 1 in terms of what they call
Bahadur Cochran deficiency. This approach may throw some light on the
problem considered by us. The details, however, would require considerable
working out.
References
Bahadur, R. R. (1960). Stochastic comparison of tests. Ann. Math. Statist. 31, 276-295.
Barlow, R. E., Bartholomew, D. J., Bremner, J. M. and Brunk, H. D. (1972). Statistical Inference under
Order Restrictions, Wiley, New York.
Restricted alternatives 345
Bartholomew, D. J. (1959). A test of homogeneity for ordered alternatives. Biometrika 46, 36-48.
Bartholomew, D. J. (1961). A test of homogeneity of means under restricted alternatives, J. Roy.
Statist. Soc. Ser. B 23, 239-281.
Bhattacharya, G. K. and Johnson, R. A. (1970). A layer rank test for ordered bivariate alternatives.
Ann. Math. Statist. 41, 1296-1309.
Chacko, V. J. (1963). Testing homogeneity against ordered alternatives. Ann. Math. Statist. 34,
945-956.
Chandra, T. K. and Ghosh, 3. K. (1978). Comparison of tests with same Bahadur efficiency. Sankhya
Set. A 40, 253-277.
Chatterjee, S. K. and De, N. K. (1972). Bivariate nonparametric tests against restricted alternatives.
Calcutta Statist. Assoc. Bull. 21, 1-20.
Chatterjee, S. K. and De, N. K. (1974). On the power superiority of certain bivariate location tests
against restricted alternatives. Calcutta Statist. Assoc. Bull. 23, 73-84.
Chinchilli, V. M. and Sen, P. K. (1981a). Multivariate linear rank statistics and the union-intersection
principle for hypothesis testing under restricted alternatives, sankhya Set. B 43, 135-151.
Chinchilli, V. M. and Sen, P. K. (1981b). Multivariate linear rank statistics and the union-intersection
principle for the orthant restriction problem. Sankhya Ser. B 43, 152-171.
Chinchilli, V. M. and Sen, P. K. (1982). Multivariate linear rank statistics for profile analysis, jr.
Multivariate Anal. 12, 219-229.
David, F. N. and Fix, E. (1961). Rank correlation and regression in non-normal surface, in: Proc. Fifth
Berkeley Symposium, Vol. 1, 177-197.
De, N. K. (1976). Rank tests for randomized blocks against ordered alternatives. Calcutta Statist.
Assoc. Bull. 25, 1-27.
De, N. K. (1977). Multivariate nonparametric tests against restricted alternatives. Ph.D. thesis.
Calcutta University.
Doksum, K. (1967). Robust procedures for some linear models with one observation per cell. Ann.
Math. Statist. 38, 878-883.
Hadley, G. (1964). Non linear and Dynamic Programming, Addison-Wesley, Reading, MA.
Hfijek, J. and Sid~ik, Z. (1967). Theory of Rank Tests. Academic Press, New York.
Hollander, M. (1967). Rank tests for randomized blocks when the alternatives have a priori ordering.
Ann. Math. Statist. 38, 867-877.
Jonckheere, A. R. (1954). A distribution-free k-sample test against ordered alternatives. Biometrika
41, 135-145.
Krishnaiah, P. R. (1965). Multiple comparison tests in multiresponse experiments. Sankhyd Set. A 27,
65-72.
Kudo, A. (1963). A multivariate analogue of the onesided test. Biometrika 50, 403-418.
Milton, R. C. (1972). Computer evaluation of the multivariate normal integral. Technometrics 14,
881-889.
Niiesch, P. E. (1966). On the problem of testing location in multivariate populations for restricted
alternatives. Ann. Math. Statist. 37, 113-119.
Page, E. B. (1963). Ordered hypothesis for multiple treatments, a significance test for linear ranks. J.
Amer. Statist. Assoc. 58, 216-230.
Perlman, M. D. (1969). One-sided testing problems in multivariate analysis. Ann. Math. Statist. 40,
549-567.
Pincus, R. (1975). Testing linear hypotheses under restricted alternatives. Math. Operationsforsch.
Statist. 6, 733-751.
Puri, M. L. and Sen, P. K. (1971). Nonparame#'ic Methods in Multivariate Analysis. Wiley, New
York.
Roy, S. N. (1957). Some Aspects of Multivariate Analysis. Wiley, New York.
Sen, P. K. (1968). On a class of aligned rank order tests in two-way lay-outs. Ann. Math. Statist. 39,
1115-1124.
Shorack, S. R. (1967). Testing against ordered alternatives in model I analysis of variance; Normal
theory and nonparametric. Ann. Math. Statist. 38, 1740-1753.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 llt~
/l_J
Elsevier Science Publishers (1984) 347-358
Adaptive Methods
M. Hugkovd
1. Introduction
The aim of this chapter is to present the main ideas of adaptive procedures,
to summarize their basic structure, to state their properties and to review the
typical ones. In this context, the main emphasis is laid on the one-sample and
two-sample models, which are presented below.
One-sample model: Let X1 . . . . . X, be a sample from a distribution with the
probability density function (p.d.f.) f(x - 0), x ~ R (the real line), where f is
symmetric about 0, absolutely continuous and has a finite Fisher information
347
348 M. Hu~kovd
q , ( X / - t) -~ o, (1.3)
i=1
where R+(t) is the rank of IXi - t[ among I X 1 - t[ ..... IX. - t[, for i = 1 . . . . . n,
and 4~ is a monotone, square integrable score function (defined on (0, 1)). The
L-estimator has the form
where X ( 1 ) ~ ' " ~ X ( n ) stand for the ordered random variables (r.v.) cor-
responding to X~ . . . . . X,. Sometimes, linearized versions of M- and R-estima-
tors are used; these may be defined as follows:
(f -1 n
Ol~(~b) = O - n -1 ~(x)f'(x) d x ) ~ 0 ( X / - 0), (1.6)
/ i=1
For the two-sample model, these estimators are defined in an analogous way
(see Chapter 21 (by Jure~kovfi)).
If the pdf f is known and satisfies some regularity conditions, then the test
for H0 against A, based on S+(tbf) and SN(4q) leads to asymptotically most
powerful tests for contiguous alternatives (see Chapter 3 (by Hugkovfi) for the
one-sample model); here,
(nI(f))'/2(O- 0)~2'(0, 1) as n ~ .
J ( t ) = ( 1 - 2 a ) -1 f o r a ~ < t ~ < l - a ,
=0 otherwise.
2. Restrictive procedures
with O~ and /S~ being the average of 100a % the largest and smallest order
statistics, resp. The motivation for this decision rule, if n ~<20, comes from the
fact that the optimal translation and scale invariant test that the considered
sample is from the uniform distribution versus double exponential one can be
(approximately) based on O given by (2.1). As for n > 20, notice that
Q~3.3 [ f E ~(fl),
0-->2.6 in probability as n ~ ~ for ~ f E ~(f2),
O ~ 1.96 [rE ,~(f3)
Then the test statistics are chosen according to the general rule except
o%(f3)-the authors recommend to use some modified Wilcoxon one-sample
test.
Some modifications of this decision rule were used to obtain other adaptive
procedures (also for two-sample case), e,g. Hogg, Fisher and Randles (1975)
considered the decision rule based on Q and a measure asymmetry
Adaptive methods 351
O* - 0o.o5-
1 ~ 0 . 5 - J~O.05 '
where 1 ~ is the average of 100a % middle order statistics. Harter, Moore and
Curry (1979) suggested adaptive estimators of location and scale parameter
with three kinds of decision rules (the sample kurtosis, Q and the sample
likelihood). Another version of the decision rule based on O was developed by
Moberg, Ramberg and Randles (1978, 1980) to obtain adaptive M-type esti-
mators with application to a selection problem and in a general regression
model.
Another procedure based on tail behavior was suggested by H~ijek (1970) for
the family o~ = { o ~ ( f l ) . . . . , o~(fk)}, where fi are distinct symmetric densities.
The decision rule consists, in fact, in choosing ~0~) for which the quantile
function corresponding to )~ is close to the sample quantile function. The
procedure is very quick but no properties were studied.
Jones (1979) introduced the family ~ = {f~, A ~ R 1}, where f~ satisfies
F 2 ' ( u ) = (u :~ - (1 - U)A)/A
= (log 2)-1 log{[lX I(n-2M+l)- IXI (n-4M+l)] " [IX] (n-M+l)- Ixl (n-2M+l)]-1}
where M is chosen in some proper way reflecting the behavior of the tail. As
the resulting ~0-function is taken ~o(u,f~).
As examples of procedures not motivated by the behavior of tails, we shall
sketch two procedures published by Hfijek (1970) for a general family f f =
{~(fl) . . . . . ~(fk)}, where l b . - - , fk are distinct densities and the procedure by
Albers (1979). In the first procedure, the decision rule is the Bayesian trans-
lation and scale invariant rule and the second one is based on the asymptotic
linearity of rank statistics, the third one utilizes the estimate of the kurtosis. In
order to have the decision rule dependent only on the ordered sample
IXI(1) . . . . . IXI(, ) (corresponding to IXd . . . . . IX, I) we define new random
variables X* = VilX](o.~, i = 1 . . . . . n, where (Ol . . . . . O,) is a random per-
mutation of (1 . . . . . n) and (V, . . . . . V,) are i.i.d, with P(VI = 1) = P ( V / = - 1 ) =
independent of X, . . . . . X,. Then under H the random vector ( X ~ ; , . . . , X * ) is
independent of (RT,...,R+~) and (sign Xx . . . . , s i g n X , ) and distributed as
(Xl,... ,X,).
The Baysian. translations and scale invariant rule (if all types are a priori
equiprobable) yields the following: choose ~ ( f l ) if maxl~j~k p ~ n ( X T , . . . , X * ) =
352 M. HuJkovd
pi.(X~f . . . . , X *) where
where fij, and d-j, are maximum likelihood estimators of location and scale for
j-th distribution, instead of p j , ( X T . . . . . X * ) .
The decision rule based on the asymptotic linearity of rank statistics (fa . . . . . fk
absolutely continuous and I0~) < +~, j = 1. . . . . k) is defined as follows:
with R ~ ( t ) being the rank of IX* + tl in the sequence [X~ + tl .... , [X*~ + tl.
Notice that by the asymptotic linearity (see van Eeden, 1972)
1 / f1 \ - 1/2
f0 +U +b/ )d. j ,
The procedures with the decision rules based on O, the linearity of rank
statistic and the Bayesian rule can be easily modified to the two-sample case
(see Randles and Hogg, 1973; Hfijek, 1970).
At the end of this section let us mention some adaptive a-trimmed means,
more exactly, possible choice of a according to the data. Hogg (1967) recom-
mended to choose a according to the kurtosis, later, in 1974, he developed the
rule based on O. Prescott (1978) and de Wet and van Wyk (1979) studied some
modifications of these rules. Another procedure was suggested by Jaeckel
(1971).
3. Nonrestrictive procedures
Here will be presented estimators of the functions ~0i, g,y, Js, some of their
asymptotic properties and also the resulting test statistics and the estimators.
Hfijek (1962) proposed the following estimator of q~r based on ordered
sample 32(1)~<-.- ~<Xc.I:
Beran (1974) suggested another estimator (say ft.) of q~s through the estima-
tors 6k of the Fourier coefficient Ck belonging to ~ot(u). Namely,
At.
~b.(u) = ~'~ 6k exp{2rriku}, Ck = T.(X, exp{-2wiku}) (3.2)
Ikl=l
j=l
j~v
354 M. Hugkovd
m n
HN = ~ F + ~ G ,
where i = min{i; i ~<j ~< k - i + 1, h(tj) ~>A}+ 1. Under very mild conditions /gr
is asymptotically optimal. Other adaptive L-type estimators were presented by
Takeuchi (1971) and Johns (1974).
Stone (1975) developed an adaptive estimator of maximum likelihood type.
Write g for the density of X (i.e. g(x) = f(x - 0)). The following estimator of
the function 0 is used:
There exist some other kinds of estimators that can be classified as adaptive
ones, e.g. Parr and Schucany (1980), Beran (1977, 1978), Rao, Schuster and
Littel (1975). Let us mention briefly two of them. Rao, Schuster and Littel
(1975) and Beran (1978) suggested estimators of 0 in the one-sample model
motivated by the symmetry of f. Writing again G and g for the distribution
function and the density, respectively, of X, we have
Using the Hellinger distance instead of the Kolmogorov one, Beran (1978)
arrived at the estimator 02.; ~J2. is that value of t minimizing
f(V - + 2t)) 2 dx
where
References.
Albers, W. (1979). Asymptotic deficiencies of one-sample rank tests under restricted adaptation.
Ann. Statist. 7, 944-954.
Behnen, K. and Neuhaus, G. (1981). Two-sample rank tests with estimated scores and the Galton
test. Preprint 81-10, Hamburg Universit~it.
Behnen, K., Neuhaus, G. and Ruymgaart, F. (1982). Chernoff-Savage theorem for rank statistics
with estimated scores and rank estimators of score functions. Preprint 82-2, Hamburg Uni-
versit~it.
Beran, R. (1974). Asymptotically efficient adaptive rank estimates in location model. Ann. Statist.
2, 63-74.
Beran, R. (1977). Minimum Hellinger distance estimates for parametric models. Ann. Statist. 5,
445-463.
Beran, R. (1978). An efficient and adaptive estimator of location. Ann. Statist. 6, 292-313.
Bickel, P. J. (1982). On adaptive estimation. Ann. Statist. 10, 647-671.
van Eeden, C. (1970). Efficiency-robust estimation of locution. Ann. Math. Statist. 41, 172-181.
van Eeden, C. (1972). An analogue for signed rank statistics of Jure6kovfi's asymptotic linearity
theorem for rank statistics. Ann. Math. Statist. 43, 791-802.
Hfijek, J. (1962). Asymptotically most powerful rank order tests. Ann. Math. Statist. 33, 1124-1147.
Hfijek, J. (1969). A Course in Nonparametric Statistics. Holden-Day, San Francisco.
Hfijek, J. (1970). Miscellaneous problems of rank test theory. In: Nonparameteric Techniques in
Statistic Inference. London-Cambridge University Press, pp. 3-17.
Hatter, H. L., Moore, A. H. and Curry, T. F. (1979). Adaptive robust estimation of location and
scale parameter. Comm. Statist., A 8, 1473-1491.
Hogg, R. V. (1967). Some observations on robust estimation. J. Amer. Statist. Assoc. 62,
1179-1186.
Hogg, R. V. (1972). More light on the kurtosis and related statistics. J. Amer. Statist. Assoc. 67,
422--424.
Hogg, R. V. (1974). Adaptive robust procedures: partial review and some suggestions for future
applications and theory. J. Amer. Statist. Assoc. 69, 909-923.
Hogg, R. V. (1976). A new dimension to nonparametric tests. Comm. Statist. Th. Methods A 5,
1313-1325.
Hogg. R. V., Fisher, D. M. and Randles, R. G. (1975). A two-sample adaptive distribution-free
test. J. Amer. Statist. Assoc. 70, 1020-1034.
Hogg, R. V. et al. (1972). On the selection of the underlying distribution and adaptive estimation.
J. Amer. Statist. Assoc. 67, 579-600.
Jaeckel, L. A. (1971). Some flexible estimates of location. Ann. Math. Statist. 42, 1540-1552.
Jones, D. H. (1977). A one-sample adaptive distribution-free test with a stable power function.
Commun. Statist. A 5, 251-260.
Jones, D. H. (1979). An efficient adaptive distribution-free test for location. 3. Amer. Statist. Assoc.
74, 822-828.
Johns, M. V. (1974). Nonparametric estimation of location. J. Amer. Statist. Assoc. 69, 453-460.
Jure~kovfi, J. (1969). Asymptotic linearity of a rank statistic. Ann. Math. Statist. 40, 1889-1900.
Kraft, C. and van Eeden, C. (1970). Efficient linearized estimates based on ranks, in M. L. Puri
(ed.), Nonparametric Techniques in Statistical Inference. Cambridge University Press, London.
Moberg, T. F., Ramberg, J. S. and Randles, R. H. (1978). An adaptive M-estimator and its
applications to a selection problem. Technometrics 20, 255-263.
Moberg, T. F., Ramberg, J. S. and Randles, R. H. (1980). An adaptive multiple regression
procedure based on M-estimators. Technometrics 22, 213-224.
Parr, W. C. and Schucany, W. R. (1980). Minimum distance and robust estimation. J. Amer. Statist.
Assoc. 75, 616-624.
Policello, G. E. and Hettmansperger, T. P. (1976). Adaptive robust procedures for the one-sample
location problem. J. Amer. Statist. Assoc. 71, 624-633.
358 M. Hugkovd
Prescott, P. (1978). Selection of trimming proportions for robust adaptive trimmed means. J. Amer.
Statist. Assoc. 73, 133-140.
Rao, P. V., Schuster, E. F. and Littel, R. C. (1975). Estimation of shift and center of symmetry
based on Kolmogorov-Smirnov statistics. Ann. Statist. 3, 862-873.
Randles, R. H. and Hogg, R. V. (1973). Adaptive distribution-free tests. Comm. Statist. 2, 337-356.
Randles, R. H., Ramberg, J. S. and Hogg, R. V. (1973). An adaptive procedure for selecting the
population with largest location parameters. Technometrics 15, 769-778.
Rieder, H. (1980). Locally robust correlation coefficient. Comm. Statist. A 9, 803-819.
Sacks, J. (1975). An asymptotically efficient sequence of estimators of a location parameter. Ann.
Statist. 4, 285-298.
Samanta, M. (1974). Efficient nonparametric estimation of a shift parameter. Sankhya Ser. A 36,
273-292.
Stone, C. J. (1975). Adaptive maximum likelihood estimators of a location parameter. Ann. Statist.
3, 267-284.
Takeuchi, K. (1971). A uniformly asymptotically efficient estimator of a location parameter. J.
Amer. Statist. Assoc. 66, 292-301.
Uthoff, V. A. (1970). An optimum test property of two-well-known statistics. J. Amen. Statist. Assoc.
65, 1597-1600.
de Wet, T. and van Wyk, J. W. J. (1979). Efficiency and robustness of Hogg's adaptive trimmed
means. Comm. Statist. A 8, 117-128.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1~7
.ILl
Elsevier Science Publishers (1984) 359-382
Order Statistics
Janos Galambos
1. Introduction
This work was supported by Grant MCS-7912139 of the National Science Foundation to Temple
University.
360 JanosGalambos
understanding such models, we should investigate the order statistics of
dependent random variables.
The reader who would like to study in more detail one or several aspects of
the theory discussed here, can consult the following books: David (1981)
(statistical methods, basic theory, extensive references), Galambos (1978)
(extreme value theory in the univariate and multivariate case, including
dependent models, extensive references), Gumbel (1958) (statistics of extremes
in the i.i.d, case) and the four volume set by Johnson and Kotz (1968-1972) (the
order statistics of random samples for specific distributions). Additional
references are given within the text. The development of the theory here,
however, is self-contained and it is substantially different from any one of the
books listed above.
Since the distribution of m,(x) is binomial with parameters n and F(x), (2.1)
yields
=r(7)f~(xlf-*(1-t)"-rdt. (2.2)
In particular,
If F(x) has a density function F'(x)=f(x), then we get from (2.2) through
(2.4)
W e also d e d u c e a formula for the joint density function frl, r2..... rk:n
X (Xl, X2, , Xk) of the o r d e r statistics X r . , , 1 <<-j <~ k, 1 <~ rl < r2 < " " < rk <~ n,
when the population density function f(xt" exists. First notice that
if the inequalities xl < x2 < " - < Xk fail to hold. Now, for xl < x2 < " " < Xk,
P ( x j ~< Xg:, < x~ + Axj, 1 ~<j ~< k) - f,l, '2..... ,k:~(xl, X2 . . . . . Xk) AX1 AX2"'" AXk.
O n the o t h e r hand, the event {xj ~<Xr:~ < x j + Axj, 1 ~<j ~< k}, for small 2~xj,
1 ~<j ~< k, m e a n s that, for each j, one of . ~ , X2 . . . . , X , is approximately xj, and
the other Xt are spread between the values Xo = - ~ < xl < < Xk < Xk+~ = + ~
with the following rule: there are exactly r j - r]_a- 1 a m o n g the Xt which satisfy
xj_~<Xt<xj, l~<j~<k+l, where r o = 0 and r k + l = n . H e n c e , an easy com-
binatorial calculation yields that, for x~ < x2 < < Xk,
f R ( Z ) = n ( n -- 1) f +? f ( x ) d ( X + z ) [ F ( x + z ) - F ( x ) ] "-2 d x ,
FR(Z) = n
f/ ~ f(x)[F(x + z)- F(x)] "-1 d x . (2.7)
THEOREM 2.1. For the uniform distribution F ( x ) = x, 0 < x < 1, the differences
X,+I:, - X,:, are identically distributed with c o m m o n density f u n c t i o n
THEOREM 3.1. If, with some an and bn > O, the normalized maximum (X~:n-
an)/b, has a limiting distribution H ( z ) , then, with the same constants, each
normalized upper order statistic Xn-:,(an, bn) has a limiting distribution Hr(z).
The relation
Fn(an + bnz)~ H ( z )
THEOREM 3.2. Assume that with some constants an and bn > 0, the distribution
function of (Xn:, - an)/bn converges to a distribution function H ( z ) (at each of its
continuity points). Then, with suitable numbers A and B > O, H ( A + B z ) is one
of the following three functions:
364 Janos Galambos
where Ho(z) = limc-.O He(z), reduces to the above distribution functions according
as c = O, c > 0 , or c < 0 .
THEOREM 3.3. (i) I f F(X) < 1 for all x, and if, for all x > 0,
with some 'y > 0, then there are constants a, and b~ > 0 such that the distribution
of ( X , : , - a,)/b, converges to Hl.v(z). One can choose a~ = 0 and b, as the
smallest x such that 1 - F ( x ) <<-1/n.
(ii) I f there is a finite number w ( F ) such that F ( x ) = 1 for all x > w(F) and
F ( x ) < 1 for x < w(F) and if F * ( x ) = F [ w ( F ) - 1/x], x > 0 , satisfies (3.3), then
there are constants a, and b~ > 0 such that the distribution function of (X,:, -
a,)/b, converges to H2,:,(z). One can choose a~ = w(F) and b, = w ( F ) - c , ,
where c~ is the smallest x such that 1 - F ( x ) <<-1/n.
(iii) If, for some finite a such that F ( a ) < 1,
f : ~ [1 - F ( y ) ] dy < + ~ ,
and if
1 f w(F)[1 - F ( y ) ] d y ,
R (t) = 1 - F(t) Jt
then there are constants a. and b. > 0 such that the distribution function of
Order statistics 365
(X~:.- a,)/b, converges to H3,o(Z). One can choose a, as c, in part (ii) and
bn = R(an).
If, to F(x), none of (i), (ii) and (iii) applies, then there are no constants an and
bn > 0 such that the distribution of (X,,:n -- an)/bn would converge.
~. P(X1 >I n)
l=m P(X1 >i n + 1) = 1 (3.5)
fails, then, whatever be the constants an and bn > 0, Of,:, - a,)lbn does not have
a limiting distribution. The limit (3.5) fails for the geometric and Poisson
distributions.
lim ~ d / 1 - F(x)'~
x=w(F)OXt f ( x ) j = 0 ,
then there are constants an and b, > 0 such that the distribution function of
Of.:, - an)/b, converges to H3,0(z).
limiting form. This is certainly true when the population distribution is normal,
but not for all population distributions. The fact that the population dis-
tribution is a major factor in the speed of convergence in Theorems 3.1 and 3.2
is clearly seen from the estimates given by Galambos (1978, p. 113). It should
be noted that these estimates of Galambos are applicable to any population
distribution F(x), and for any values of a., b. and n. For the special cases when
F(x) is normal, P. Hall (1979), and when F(x) is exponential, Hall and Wellner
(1979), obtained uniform bounds for the difference
In the case of the normal distribution, the bound is c/log n, while for the
exponential distribution, it is of the magnitude of 1/n. Nair (1981) extended
Theorem 3.2 to an asymptotic expansion of several terms for the case when the
population distribution is normal, while Reiss (1981) gives an asymptotic
expansion, together with a uniform estimate of his remainder term, to the
distribution of X,,-k:,,(a,,, b,,) for all bounded k, and for a large family of
population distributions.
Let us return to the investigation of Xr:,,(a,,, b,,), where now neither r nor n - r
is bounded. We have two options to proceed. One is the relation at (2.1), in
which the fact that the distribution of m,(x) is binomial with parameters n and
F(x) provides an easy tool for approximating the distribution of Xr:,,(a,, b,) by
the normal distribution. On the other hand, if F(x) is continuous, then, with
the transformation X ~ * = - l o g F ( X j ) , (2.8) leads to another form of normal
approximation to the distribution of Xr:,. We develop both of these ap-
proximations below.
We start with the latter case, i.e., we assume that the population distribution
function F(x) is continuous. If we transform the data X~ into X ~ * =
- l o g F(Xj), then
X.-r+,:. = - l o g F(Xr:,),
and since the distribution function of X~* is F**(x) = 1 - e -x, x > 0, we can
apply the representation (2.8) with A = 1 to - l o g F(Xr:,,). We thus have
where the Yj, j I> 1, are independent random variables with common dis-
tribution function 1 - e -x, x > 0. Now, since both r and n - r are unbounded as
n ~ +o0, the Ljapunov form of the central limit theorem (Lo6ve, 1963, p. 277)
provides a normal approximation to the distribution of a properly normalized
form of {-log F(Xr:,)}, in which approximation the error term can also be
estimated by the Berry-Esseen bound (Lo4ve, 1963, p. 288). As a matter of
fact, since E ( Y j ) = 1, V(Y/)= 1 and E ( [ Y j - I [ 3 ) < 3 , the quoted normal ap-
proximation theorem and (3.6) yield that uniformly for all x and with constant
Order statistics 367
n 1 " 1 " 1
an, = ~'~--, b~=~-~ dnr=k~=r~" ~. (3.8)
k=r k ' :' '
Although Lo6ve gives an actual value for c, his bound can be improved
considerably (we do not carry out such a computation here, since in our second
approach we give a numerical value in the error term, which value was
computed by Englund (1980)). However, the present form is sufficient for
asymptotic results and Lo6ve's c also suffices for large n and 'large' r. In par-
ticular, the given estimate implies that if r = r(n) is such that
= 2i__+mP(Xr:n> F-l[exp(log p - ~ - - ~ p
i-p/)]) (3.10)
F 1[ e x p ( l o g p - ~q/-l~p---
1 - p x )1 = F - l ( p ) + p e x p ( - ~f(O.F_l(p)
/ ( 1 - p ) / n p x) ) - p ,
pexp(-~/~pPx)-p=-~/P(inP)x+O(1).
THEOREM 3.5. Let the population distribution F(x) satisfy the following proper-
ties: for some a < b, F'(x) = f(x) exists and is continuous for all a <~x <~b. Let
F(a)<pl<pz<'''<pk < F ( b ) , and assume that f(F-l(pj)) SO, l<-j<~k. If
rj = rj(n) satisfies (3.9) with p being replaced by pj, then the vector
p j ( 1 - p,)
f(F_~(pj))f(F_,(pt)), j <~t.
For a review of the literature concerning Theorem 3.5, see pp. 257.-258 of
David (1981).
For getting another form of approximation to the distribution of X,:,, we turn
to (2.1). Rewriting (2.1) as
the classical central limit theorem, together with the Berry-Esseen bound,
yields
Order statistics 369
E3 = E ( I I F - F(x)p),
Englund (1980) computes c and shows that the above estimate can be
modified to replacing F(x)[1-F(x)] by {(r/n)[1-(r/n)]}, thus achieving an
estimate which is uniform in x. In fact, he gets that for an arbitrary population
distribution F(x), and for any values of n, r and x,
x --~
P(X,:. < l o g n - l o g l o g n - t ~/lo---~) q)(x). (3.12)
We also have from (3.11) that the error term in (3.12) is of the order of
magnitude of 1/(log n) v2, uniformly in x. But the real meaning of (3.12) for the
applied scientist is rather confusing, because log n is so slowly increasing that
for all practical value of n, Xr:, with r = n - log n would be viewed as an u p p e r
370 Janos Galambos
order statistic, to which (3.1) applies, rather than a moderately upper order
statistic in which n - r depends on n (note that log 3 000 = 8).
However, as r e m a r k e d earlier, the theoretical results are quite interesting
and quite thorough. For example, Chibisov (1964) obtained that, for
moderately upper and lower order statistics, besides the normal distribution,
X,:,(a,, bn) may have as its asymptotic distribution the log normal distribution.
On the other hand, Smirnov (1949) has shown that when the limit in (3.9) is
positive then the asymptotic distribution of Xr:,(a,, b~) may be that of a certain
power of a normally distributed random variable (but again, e.g., in 32459:900,
r = 459 can be viewed as r = 0.5n + 0.3~/n, clearly violating (3.9), but satisfying
Smirnov's assumption, or one can view r = 459 as r = 0.51n, clearly satisfying
(3.9); hence only very good error terms m a k e such asymptotic results practical).
A more thorough analysis for the case when (3.9) is violated is given by
B a l k e m a and de H a a n (1978).
Let us look at the results of the present section through some specific
distributions.
1. The exponential distribution. Let the population distribution be
By applying the same theorems as above to (-Xj), 1 ~<j ~< n, (or directly from
T h e o r e m 2.2), the (asymptotic) distribution of nXl:, is F(x) = 1 - e -x itself, and,
for any r, not dependent on n,
~
t
. (3.14)
Now, if ri, l ~ j ~ < k , satisfy (3.9) with some 0 < p j < l , l~<j~<k, then
(Theorem
. .
3.5) the vector ~ / n ( X
.
, -
j"
n + l o g ( 1 - p j)), 1 <-j <~k, is asymptotically a
k-&menslonal normal vector w a h zero expectation and with covariance matrix
pj/(1 - pi), j <<-t (the value itself does not depend on t).
Finally, if r ~ + ~ with n and r / n ~ O or if n - r ~ + ~ and r / n ~ l , then (3.11)
can be applied to see that X,:n(an, b,) is asymptotically normal (although see
our previous discussion on its practical meaning when r or n - r increases very
Order statistics 371
r<X/r \
lim P Xr:,, - n - - n x ) = qb(x) (3.15)
rt='b~
(if r is of larger order of magnitude than mentioned then further normalizing terms
are required in the form of powers of (r/n)), and if n - r ~ + ~ with n and r/n ~ 1,
then
lim P ( X , : , - l o g - - <
n ~n--~r)= qb(x). (3.16)
n=+~ /'/ - - r
We have pointed out that (3.15) would rarely be utilized when r goes to
infinity very slowly (such as log n), in which case (3.14) would be used. On the
other hand, when r is a power of n, but r/n is still small, then (3.15) is
applicable. In boundary cases, results computed in two different ways should be
equal within the error of approximation. As an example, let us take n = 900
and r = 36. Then, if r = 36 is viewed as r = 0.04n, that is a central term, the
result is that 30(X36:900-1og(1-0.04)) is asymptotically normal with zero
expectation and with variance = 0.04/0.96, i.e., 147(3(36:900-0.04) is asymp-
totically standard normal. On the other hand, if r = 36 is viewed as r = 1.2X/n,
then (3.15) yields that 150(X36:900-0.04) is asymptotically standard normal. The
difference is reasonably small.
2. The logistic distribution. When brought into standard form, the dis-
tribution of X is logistic if
1
F(X)-l+e_X, -~<x<+~.
1 - e -x as x ~ +oo. (3.17)
l+e-X
For the central terms rj, 1 <~j <~ k, satisfying (3.9) with 0 < P i < 1, the nor-
malizing term F - l ( p i ) = log(pj/(1- pj)), and thus the vector
372 Janos Galambos
is normal with zero expectation and covariance matrix 1/(1 - Pj)Pt, j ~< t.
In view of (3.17), (3.16) also applies, again with the limitation that the order
of magnitude of n - r be smaller than n 2/s. By the symmetry of F ( x ) about
zero, the normalizing constants for X,:,, r-~ +oo, r/n ~ O, are similar to those at
(3.16) except that log{n/(n - r)} is to be replaced by log{(n - r)/n}.
3. The standard normal distribution. We now turn to the case when the
population distribution function
x
F(x) = 1 e -0/2)~2dt.
By Theorems 3.4, 3.3 (iii) and 3.1, we get that for each fixed r,
n =-I-~
where
a, = (2 log n) u2 ~(log log n + log 4rr)
- (2 log n) v2 and b, = (2 log n) - m .
1 1
F ( x ) = ~ + --~arctan x.
lira
-=+=
r xtan ( 1))-exp X,x)t=0 xtt!1 "
Orderstatistics 373
For the lower order statistics, a similar formula applies in view of the symmetry
of F(x) about zero.
Turning to the central terms, since F - l ( y ) = tan w(y-), Theorem 3.5 im-
plies that, for r satisfying (3.9),
s =Z 1
j=k 1 yj" (4.3)
If we multiply (4.3) by ( - 1 ) k-r (~-1) k-1 and s u m the terms with respect to k, we get
the classical B o n f e r r o n i - J o r d a n inequalities: for r/> 1, and for any n o n n e g a t i v e
integer u,
r+2u
attention, in which case all binomial coefficients in (4.5) become one. Ad-
ditional reduction of (4.5) is usually necessary since Sk defined at (4.2) assumes
that the k-dimensional distribution of the Xj is known. In particular, when only
bivariate distributions are available then (4.4) and (4.5) reduce to
1
S1 - S2 <~P(mn(X) >I 1) ~< $1 - ~ S2. (4.6)
No better upper bound is known if only $1 and $2 are available. However, the
best lower bound in the form of aS1 + bS2 was determined in Kwerel (1975a) and
Galambos (1977); namely if k0 = 1+ [2S2/$1], where [y] signifies the integer
part of y, then
Using the method of Galambos (1977), Sathe et al. (1980) extended (4.7) by
showing that if
( S l - r + 1)St
P ( m ~ ( x ) >I r) >i (r+ 1-)S-~+1-+rS~ "
For a historical account of early results in this direction, see Takfics (1958).
The asymptotic results of Section 3 are not applicable when the random
variables X1, X 2 , . . . , 32, are not i.i.d. In fact, normalized central terms may
376 Janos Galambos
Now, since just as at (21), {Xr:, < x} = {m,(x) t> r}, the behaviour of the central
terms (r ~ np, 0 < p < 1) is related to the behaviour of the arithmetical mean
(1/n)~,7=llj(x). This latter is known to be asymptotically normal for a large
variety of dependence structures (see e.g. Sen, 1968 and 1972) but it is not
always normal. For example, if the events {Xj < x} are exchangeable, which
means that for every choice 1 ~< il < i2 < < ik of the subscripts
With our usual meaning for m,(x), the above expression for S, can also be
written as
from which, under quite general conditions on the dependence of the Xj and
Order statistics 377
on their common distribution function F(x) one can deduce that, with a
suitable constant A, ~ / n ( ( S , / n ) - A ) is asymptotically normal. See Suh et al.
(1970), Sen et al. (1973), Sen (1973a,b) and Phoenix and Taylor (1973).
3. Strength of a sheet of metal. The random strength S of a sheet of metal
can be represented as an extreme of some dependent random variables.
Indeed, if the sheet is subdivided (hypothetically) into n smaller pieces of equal
size, and if the strength of the j-th piece is Xj, then, by the weakest link
principle,
where A > 0, y > 0 and B are parameters. See also the relevant references in
Harter (1978).
4. Reliability applications. There are two special systems of components
whose life distribution is directly related to extreme value theory. Let a system
consist of n components with useful random lives X1, 2(2, , 3(,. The system is
called parallel if it functions until at least one component functions, while it is
called series if it fails as soon as one component fails. If the life of the system is
denoted by L, then
A basic result of reliability theory is that every coherent system can be reduced
to a parallel or to a series system (see the first two chapters in Barlow and
Proschan, 1975). Therefore, a good approximation to the distribution of L is an
asymptotic distribution of the extremes in a dependent model. It should be
recognized that in the decomposition into a parallel or series system, one has
no control over the dependence of the 'new' components, hence an assumption
of independence is unrealistic. A number of dependent models are discussed in
Chapter 3 of Galambos (1978). One of these models concludes that the set of
possible limiting distributions of the extremes is all distributions with mono-
tonic hazard rates, which is in agreement with the experience of engineers. It
also covers such general models as stationary sequences (for which see also
Leadbetter, 1974), exchangeable sequences (see also Berman, 1962; Chernick
1980) and Gaussian sequences. References to the original works are also
378 Janos Galambos
which is the exact form of emphasizing that not age was the factor in a claim.
The question is then E(Xl:10000).
The equation at (6.1) is known as the lack of memory property and it is well
known to have a unique solution among distribution functions, namely, F ( x ) =
1 - e -~x, x > 0, with some A > 0 (see Galambos and Kotz, 1978, p. 8). Since
E ( X j ) = 10, A = 1/10, and by Theorem 2.2, 2(1:, is also exponential with
E(XI:,) = (1/n)E(Xj) (hence, the answer is that the first claim is expected in
1/1000 year, which is about 1/3 day, i.e., in the first few hours after the
completion of the deal).
When in a practical question there is only one underlying distribution for the
random quantity we investigate, we speak of a characterization theorem. There
are a large number of characterization results which are based on properties of
order statistics. Since the works Galambos (1975) and Galambos and Kotz
(1978) give a good collection of such results we present here a few typical ones
only. For these characterizations, we return to the assumption of independence
and identical distributions.
The simplest but useful characterization is due to Huang (1974a). He
remarked that, in view of the last expression at (2.2), the distribution of a
single order statistic uniquely determines the population distribution.
Another characterization of general nature is in terms of moments. If we put
References
Arnold, B. C. (1980). Two characterizations of the geometric distribution. J. Appl. Probab. 17,
570-573.
Arnold, B. C. and Ghosh, M. (1976). A characterization of geometric distributions by distributional
properties of order statistics. Scand. Actuar. 3. 4, 232-234.
Balkema, A. A. and de Haan, L. (1978). Limit distributions for order statistics. Theor. Veorjatnost. i
Primen. I:23, 80-96; 11:23, 358--375.
Baringhaus, L. (1980). Eine simultane Characterisierung der geometrischen Verteilung und der
logistischen Verteilung. Metrika 27, 237-242.
Barlow, R. E. and Proschan, F. (1975). Statistical Theory of Reliability and Life Testing: Probability
Models. Holt, Rinehart and Winston, New York.
Berman, S. M. (1962). Limiting distribution of the maximum term in a sequence of dependent
random variables. Ann. Math. Statist. 33, 894-908.
Chernick, M. R. (1980). A limit theorem for the maximum term in a particular ERMA(1,1)
sequence. J. Appl. Probab. 17, 869-873.
Chibisov, D. M. (1964). On limit distributions for members of a variational series. Teor. Verojatnost.
i Primen. 9, 150-165.
Chow, Y. S. and Teicher, H. (1978). Probability Theory. Springer, New York.
.. it
Csorgo, M., Seshadri, V. and Yalovsky, M. (1975). Applications of characterizations in the area of
goodness of fit. In: G. P. Patil et al., eds., Statistical Distributions in Scientific Work, Vol. 2.
Reidel, Dordrecht, pp. 79-90.
David, H. A. (1981). Order Statistics. Wiley, New York, 2nd ed.
Deheuvels, P. (1978). Characterisation complede des lois extremes mutivariees et de la con-
vergence des types extremes. Publ. Inst. Statist. Univ. Paris 23, 1-36.
Deheuvels, P. (1980). The decomposition of infinite order and extreme multivariate distributions.
In: Asymptotic Theory of Statistical Tests and Estimation (Proc. Adv. Internat. Syrup. Univ.
North Carolina, Chapel Hill, NC 1979). Academic Press, New York, pp. 259-286.
Deheuvels, P. (1981). Univariate extreme values-theory and applications. Bull. ISI (Proceedings of
the 43rd Session), pp. 837-858.
El-Neweihi, E. and Govindarajulu, Z. (1979). Characterizations of geometric distribution and
discrete IFR (DRF) distributions using order statistics. J. Statist. Plann. Inference 3, 85--90.
Englund, G. (1980). Remainder term estimates for the asymptotic normality of order statistics.
Scand. J. Statist. 7, 197-202.
Epstein, B. and Sobel, M. (1953). Life testing. J. Arner. Statist. Assoc. 48, 486--502.
Galambos, J. (1969). Quadratic inequalities among probabilities. Ann. Univ. $ci. Budapest Sectio
Math. 12, 11-16.
Galambos, J. (1975). Characterizations of probability distributions by properties of order statistics.
I-II. In: G. P. Patil et al., eds., Statistical Distributions in Scientific Work, Vol. 3. Reidel,
Dordrecht, pp. 71-88 and 89-101.
Galambos, J. (1977). Bonferroni inequalities. Ann. Probab. 5, 577-581.
Galambos, J. (1978). The Asymptotic Theory of Extreme Order Statistics. Wiley, New York.
Galambos, J. (1981). Extreme value theory in applied probability. Math. Sci. 6, 13-26.
Galambos, J. (1982). The role of functional equations in stochastic model building. Aequationes
Math. 25, 21--41.
Galambos, J. and Mucci, R. (1980). Inequalities for linear combination of binomial moments. Publ.
Math. Debrecen 27, 263-268.
Galambos, J. and Kotz, S. (1978). Characterizations of Probability Distributions. Lecture Notes in
Mathematics 675, Springer, Berlin.
Gerber, H. U. (1980). A characterization of certain families of distributions via Esscher transforms
and independence. J. Amer. Statist. Assoc. 75, 1015-1018.
Gnedenko, B. V. (1943). Sur la distribution limite du terme maximum d'une serie aleatoire. Ann.
Math. 44, 423--453.
Gumbel, E. J. (1958). Statistics of Extremes. Columbia University Press, New York.
Order statistics 381
Haan, L. de (1970). On regular variation and its application to the weak convergence of sample
extremes. Math. Centre Tracts 32, Amsterdam.
Hall, P. (1979). On the rate of convergence of normal extremes. J. Appl. Probab. 16, 433-439.
Hall, W. J.. and Wellner, J. A. (1979). The rate of convergence in law of the maximum of
exponential sample. Statist. Neerlandica 33, 151-154.
Harter, H. L. (1978). A bibliography of extreme value theory. Internat. Statist. Rev. 46, 279-306.
Huang, J. S. (1974a). Personal communication.
Huang, J. S. (1974b). Characterizations of the exponential distribution by order statistics. J. Appl.
Probab. 11, 605-608.
Huang, J. S. (1975). Characterization of distributions by the expected values of the order statistics.
Ann. Inst. Statist. Math. 27, 87-93.
Johnson, N. L. and Kotz, S. (1968-1972). Distributions in Statistics, Vol. I-IV. Wiley, New York.
Kotz, S. (1974). Characterizations of statistical distributions: A supplement to recent surveys.
Internat. Statist. Rev. 42, 39-65.
Kwerel, S. M. (1975a). Most stringent bounds on aggregated probabilities of partially specified
dependent probability systems. J. Amer. Statist. Assoc. 70, 472-479.
Kwerel, S. M. (1975b). Bounds on the probability of the union and intersection of m events. Adv.
Appl. Probability 7, 431-448.
Kwerel, S. M. (1975c). Most stringent bounds on the probability of the union and intersection
of m events for systems partially specified by Sh $2. . . . , Sk 2 <~k <-m. J. Appl. Probab. 12,
612--619.
Leadbetter, M. R. (1974). On extreme values in stationary sequences. Z. Wahrsch. Verw. Geb. 28,
289-303.
Loeve, M. (1%3). Probability Theory. Van Nostrand, New York, 3rd ed.
Mejzler, D. (1965). On a certain class of limit distributions and their domain of attraction. Trans.
Amer. Math. Soc. 117, 205-236.
Mori, T. (1981). The relation of sums and extremes of random variables. Bull. ISI (Proc. of the 43rd
Session), pp. 879-894.
Nair, K. A. (1981). Asymptotic distribution and moments of normal extremes. Ann. Probab. 9,
150-153.
Phoenix, S. L. and Taylor, H. M. (1973). The asymptotic strength distribution of a general fiber
bundle. Adv. Appl. Probab. 5, 200-216.
Pickands, J. (1981). Multivariate extreme value distributions. Bull. ISI (Proc. of the 43rd Session),
pp. 859-878.
Reiss, R. D. (1976). Asymptotic expansions for sample quantiles. Ann. Probab. 4, 249-258.
Reiss, R. D. (1981). Uniform approximation to distributions of extreme order statistics. Adv. Appl.
Probab. 13, 533-547.
R6nyi, A. (1953). On the theory of order statistics. Acta Math. Acad. Sci. Hungar. 4, 191-231.
Rossberg, H. J. (1965). Die asymptotische Unabhangigkeit der kleinsten und grossten Werte einer
Stichprobe von Stichprobenmittel. Math. Nachr. 28, 305-318.
Sarhan, A. E. and Greenberg, B. G. (1962). Contributions to Order Statistics. Wiley, New York.
Sathe, Y. S.0 Pradhan, M. and Shah, S. P. (1980). Inequalities for the probability of the
occurrence of at least m out of n events. J. Appl. Probab. 17, 1127-1132.
Sen, P. K. (1968). Asymptotic normality of sample quantiles for m-dependent processes. Ann.
Math. Statist. 39, 1724-1730.
Sen, P. K. (1972). On the Bahadur representation of sample quantiles for sequences of ~b-mixing
random variables. J. Multivariate Anal. 2, 77-95.
Sen, P. K. (1973a). On fixed size confidence bands for the bundle strength of filaments. Ann. Statist.
1, 526-537.
Sen, P. K. (1973b). An asymptotically efficient test for the bundle strength of filaments. J. Appl.
Probab. 10, 586-596.
Sen, P. K., Bhattacharyya, B. B. and Suh, M. W. (1973). Limiting behavior of the extremum of
certain sample functions. Ann. Statist. 1, 297-311.
Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics. Wiley, New York.
382 Janos Galambos
Shimizu, R. and Davies, L. (1979). General characterization theorems for the Weibull and the
stable distributions. Technical Report, Inst. Statist. Math., Tokyo.
Smirnov, N. V. (1949). Limit laws for members of a variational series. Trudi Math. Steklov 25,
1-60.
Suh, M. W., Bhattacharyya, B. B. and Grandage, A. (1970). On the distribution and moments of
the strength of a bundle of filaments. J. Appl. Probab. 7, 712-720.
Takfics, L. (1958). On a general probability theorem and its applications in the theory of stochastic
processes. Proc. Cambridge Philos. Soc. 54, 219-224.
Tiago de Oliveira, J. (1961). The asymptotical independence of the sample mean and the extremes.
Revista da Fac. Ciencias Univ. Lisboa A 8, 299-310.
Tiago de Oliveira, J. (1980). Bivariate extremes: Foundations and statistics. In: P. R. Krishnaiah,
ed., Multivariate Analysis V. North-Holland, Amsterdam, 349-366.
Walker, A. M. (1981). On the classical Bonferroni inequalities and the corresponding Galambos
inequalities. J. Appl. Probab. 18, 757-763.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1
At.
Elsevier Science Publishers (1984) 383-403
P. K. Bhattacharya
1. Introduction
Suppose (X1, Y1), (X2, Y2). . . . are independent and identically distributed
(iid) as a two-dimensional random vector (X, Y). For each x, = ( x , , . . . , x,) in
the sample space of (Xb Y , ) , . . . , (X,, Y,), arrange x , , . . . , x, in ascending
order of magnitude, treating ties in an arbitrary but well-defined manner (e.g.,
among ties x~ is placed before or after xj depending on whether i < j or j < i).
Thus the coordinates of each x, are arranged as x,, ~< --- ~<x,,. This defines the
X - o r d e r statistics X , , ~< ... ~<X,, obtained from X, = ( X b . . . , X,). W e now
define the r-th induced Y-order statistic Y., to be Y,r = Y/if X,r = Xj. In other
words, the r-th induced Y-order statistic is the Y-value associated with the r-th
X - o r d e r statistic and is, in general, not the same as the r-th smallest Y-value.
383
384 P. K. Bhattacharya
If Y has a continuous cumulative distribution function (cdf), then the Y,r are
distinct with probability 1, and the rank R,r of Y,r among Y I , . . . , Y, is
unambiguously defined. This rank can be written as
3. Dependence structure
In this section we present two results. The first one, due to Bhattacharya (1974),
shows that the {Y.,} are conditionally independent given X,,, while the other one,
due to Sen (1976), shows that their partial sums, adjusted for the regression, form a
martingale sequence.
LEMMA 3.1. For every n, the induced order statistics Y,1 . . . . . Y,, are con-
ditionally independent given X1 . . . . . X , with conditional cdf' s
G(.[ X,1) . . . . . G(-] X , , ) respectively.
PROOF. For X, = (X1 . . . . . X,), define A(r, X,) to be that j for which X,, = Xj.
Then the random permutation (A(1, X,) . . . . . A(n, X,)) of ( 1 , . . . , n) is deter-
mined only by X, and Y,r = Y,(r,x,). Argue conditionally given X, -- x,. Since,
for each i, Y/is independent of {(X:, Y:), j # i}, it follows that
P[Y,r<~y, k = 1. . . . . n I X , = x , ]
= P[Y,~r,x,)<~yr, r = 1. . . . . n ] X,(r,x,)= x,(r,~,), r = 1, . . . , n]
n
o G(y, IX,.(,,~n)
= I] ) = fi G(y, IX,,,),
r=l r=l
LEMMA 3.2. For arbitrary {Ck, n >I 1, 1 ~< r ~< n}, let
k
S*k = ~ Cnr{Y~- m (X..)}.
r=l
PROOF. For 1 ~ k ~ n - 1,
(1978) on the asymptotic distribution of Y,,. The last section deals with partial
sums of induced order statistics and their applications in regression analysis. This
aspect of induced order statistics was developed by Bhattacharya (1974, 1976) and
Sen (1976). In this section, we also briefly describe estimates of m (x) due toYang
(1981), which are weighted averages of Y,~ using kernel weights, and some results
on mixed rank statistics due to Sen (1981).
Formulas for the mean and the variance of Y,r and covariances between Y~
and Y,~ and between X,r and Y,~ are also obtained by conditioning with
respect to X, using Lemma 3.1. These formulas are given below.
PROOF. (a) and (b) follow immediately from Lemma 3.1. For (c), observe that
Cov(Y,,, Y,~ [ X,) = 0 for r ~ s by the conditional independence of Ynr and Y,~,
and for (d), observe that Cov(X.r, Yr~ [ X,) =--0.
We next consider the distribution of the rank R,r of Y~r among Y1. . . . . Y~.
THEOREM 4.3.
where t = m i n ( r - 1, s - 1),
Induced order statistics: Theory and applications 387
(n - 1)!
Ck(r, s, n) = k ! ( r - 1 - k ) ! ( s - 1 - k )!(n - r - s+ 1+ k ) ! '
=nf[ f[
n-1
E I{Y~ ~<y)= s - 11 d f ( x ) d G ( y I x ) .
i=1
T o complete the proof we only have to recognize the integrand in the last
expression as the probability of obtaining marginal frequencies r - 1 for
{Xi ~ x} and s - 1 for {Yi ~< y} in a 2 2 contingency table in which a total
frequency of n - 1 is classified according to the occurrence of the events {Xi <~ x}
and {Y~ ~< y} and their complements.
An interesting variation of the event {R,, = n}, i.e., the same pair obtains the
highest rank in both X and Y, arises when in addition to the bivariate
observations (321, Y1). . . . , (X,, Y,), we also have ml + m2 other independent
observations X ~ , . . . , X " l from the X-distribution and Y'~. . . . . Y"2 from the
Y-distribution. Now the X-ranking involves X1,.. , X,, X1' . . . . . . X ' 1, the
Y-ranking involves Y1 . . . . , Y,, Y7 . . . . . Y"2, and one of the pairs (X/, Y/)
obtaining the highest ranks in both X and Y is the event of interest. The
probabilities of this and related events have been studied by Spruill and
Gastwirth (1981) in the context of professional couples and singles completing for
employment in two departments of a university.
We now specialize the results of T h e o r e m 4.2 First suppose that the residual
Z= Y- m(X)
expressed as
We call this the linear regression model with homogeneous residuals. For
example, this holds when (X, Y) follows a bivariate normal distribution, in
which case Z is N(0, 0-2). Let
Then the following formulas, originally due to Watterson (1959) are obtained as
special cases of Theorem 4.2.
denote the true regression formula. However, in practice the true regression
formula is estimated by the method of least squares from a previous sample of
size N from the same population, for which all p + 1 variables 1. . . . . ~:,, Y
were observed. Let rio, fil . . . . . tip denote the estimated coefficients. Then the
above procedure calls for selection of s out of n individuals with the highest
values of X =/31~:1+"" + flpp =/~'~:. Let (Xi, Yi), i = 1 . . . . . n denote the pre-
dicted score and the true score in the sample from which the s individuals with
the highest X-values are selected. The average true score of the selected
individuals is then given by
Y s = s -1 ~ Y~r,
r=n-s+l
Induced order statistics: Theory and applications 389
where the induced Y-order statistics Y,r clearly depend on/~. The standardized
form of the conditional expectation E(f's ]/~), viz.
{E(E I )-
which is a measure of the predictive accuracy of the estimated regression
formula, has been called by Gross (1973) the expected gain from selection. By
Theorem 5.1,
{E(?,lti)- #,}/o-, = Z
r=n-s+l
where p(fl)is the conditional correlation coefficient between/~'~ and Y given
fi, i.e.,
p(fl) = Z'~/3 Cov(~b Y)
t ,t j Cov( ,,
Comparing the expected gain from selection using /~'s with the maximum
expected gain that can be achieved by any linear prediction formula, we see
that a relative measure of efficacy of the formula fl'se is provided by the ratio
e = p(13)/O,
Yk = Yk + bk (fi2. -- Xk ) , (5.3)
where
k k n
1 1 1
k k (5.4)
390 P. K. Bhattacharya
k k
Var(Y*)/o -2 = ( 1 - p 2 ) ~ a~ + p2 Var(X*) where X* = ~ aiX, ri .
1 1
In the above approach, the choice of {rl . . . . . rk} is restricted to fixed sets of k
elements from {1. . . . . n}. We now let {rl . . . . . rk} be a random set depending
on X,. The induced order statistics Y, rl are no longer concomitants of X-order
statistics, but Lemma 3.1 is still applicable by virtue of the remark following
that lemma. Consequently, I?(k) given by (5.8) is still an unbiased estimator of
p,y and Var(Y(k)) is still given by (5.10). We now optimize by choosing the
random {rl . . . . . rk} SO as to minimize V V(rl = rk, X,) given by (5.9) for
. . . . .
V(rl . . . . . rk;X,) for given X, is easy for small n, but would need an efficient
algorithm even for moderately large n. We leave this as an open problem.
However, it would often be possible to obtain suboptimal but fairly good
solutions relatively easily by restricting the choice near the two extremes of the
X-observations.
(iii) Maximum likelihood estimation from censored bivariate normal sample.
Consider a life-testing problem in which X is the failure-time of a randomly
selected unit and Y is a covariate whose effect on X is of interest. If in an
experiment with a random sample of n units only k units are observed
according to a type II censoring scheme, i.e., the experiment is run until the
first k failures and (X, Y) observations are made only on these k units, then
the observations consist of the first k order statistics X , I < " " < X , k of the
X-variate and the corresponding induced order statistics Y , 1 , . . . , Y,k of the
covariate. In the bivariate normal case, Harrell and Sen (1979) have obtained the
maximum likelihood estimators (mle) of/Zx, #y, o'x, o-y and p, and the likelihood
ratio test for H0: p = 0 under such a censoring scheme. Using Lemma 3.1 in
conjunction with (5.1), the log likelihood function of (X,1 . . . . . Xnk) and the
conditional log likelihood function of (Y,I . . . . . Y,k) given X, are obtained as,
respectively,
k
L.Xk = C1- k log o-x - (2o'2) -~ ~ (X.,-/zx) 2
1
Adding these, we obtain the log likelihood based on the complete data, from
which the mle's/2x,/2y, 6% t~y and t~ are calculated. Of these,/2x and 6-x turn out
to be the same as the mle's based on X alone, and they are computed by an
iterative process. In terms of/2~ and t~, we have
where )(~k), ~'~k), S2,(k), b(k) and r(k) are given by (5.5)-(5.7). Since/2x and 6-x are
known to have considerable bias under heavy censoring, it may be preferable
to estimate these by other suitable estimates throughout (Saw (1959, 1961)).
The likelihood ratio statistic for testing /40 is a function of r(k) and con-
ditionally, given X,, the statistic
T : (k - 2)r~k)/(1 -- r(k))
2
where Z,~ is the r-th induced order statistic of the random sample
(X1, Z1) . . . . . (X,, Z , ) from the distribution of (X, Z). Since X and Z are
independent, it follows from Lemma 3.1 that Z,r 1. . . . . Z,rk are independent,
each being distributed as Z. Now if for large n, the random variables m (X,,i)
are approximately the same as their mean values, then Y*,i-~ Z"re and are
therefore approximately iid as Z. The following theorem due to David and
Galambos (1974) makes this precise.
then for any fixed 1 <<-rl "~ "" < rk ~ n, the induced order statistics Y.,~ . . . . . Y.,k
are asymptotically independent, the deviation of each from its m e a n being
distributed as Z.
( y *,,1 . . . . . * ) = (Znrl . . . . .
Y.,~ Znrk) -~ (E nrl, . . . , E nrk) ,
l.--*o0
imP[Y., 1 ~<Yb-.. '
Y.,k <~ Yk ] = ~I G(y~ ] F - I ( A , ) ) .
i=l
and (a) is proved, which in turn implies that the asymptotic distribution of R J n
is the same as the conditional distribution of G ( Y ) given X -- F-I(A). Thus
COROLLARY 7.2. Suppose (X, Y ) is bivariate normal with ]p] < 1. Then, for
r/n ~ A E (0, 1) as n ~oo,
oo
(a) lim E [ ( R , d n ) k ] =
f q~k (X/-f-Z-_p2u + pq)-'(Z)) d ~ ( u ) .
In particular,
in probability as n ~ m . The same results remain true if Y.{.) and Y.. are
replaced by Y.,{.-k) and Yn.n-k for any fixed k. Thus, if X and Y represent
measurements of a characteristic for a parent and an offspring, then in a large
population, the offspring of the top individuals will not be in the top part of the
next generation.
We state without proof the following general result due to Galambos (1978),
describing the asymptotic behavior of the extreme induced order statistics.
- (2), --,
Suppose (XI 1), YI1)), 1 ~< i ~< nl, and (Ai v{2h,, 1 -<
~ i -< n2, are random samples
from two nonsingular bivariate normal populations. Let ~v-o) nlr~ 1 ~< r ~ nl, and v*n2r,
(2)
1 ~< r <~ n2, denote the induced order statistics in the two samples, and for a fixed k
consider the probability
~nl n2 1
p(nl, n2, k) = p ym
n: > r n2-~Z+l ~:(2)
~ n2" "
r=nl_k+ 1 = _
396 P. K. Bhattacharya
Obviously, if pl > 0 then for fixed k and n2, this probability tends to 1 as
nl~. Kaminsky (1978) has derived an approximate formula for 111 for which
p(nb n2, k) attains an arbitrary level y when n2 is a given large number.
for 0 < t < 1. H e r e for simplicity, we have written nt for the largest integer [nt]
not exceeding nt. The problem of estimating h(t), the regression of Y on X
evaluated at the t-quantile of X, can be approached by attempting to estimate
E(Yn, nt). In a model-free setting, the natural way to attempt this would be to
take a weighted average of the Y~, attaching predominantly large weights to the
Y-values corresponding to the few X~ which are close to X,,n,. Examples of
such estimators are:
mn(x)=11-1~b(n)-1K(r/rlb(Fi(x))Ynr
1
of re(x) and
m*(t)=n,l~b(n)-lg(~)Yn~
1
of h(t) constructed with kernel weights, where F,(x) is the empirical cdf, K is a
pdf and b(n) ~ 0 as n ~ o0. Using asymptotic properties of linear functions of
induced order statistics, Yang (1981) has shown the above estimators to be mean
square consistent under fairly mild conditions.
Now consider inference about the integrated regression function
t f F-l(t)
H(t)=
fO h(s)ds=J_~ m(x)dF(x), O<t<l.
when F is known.
The functions H and h are related to one another in a manner analogous to
the relation between the cdf and the pdf of a random variable. Many questions
Induced order statistics: Theory and applications 397
1
sup IZn(t)l
0~<t<~l
or
f0 Z.(t) at
are obtained by replacing Zn(t) by Z(t) and then finding the distributions of the
corresponding statistics.
We shall denote a standard Brownian motion by B(t), t >1O, and a Brownian
bridge on [0, 1], i.e., a process having the same distribution as B ( t ) - tB(1), by
B*(t). We assume that the following conditions hold.
We now state Theorem 9.1 which gives the joint weak convergence of
398 P. K. Bhattacharya
(Uo(t), V.(t)) and (U*(t), V.(t)), where U.(t) and U*(t) are as in (9.2) and
V.(t) is the empirical process
(i) Testing /40: m = rno where mo is a specified function. Under the null
hypothesis, U.(t) = n-~/2S.,~t ~ B(O(t)), so that
S.,.,I(nO(1)) m 0 B(O(t)/O(1)),
W~ ) = max IS.kll(n(b.(l))!/2
l<~k<_n ,,
and
W(2) = f o ' S . , . , d t / l n fo' fo't~.(min(s,t))dsdt] 1/2
n-1 '~ n-1 ql/2
,=,z 1 ,
0. Since Hn(1) = I7" and s 2 = n -1 E~' (Yi - ~-)2 is a consistent estimate of ~(1)
under H0, it follows from T h e o r e m 9.2(c) that, under H0,
which has the same distribution as that of the Brownian bridge B*(t). Hence
the critical region
where D * is the upper 100a % point of the cdf of sup0<~t~l ]B*(t)[, is asymp-
totically of level a. Note that the Kolmogorov-Smirnov statistic has the same
asymptotic null distribituon.
(iii) Testing equality of two regressions. Let (Xi, Yi), 1 ~< i ~< n, and (X;, Y~),
1 ~< i ~< n' be random samples from two bivariate populations with regression
functions m(x) and m'(x) respectively. W e want to test H0: m = m' under the
assumption of common marginal cdf F of X and X ' and common residual
variance o-2(x). Suppose F is unknown, and define
nt rift
in terms of the induced order statistics Y.r and Y',r from the two samples.
Consider a test based on
1
T* = (n -1 + n'-l) -1/2
f0 ( H * ( t ) - H*,'(t)) dt
H.(t)++-nK-l(1-a) { ~ 1 ( Y j - m . ( X j ) ) 2)1/2
/, O~<t<~l,
which is less restrictive than Conditions 1 and 2 under which the other results
are proved. Sen uses Lemma 3.2 to establish the above result from which
sequential a'aalogues of the tests discussed in (i) and (ii) can be constructed.
We conclude with a brief discussion of statistics of the form
tit
M.t= ~[a,(R*i)-fi.lYi, 0~<t~<l,
i=1
R*i being the rank of X~ among X1. . . . . X,. Here the scores are defined in the
usual manner as a.(i) = E~o(U,~) where U.i is the i-th order statistic in a random
sample of size n from the uniform distribution on (0, 1), t~, = n -1 El' a,(i) and p is
a square-integrable function on (0, 1). These statistics have been called mixed rank
statistics by Ghosh and Sen (1971). The structural affinity of M, with linear
combinations of induced order statistics is identified by writing
M. = ~ [a.(i)- an]Y.,.
i=1
The following weak and strong convergence properties of these statistics have
been obtained by Sen (1981) subject to regularity conditions on q~.
402 P. K. Bhattacharya
as n ~ oo. A law of iterated logarithm for M,, viz., that the lim sup and lim inf of
Mn/~/2nA2tr~ log log n are +1 and - 1 respectively with probability 1, follows
from this.
Sen [19] has also proved weak and strong convergence results for Mn under
general alternatives and for arbitrary scores a*(i) which are sufficiently close to
an(i) defined above and has indicated the use of these results in deriving
asymptotic properties of sequential tests of the hypothesis of independence.
References
Gross, A. L. (1973). Prediction in future samples studied in terms of gain from selection.
Psychometrika 38, 151-172.
Harrell, F. E. and Sen, P. K. (1979). Statistical inference for censored bivariate normal distributions
based on induced order statistics. Biometrika 66, 293-298.
Kaminsky, K. S. (1981). A note on concomitants of order statistic in two bivariate normal
samples. Proc. 43rd Session of ISI, Buenos Aires, Vol. 1, Contributed Papers, pp. 161-164.
Nadaraya, E. A. (1964). On estimating regression. Theor. Probab. Appl. 9, 141-142.
O'Connell, M. J. and David, H. A. (1976). Order statistics and their concomitants in some double
sampling situations. In: S. Ikeda et al., eds., Essays in Probability and Statistics, Ogawa Volume.
Shinko Tsusho, Japan.
Saw, J. G. (1959). Estimation of the normal population parameters given a singly censored
sample. Biometrika 46, 150-155.
Saw, J. G. (1961). The bias of the maximum likelihood estimates of the location and scale
parameters given a type II censored normal sample. Biometrika 48, 448-451.
Sen, P. K. (1976). A note on invariance principles for induced order statistics. Ann. Prob. 4,
474-479.
Sen, P. K. (1981). Some invariance principles for mixed rank statistics and induced order statistics
and some applications. Comm. Statist. A10, 1691-1718.
Skorokhod, A. V. (1965). Studies in the theory of random processes. Addison-Wesley, Reading, MA.
Spruill, N. L. and Gastwirth, J. L. (1981). A probability model illustrating the employment problems
a professional coupld faces. Unpublished manuscript.
Watterson, G. A. (1959). Linear estimation in censored samples from multivariate normal popu-
lations. Ann. Math. Statist. 30, 814-824.
Watson, G. S. (1964). Smooth regression analysis. Sankhy~ Ser. A. 26, 359-372.
Yang, S. S. (1977). General distribution theory of the concomitants of order statistics. Ann. Statist.
5, 996-1002.
Yang, S. S. (1981). Linear functions of concomitants of order statistics with application to
nonparametric estimation of a regression function. J A S A 76, 658-662.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 1 0. J
.t
O Elsevier Science Publishers (1984) 405-430
Endre Csdki
1. I n t r o d u c t i o n
where I(-) denotes the indicator function of the event in the brackets. With
other words, for all real x, Fn(x) is the relative frequency of the event whose
probability is given by the distribution function F(x). Hence, for fixed x, we
have
(1.3)
405
406 Endre Csdki
and determined its limiting distribution for continuous F(x). Later Smirnov
(1939a,b) considered the one-sided statistics
and also their two sample analogous. These are the so called K o l m o g o r o v -
Smirnov statistics. It is well known that they are distribution free as long as
F(x) is assumed to be continuous.
Empirical measure can be defined in a much more general context. In this
paper however we consider the case when X1, X2. . . . . X, . . . . is a sequence of
i.i.d, real valued random variables. We study the more general statistics
introduced by Anderson and Darling (1952):
This theorem has been extended in various direction (cf. the survey paper by
Gaenssler and Stute, 1979).
Empirical distribution function 407
For the general statistic K,,~ defined by (1.8), Wellner (1977b) has shown that
provided
The condition (1.13) fails for ~'(u) = I/u, or r(u) = 1/(1 - u) (0 < u < 1). For this
particular weight functions we have (1.14). It can be seen furthermore that for
continuous F(x), we have also
2. Exact distributions
P(G'(x)<~F"(x)<~G2(x)'O<-x<~l)=n!det[(~2ui)j+-i+a]i+]l)'] (2.1)
In the two sample case, i.e. when we have two independent samples
(X1 . . . . . X~I) and (Y1 . . . . . Y,~) with distribution functions F(1)(x) and Fm)(x),
resp., and empirical distribution functions F ~ ( x ) and F~)2(x), resp., we may
consider the following general statistics:
+
K,,v,,z,, = sup ( F ~ ( x ) - F(2.)(x))r(FN(X)), (2.7)
xER
where N = nl + 112 and FN(X) denotes the empirical distribution function of the
combined sample. The problem of finding the distributions of the above
statistics can be reduced to the determination of the probability P(a~ ~ Ri <<-
b~,i= 1 . . . . . nl), where RI~<R2~<'"~<R,1 are the ordered ranks of
(X1 . . . . , X,~) in the combined sample. Steck (1969) gives the following formula
for F(1)(x) ~ F(2)(x):
B + 0 whenB<OorA<B,
1 when B = 0.
(2.9) can also be obtained from a lattice path counting result due to Kreweras
(1965).
The recurrence relation for po) = P(ai <~Ri <~ bi, i = 1,
n 1
n 0 reads as fo1-
- . .
(2.13)
P(D+(n,n)<C)=l(n2+nc)c=1,2
(2:)'
..... n. (2.14)
0 ~ z ~< 1. (2.15)
(al ~< u <~ a2), (iii) r(u) = 1/(1 - u) (al <~ u ~< a2), etc. The weight function given
by (i) has been introduced by Maniya (1949), While Rrnyi (1953) introduced
the weight functions (ii) and (iii). The distributions P(K+,,, <~ z ) and P ( K ~ , <~ z )
for the above weight functions are determined by the following formula:
P ( F . ( x ) >! cx + b, al ~ x ~ a:)
- - a l e x i a 2 -- 1-a2~x<~l-a 1
s.p
alexia2 \l_a2~x~l_al \ 1 -- X
3. L i m i t t h e o r e m s
Smirnov (1939a,b) has determined the one-sided and the two sample analo-
gues of (3.1):
lim
~[/ nln2 \1/2D ,,~,,,2~
V~[nl~-~n2)
)
< y = ~'~ (--1)ke -2k2y2, O< y, (3.3)
min(n 1,n2)~ k = -c
linmP(F.(x)>(l+n-~)X-n-~-l-l~,al<~x<~a2 )
= ~(A1, B1; - r ) - e-2(v-")tP(A2, B2; r), (3.5)
1 1
~(A, B; r) = 2,tr(l_ r2)m f A f 2 exp{- 2(1- r2) (z2- 2rzlz2+ z~)} dzl dZ2
Furthermore
{a1(1 - a2)~1/2
r = \ a 2 ( 1 - a3/ '
/~ -- ual B1 - v - ua2
A1 = (a1(1- al)) 1/2' (a2(1- a2)) m '
v(1 - 2 a 0 + ual, B2 = v(2a2- 1 ) - ua2
A2 = (a1(1- a0) 1/2 (a2(1- a2)) 1/2
Empirical distributionfunction 413
nln2 x~l/2
n~---'~n~] sup (F~?(x)- F~)2(x)).
al<~FN(x)<-a 2
Furthermore
: limP(F.(x)>~(l+nY--~)x-nY--~,al<_x<~az)
n....oQ
p(( nl.2 ~1/2 (1) _ V~(
[Ft~(x)..s (2)x )'X < \
= lim ~\ n~---~n~/ sup (3.7)
min(nl,n2)._>~ al<F(nll)(x)<a2 ~ I -- F(nl)l(X ) : ~ Y)"
lim,(nl'2 sup
n--~ al<<.x<<_a2
(x ,)
= l i m P n u2 sup \ ]-:-x <~Y
n~ 1-a2<-x<- l - a 1
: limP(F,(x)>-(1-nY---~)x,a,<~x<_a2)
n__>~ -
limiting distributions of the statistics rtl/2gn,r and nl/2K"+ . . . . . . . . are the distributions
of sup0_<_x_<u( [ B ( x ) l r ( x ) ) and sup0.~_<l ( B ( x ) r ( x ) ) , resp., if the conditions
hold for all e > 0 , where hi(t) = tl/2r(t), h2(t) = t ' / Z r ( 1 - t ) and B ( x ) is a
Brownian bridge (tied down Wiener process). Anderson and Darling (1953)
studied the limiting distribution of K,,~ but no explicit formula is known in the
general case. In particular cases, e.g. for Maniya- and R6nyi-type weight
functions the following limiting distributions are given. Maniya (1949) shows
where
Ek = 1
2 f, =
(271.)1/2 (a211-a2)Ua
{u2}
exp - -~- du
Gnedenko (1954) for two sample case and Kuiper (1960) for one sample case
determined the following limiting distribution:
The weight function ~'(u)= ( u ( 1 - u ) ) -1/2 ( 0 < u < 1) does not satisfy (3.9).
The limiting distributions in this case have been found by Jaeschke (1979):
where A, = (2 log log n) ~/2, B, = 2 log log n + 51 log log log n. Some related
results are given in Jaeschke (1979) and Eicker (1979).
For weight function of the form (1.17) we have (Cs~iki, 1977a)
,2m (x-
((k + j)/(1 - y ) - V)/'-. 1
=k~_y)~t.e-(1--(lk~--y - v )
j:o J!
and the same is true for D. replaced by either D+. or D L For the general
statistic Kn,~ James (1975) gives
n 1/2gn, r
limsupn_,=t'log log n) 1/2= supx (r(x)x(1 - x)) m a.s. (3.20)
(3.21) is also necessary for (3.20) in the sense that if (3.21) fails then
n 1/2Kn, r
limsup (log log n) m = ~ a.s. (3.22)
the weight functions r(u)= ( u ( 1 - u)) -1/2, ~'(u)= 1/u, r(u)= 1/(1- u), ~-(u)=
(u(1 - u)) -~, 0 < u < 1 do not satisfy (3.21) and hence (3.22) holds true for these
weight functions. For ~-(u)= ( u ( 1 - u)) -1/2 we have shown the following result
(Csfiki, 1974, 1982): Let 6, be an increasing sequence of positive constants,
then
( ( IF.(x)-
P n 1/2 sup \ ( x ( 1 - x ) ) 1/2] >~6" i.o. = 0 or 1
) (3.23)
0<x<l
For ~-(u)= 1/u Shorack and WeUner (1978) show for increasing 6,,
The same is true if ~-(u)= 1/u is replaced by either r(u)= 1/(1- u) or r(u)=
1/(u(1 - u)), 0 < u < 1.
Recently Mason (1981) gave a result which is a common generalization of
(3.24) and (3.25):
Let 6, be an increasing sequence and 0 ~< a ~<. Then
( IF.(x)- xl
P n" sup ( x ( 1 - x ) ) -1" ~>6" i.o. = 0 or 1
) (3.27)
0<x<l
On the other hand, as shown by Jaeschke (1979), the weak version of the law of
the iterated logarithm holds for the weight function 7 ( u ) = ( u ( 1 - u))-m:
For one sided version it is shown in Csfiki (1977b) and Shorack (1980) that
[ n x 1/2 { X__Fn(X ) x
limsup,--- sup 1 1/2 = 2 a.s. (3.30)
,~oo \log l o g n J O<x<m~(x(-x)) )
We consider again weight functions of the form (1.17). First we mention the
following result of Kiefer (1972b). Define 3~ > l as the solution of the equation
1-d
fl(log 13 - 1) = d (d > 0), (3.31)
Let fl] = 0 if 0 < d ~< 1, and the other solution (/3]< 1) of the equation (3.3l) if
1 < d < ~. Put an = d n -~ log log n. Then
[ n ],/2 (IF.(x)- x I )
limsuPvlo 2 a.s. (3.35)
,_,~ \ g log n ] sup
an~x<_l--an\(X(l--x))l/2J
An extension of these results is given in Cs~iki (1977b) (see also Shorack,
1980):
If an is a decreasing sequence such that
lim nan(log log n) -1 --- and lim (log log a~l)(log log n) -1 = c
n..~ n.--~oo
{ n \1/2 [ IFn(X)__ Xl
limsup
log sup ~(X(1 _ X)) 1/2)
log -n) a<x<l_a
= max(2, dl/2(/3} - 1)) a.s. (3.37)
limsup sup ~
/ / F tn t x , - x ' ] = / 3 } _ l a.s., (3.40)
n-~ an<-X<l\ X /
Much less is known for liminf of K,., and related statistics. Mogulskii (1979)
and Kuelbs (1979) show
( F/ "~1,2 { If.(x)- xl
a.s. (3.43)
liminf \2 log log n// sup \(x(1 - x)) 1/2] = 1
0<x<l
Finally we note that strong limit theorems discussed in this Section are useful
in studying functions of order statistics, rank tests, etc. In this respect we refer
to Wellner (1977c) and Sen (1981).
is a martingale in x .
An easy consequence of the above properties is that the sequences nKn,~,
nK+,,, and nK~,~ are all submartingales. Furthermore (cf. Sen, 1973), K,,,, K+,,~
and K~,, are reverse submartingales.
Moreover (Hmaladze, 1981; Stute, 1982) the process nF,(x), 0 ~<x ~< 1 is a
Markov process.
A basic inequality f o r / 9 , is due to Dvoretzky, Kiefer and Wolfovitz (1956):
with some constant c. Devroye and Wise (1979) evaluated c 2611 in the
original proof. Shorack (1980) claims that (3.46) holds with c = 58. It has been
conjectured (see Birnbaum and McCarty, 1958) that (3.46) is true with c = 2
but this conjecture has not been proved so far. Similar inequalities are true for
D~+ and D~ with c replaced by c/2.
For the general statistic K,~, however an exponential bound similar to (3.46)
cannot be expected. By the use of some martingale properties and the
Birnbaum-Marshall inequality it can be shown that
P(K"~T>~Y) ~<..-~
ny eL1 ~-2(u)du, y>0 (3.47)
p(n,,2
\ o<x~b\ 1 --
p(n,,2 sup
l_b~<x<l X
where
O ( u ) = (u + 1) l o g ( u + 1) - u
u2
P ( n 1/2-'~ sup \
[F,(x)z x)
x,/2+,~ >~y)<~cy 2,(x+2,~), y > 0 , (3.50)
O<x<-a n
where K,~ denotes either of K,,, K+,,~, K~,~. This method however requires
evaluation or a good estimation of the moment generating function of K~. See
Csaki (1968, 1977a,b), Stanley (1972) and Khan (1977).
James (1975) gives an inequality improved by Shorack (1980) as follows:
Let z(u) be a positive function on some (0, 6] such that z(u)u ~/2 is increasing
on (0, 6] and "/'(/g)UI/2"-> 0 as u ~ 0 . Let y > 0, 0 < c < 1, a > 1 be given. Then
there exists 0 < ba ~< 6 such that for all integers nt ~< n2 having rtz/nl <~ ot and for
all 0 ~< a ~< b ~< b~ we have
4. T e s t s b a s e d on e.d.f.
The one sample statistics K.,. K+,. and K:,. defined by (1.8), (1.9) and (1.10),
resp., are used in goodness of fit tests, i.e. to test the null hypothesis that the
true distribution function F(x) is identically equal to the given hypothetical
422 Endre Csdki
distribution function Fo(x). In these tests the null hypothesis is rejected when
the test statistics are large, i.e. the critical regions are of the form {K 0.... ~ k,},
where K , denotes one of K,,, K+7 or K~,7 and the crtical value k~ is
determined so that under the null hypothesis we have
P(K,>~ k . ) ~ a. (4.1)
In order to find the value of k~ we need the exact or limit distribution of the
statistic used. For the most important weight functions the values of k~ are
given in tables. In this Section we study some properties of these tests and their
two sample analogues.
4.1. Powers
The power of the test is the probability of the critical region under the
alternative. The problem of determining the power can always be reduced to
computing probabilities of the form (2.1) in the one sample case or (2.9) in the
two sample case.
Consider e.g. the null hypothesis Fo(x) = x, 0 ~<x ~< 1 and use the test statistic
D~ = supo<_x<_L(x-F,(x)). Reject the null hypothesis if D~/> dr, where d, is
determined by P(D~ >~dJHo) = a. Consider two alternatives:
x if0~x~xo-A,
al(x)= Xo- A ifxo-A<X<Xo, (4.2)
x ifxo~<x<~l ;
x(1 1-x0+Ak/n+d~l"(l-a=)l-k(n--k)~ . i
(4.5)
Empiricaldistributionfunction 423
P(Dg t> d~ ] Gl(X)) ~<P(D~ >i d~ I G(x)) <~P(Dg I> d~ I G2(x)). (4.6)
4.2. Consistency
Recall that a test is consistent against an alternative if the power under this
alternative tends to 1 as the sample size tends to infinity.
For the consistency of test based on K,,, K+,,~ or Kg,~ we need the (weak
version) of Glivenko-Cantelli theorem, i.e. for the consistency of a test based
on K : against an alternative it is sufficient that
0 P
K,---~0 as n ~ (4.7)
0 P
K,---~A > 0 as n ~ (4.8)
under the alternative. Similar results hold for two sample tests. For the validity
of Glivenko-Cantelli theorem see the Introduction. Hence the test based on
4.3. Efficiency
There are several definitions of efficiency in the literature. The Pitman
efficiency for tests based on empirical distribution has been studied by Capon
(1965), Ramachandramurty (1966), Yu (1971), Kalish and Mikulski (1971).
Chernoff (1952, 1956) and Bahadur (1960, 1967) have introduced efficiency
concepts based on large deviations.
Let {T,, n t> 1} be a sequence of statistics for testing/40 against Ha. Let the
critical region be defined by {T, >/t}.
424 Endre Cs6ki
Chernoff (1956) introduces the following efficiency: assume that there exist
p > 0 and {t,} such that
(4.9) says that both errors of first and second kind are about the same order p".
Then if {T~ )} and {T~ )} are two sequences of test statistics testing H0 against HI,
whose p's are pa and p2, resp. then the Chernoff efficiency is defined by
ec = log Pl (4.10)
log p2"
Assume that
b is called the exact Bahadur slope of T,. Now if {T ~} and {T~ )} are two
sequences of test statistics whose exact Bahadur slope are bl and b2, resp., then
the Bahadur efficiency is defined by
eb = ~ . (4.14)
where K~ is any of statistics K,,, K+,,~ or KT~ and the function g~(y) is defined
as follows:
Let
(a+t)lg-~ +(1-a-t)lgl-a-tl-t ifO~a~l-t,
f(a, t) = (4.16)
ifa>l-t,
Empirical distribution function 425
g , ( y ) = m i n ( g , + ( y ) , g ; ( y )) . (4.19)
If ~-(t) is such t h a t e i t h e r
log(l/t)
liminf r(t) =0 (4.20)
t$0
or
r .
1mint --
. log(i/t)
, = 0 (4.21)
t~o ~ ' ( 1 - t )
References
Abrahamson, I. G. (1967). The exact Bahadur efficiencies for the Kolmogorov-Smirnov and
Kuiper one- and two-sample statistics. Ann. Math. Statist. 38, 1475-1490.
And61, J. (1967). Local asymptotic power and efficiency of tests of Kolmogorov-Smirnov type.
Ann. Math. Statist. 38, 1705-1725.
Anderson, T. W. and Darling, D. A. (1952). Asymptotic theory of certain 'goodness of fit' criteria
based on stochastic processes. Ann. Math. Statist. 23, 193-212.
Bahadur, R. R. (1960). Stochastic comparison of tests. Ann. Math. Statist. 31, 276-295.
Bahadur, R. R. (1967). Rates of convergence of estimates and test statistics. Ann. Math. Statist. 38,
303-324.
Barton, D. E. and Mallows, C. L. (1965). Some aspects of the random sequence. Ann. Math.
Statist. 36, 236-260.
Birnbaum, Z. W. (1953). On the power of a one-sided test of fit for continuous probability
functions. Ann. Math. Statist. 24, 484-489.
Birnbaum, Z. W. and Lientz, B. P. (1969). Exact distributions for some R6nyl-type statistics.
Zastos. Mat. 10 (Hugo Steinhaus Jubilee Vol.), 179-192.
426 Endre Csdki
quantile process. Coll. Math. Soc. J. Bolyai 11, Limit theorems of probability theory, Keszthely
(Hungary), 1974, pp. 59-71.
Csorgo, M. and Revesz, P. (1981). Strong Approximations in Probability and Statistics. Academic
Press, New York.
Daniels, H. E. (1945). The statistical theory of the strength of bundles of threads, I. Proc. Roy. Soc.
London Ser. A 183, 405-435.
Darling, D. A. and Robbins, H. (1968). Some nonparametric sequential tests with power one. Proc.
Nat. Acad. Sci. U.S.A. 61,804-809.
Dempster, A. P. (1959). Generalized D + statistics. Ann. Math. Statist. 30, 593-597.
Devroye, L. P. and Wise, G. L. (1979). On the recovery of discrete probability densitites from
imperfect measurements. J. Franklin Inst. 307, 1-20.
Durbin, J. (1968). The probability that the sample distribution function lies between two parallel
straight lines. Ann. Math. Statist. 39, 398-411.
Durbin, J. (1971). Boundary-crossing probabilities for the Brownian motion and Poisson processes
and techniques for computing the power of the Kolmogorov-Smirnov test. J. Appl. Probability 8,
431-453.
Durbin, J. (1973). Distribution theory for tests based on the sample distribution function. Regional
conference series in appl. math., No. 9, Siam, Phil.
Dvoretzky, A., Kiefer, J. and Wolfowitz, J. (1956). Asymptotic minimax character of the sample
distribution function and of the classical multinomial estimator. Ann. Math. Statist. 27, 642-669.
Dwass, M. (1959). The distribution of a generalized D ,+ statistic. Ann. Math. Statist. 30, 1024-1028.
Dwass, M. (1967). Simple random walk and rank order statistics. Ann. Math. Statist. 38, 1042-1053.
Dwass, M. (1974). Poisson process and distribution free statistics. Adv. Appl. Probability 6,
359-375.
Eicker, F. (1970a). A Ioglog-law for double sequences of random variables. Z. Wahrsch.
Verw. Gebiete 16, 107-133.
Eicker, F. (1970b). On the probability that a sample distribution function lies below a line segment.
Ann. Math. Statist. 41, 2075-2092.
Eicker, F. (1979). The asymptotic distribution of the suprema of the standardized empirical
processes. Ann. Statist. 7, 116-138.
Epanechnikov, V. A. (1968). The significance level and power of the two-sided Kolmogorov test in
the case of small sample sizes. Theory Probab. Appl. 13, 686-690.
Feller, W. (1948). On the Kolmogorov-Smirnov limit theorems for empirical distributions. Ann.
Math. Statist. 19, 177-189.
Finkelstein, H. (1971). The law of the iterated logarithm for empirical distributions. Ann. Math.
Statist. 42, 607-615.
Gaenssler, P. and Stute, W. (1979). Empirical process: a survey of results for independent and
identically distributed random variables. Ann. Probability 7, 193-243.
Ghosh, M. (1972). On the representation of linear functions of order statistics. Sankhygt Ser. A 34,
349-356.
Glivenko, V. (1933). Sulla determinazione empirica della leggi di probabilitY. Giorn. 1st. Ital.
Attuari 4, 92-99.
Gnedenko, B. V. (1954). Kriterien fhr die Unver~nderlichkeit der wahrscheinlichkeitsverteilung
von zwei unabh~ngigen Stichprobenreihen. Math. Nachrichten 12, 29-66.
Gnedenko, B. V. and Korolyuk, V. S. (1951). On the maximum discrepancy between two empirical
distribution functions. Dokl. Akad. Nauk SSSR 80, 525-528. English transl, in: Selected Transl.
Math. Statist. and Probability, Amer. Math. Soc. 1, 13-16. Providence, RI, 1961.
Goodman, V., Kuelbs, J. and Zinn, J. (1981). Some results on the LIL in Banach space with
applications to weighted empirical processes. Ann. Probability 9, 713-752.
Govindarajulu, Z., LeCam, L. and Raghavachari, M. (1967). Generalizations of theorems of
Chernoff and Savage on the asymptotic normality of test statistics. Proc. Fifth Berkeley, Sympos.
Math. Statist. and Probability Vol. I. Univ. of California Press, Berkeley, CA, pp. 609-638.
Groeneboom, P. and Shorack, G. R. (1981). Large deviations of goodness of fit statistics and linear
combinations of order statistics. Ann. Probability 9, 971-987.
Gupta, S. S. and Panchapakesan, S. (1973). On order statistics and some applications of com-
428 Endre Cs6ki
binatorial methods in statistics. In: A Survey of Combinatorial Theory (R. C. Bose Seventieth
Birthday Volume) (Proc. Internat. Sympos. Combinatorial Math. and Appl., Fort Collins, CO,
1971). North-Holland, Amsterdam, pp. 217-250.
H~jek, J. and Sid~k, Z. (1967). Theory of Rank Tests. Academia, Prague and Academic Press, New
York.
Hmaladze, E. V. (1981). Martingale approach in the theory of goodness-of-fit tests. Teor. Verojat-
nost. i Primenen. 26, 246-265 (Russian).
Ishii, G. (1958). Kolmogorov-Smirnov test in life test. Ann. Inst. Statist. Math. 10, 37-46.
Ishii, G. (1959). On the exact probabilities of Rtnyi's tests. Ann. Inst. Statist. Math. 11, 17-24.
Jaeschke, D. (1979). The asymptotic distribution of the supremum of the standardized empirical
distribution function on subintervals. Ann. Statist. 7, 108-115.
James, B. R. (1975). A functional law of the iterated logarithm for weighted empirical distributions.
Ann. Probability 3, 762-772.
Kalish, G. and Mikulski, P. W. (1971). The asymptotic behavior of the Smirnov test compared to
standard "optimal procedures". Ann. Math. Statist. 42, 1742-1747.
Khan, R. A. (1977). On strong convergence of Kolmogorov-Smirnov statistic and a sequential
detection procedure. Tamkang J. Math. 8, 157-165.
Kiefer, J. (1972a). Skorohod embedding of multivariate r.v.'s and the sample d . f . Z . Wahrsch.
Verw. Gebiete 24, 1-35.
Kiefer, J. (1972b). Iterated logarithm analogues for sample quantiles when p, $ O. Proc. Sixth
Berkeley Sympos. Math. Statist. and Probability, Vol. I. Univ. of California Press, Berkeley, CA,
pp. 227-244.
Klotz, J. (1967). Asymptotic efficiency of the two-sample Kolmogorov-Smirnov test. J. Amer. Statist.
Assoc. 62, 932-938.
Knott, M. (1970). The small sample power of one-sided Kolmogorov tests for a shift in location of
the normal distribution. J. Amer. Statist. Assoc. 65, 1384-1391.
Kolmogorov, A. N. (1933). Sulla determinazione empirica di una legge di distribuzione. Giorn. 1st.
Ital. Attuari 4, 83-91.
Korolyuk, V. S. (1955). On the discrepancy of empiric distributions for the case of two independent
samples. Izv. Akad. Nauk SSSR Set. Mat. 19, 81-96 (Russian).
Korolyuk, V. S. and Borovskih, Yu.V. (1981). Analytical problems of asymptotics of probability
distributions. Kiev: Naukova Dumka (Russian).
Kreweras, G. (1965). Sur une classe de problemes de denombrement fibs au treillis des partitions de
entiers. Cahiers du Bur. Univ. de Rech. Oper. 6, 5-105.
Krumbholz, W. (1976). The general form of the law of the iterated logarithm for generalized
Kolmogorov-Smirnov-Rtnyi statistics. J. Multivariate Anal. 6, 653-658.
Kuelbs, J. (1979). Rates of growth for Banach space valued independent increment processes.
Lecture Notes in Mathematics, No. 709. Springer, New York, pp. 151-169.
Kuelbs, J. and Dudley, R. M. (1980). Log log laws for empirical measures. Ann. Probability 8,
405-418.
Kuiper, N. H. (1960). Tests concerning random points on a circle. Nederl. Akad. Wetensch. Proc.
Ser. A 63, Indag. Math. 22, 38-47.
Lauwerier, H. A. (1963). The asymptotic expansion of the statistical distribution of N. V. Smirnov.
Z. Wahrsch. Verw. Gebiete 2, 61-68.
Malmquist, S. (1954). On certain confidence contours for distribution functions. Ann. Math. Statist.
25, 523-533.
Maniya, G. M. (1949). Generalization of Kolmogorov's criterion for an estimate for the law of
distribution for empirical data. Dokl. Akad. Nauk SSSR 69, 495-497 (Russian).
Mason, D. M. (1981). Bounds for weighted empirical distribution functions. Ann. Probability 9,
881-884.
Massey, F. J. Jr. (1950). A note on the power of a non-parametric test. Ann. Math. Statist. 21,
440-443; correction: 23 (1952), 637-638.
Mogulskii, A. A. (1979). On the law of the iterated logarithm in Chung's form for functional
spaces. Theory Probab. Appl. 24, 405-413.
Mohanty, S. G. (1979). Lattice Path Counting and Applications. Academic Press, New York.
Empirical distribution function 429
Mikl6s Cs6rgi~
Usually X1, X2 . . . . will be random vectors in the Euclidean space R a (d/> 1),
i.e., R = R a, and ~3 is the Borel subsets of R a. In this case let F be the right
continuous distribution function of X1, i.e., F(x) = be((-o% x]) --
P{~o E g2: Xl(to)E ( - % x)}, where ( - ~ , x] is a d-dimensional interval with
x E R d. The corresponding empirical distribution function be,((-~, x]) of the
random sample X1 . . . . . X, will be denoted by F,(x), i.e., for each ~o E O
nF,(x) is the number of those Xj(~o)= (Xjl(~O). . . . ,Xja(~o)) ( j = 1. . . . , n)
431
432 Mikl6s Cs6rg~
then the null distribution of fin(L-l(y)) is that of an(y), i.e., the same for all
F E 40 and for d = 1 with F simply continuous. Otherwise, i.e., if/-/1 obtains,
the empirical process fin is a function of F and so will be also its distribution.
Hoettding (1948), and Blum, Kiefer and Rosenblatt (1961) suggested an
alternate empirical process for handling H0 of (1.10). Let F,i be the marginal
Invariance principles for empirical processes 433
and define
d
T,(x) = Tn(x1, . . . , Xd)= n'/2(Fn(x) YI F,,(x,)), d >i 2, (1.12)
i=1
with F, as in (1.5). In terms of the mapping L -1 of (1.8) we define t,, the uniform
version of T., by
d
t.(y) = T,(L-I(y))= nl/2(F,(Fo~(y,) . . . . . F(,~,(ya))- ~ F.i(Fa~(yi)) )
d
= nl/2(E.(Y) - 1-I E,,(y,)), (1.13)
i=1
if XI:. > x,
F,,(x) =i k n if Xk:. <~x < Xk+l:., (1.15)
if X.:~ ~< x,
if UI:n,
E.(y) = F,(O(y))=! I k/On Uk:n
if ~< y < Uk+l:., (r16i
It1 if U.:. ~< y,
434 M i k l 6 s Cs6rg~
A simple relationship like that of (1.17) for c~. and/3, does not exist for u. and
p.. However by (1.19) we have
where U.(y) ^ y < 0y,. < U.(y) v y. Since u.(k/n) = --o~.(Uk:.) (k = 1. . . . . n), it
is reasonable to expect that the asymptotic distribution theory of ~. and u.
should be the same. This, in turn, implies that via (1.22) p. should also have the
same kind of asymptotic theory if f is 'nice'. We are going to see in Section 4 that
this actually is true under appropriate conditions of f.
Let C(t) = fR ~ exp(i(t, x)) dF(x) be the characteristic function of F ( x ) on R e,
where (t,x)=~.d=ltkXk, the usual inner product of t = ( t l , . . . , t k ) , X=
(X~. . . . . Xd) E R d. With F. as in (1.5) the empirical characteristic function C. of
the sample X1 . . . . . X. is defined by
d
EW(x)W(y) = A(x ^ y) with A (x ^ y) = I ] (xi ^ Yi),
i=1
Recent work on the limiting distribution of and critical values for the
multivariate Cram6r-von Mises and Hoeffding-Blum-Kiefer-Rosenblatt in-
dependence criteria is reviewed in Section 3.
An up-to-date review of strong and weak approximations of the quantile
process, including that of weak convergence in ][-/ql[-metrics, is given in Section 4.
For further readings, references on this subject and its statistical applications, we
.. tt
P{sup Io~.(y) - B.(y)[ > n-112(C log n + x)} ~< L e -*x , (2.1)
yE/1
(iii) for any A > 0 there exists a constant C > 0 such that for each n (cf.
Cs6rg~ and R~vdsz, 1975a)
P{su~ la.(y)- B.(y)[ > C(log n)3/2n-uz(d+l)} ~< n -x, d/> 1. (2.3)
yEI
The constants of (2.1) for example can be chosen as C = 100, L = 10, h = 1/50
(cf. Tusnddy, 1977b,c).
Ltt ~ .
Due to Bfirtfai (1966) and/or to the Erdos-Renyl (1970) theorem, the O(.)
rate of convergence of (2.4) is best possible (cf. Koml6s, Major, Tusnfidy, 1975a,
or Theorem 4.4.2 in Cs6rg6 and R6v6sz, 1981a).
Next, on strong approximations of the uniform empirical process a, of (1.7)
or, equivalently, that of fl,(L -~) of (1.9) in terms of a single Gaussian process,
the Kiefer process {K(x, t); (x, t) E I d x R ~+},we have
P{ sup sup [kU2ak(y)-- K(y, k)[ > (C log n + x) log n} < Le -xx , (2.7)
l~k<~n yE11
P{ sup sup Ikl/Zak(y)- K(y, k)l > C n (d+x)12(a+2)log 2 n} ~< n -A, d/> 1.
l ~ k ~n y E l d
(2.8)
COROLLARY 2.2. (2.7), (2.8) in turn imply
n -1/2 sup sup [kl/20tk(y)- K(y, k)l ~' O(log 2 n/nX/2), (2.9)
l~k<~n y E l 1
438 Mikl6s Cs6rgd
n -v2 sup sup Iklr2ak(y)-- K(y, k)[ ~ O(n -m2d+4) log2 n), d i> 1.
l<~k<~" Yu'Id (2.10)
The first result of the type of (2.4) is due to Brillinger (1969) with the a.s. rate
of convergence O(n-V4(log n)m(log log n)1/4). Kiefer (1969) was first to call
attention to the desirability of viewing the one dimensional (in y) empirical
process a,(y) as a two-time parameter stochastic process in y and n and that it
should be a.s. approximated in terms of an appropriate two-time parameter
Gaussian process. M/iller (1970) introduced {K(y, t);(y, t ) E I I x R I + } and
proved a corresponding two dimensional weak convergence of {a,(y); y E
11, n = 1, 2 . . . . } to the latter stochastic process. Kiefer (1972) gave the first
strong approximation solution of the type (2.9) with the a.s. rate of con-
vergence O(n-1/6(log n)2/3).
Both Corollaries 2.1 and 2.2 imply the weak convergence of a , to a
Brownian bridge B on the Skorohod space D[0, 1]d. Writing k = [ns] (s E
[0, 1]), [ns]lr2at~l(y)/n v2 is a random element in D[0, 1]d+l for each integer n,
and (2.10) implies also
For d = 1 the latter result is essentially the above mentioned result of Mfiller
(1970) (cf. also discussion of the latter on page 217 in Gaenssler and Stute,
1979).
Corollary 2.2 also provides strong invariance principles, i.e. laws like the
Glivenko-Cantelli theorem, LIL, etc. are inherited by a,(y) from K(y, n) or
vice versa (cf., e.g., Section 5.1, Theorem S.5.1.1 in Cs6rg~ and R6v6sz, 1981a).
The main difference between the said Corollaries is that such strong laws like
the ones mentioned do not follow from Corollary 2.1. In the latter we have no
information about the finite dimensional distributions in n of the sequence of
Brownian bridges {Bn(y)}~=l. On the other hand, the inequalities (2.1), (2.2)
and (2.3) can be used to estimate the rates of convergence for the distributions
of some functionals of a , (to those of a Brownian bridge B), and those of the
appropriate Prohorov distances of measures generated by the sequences of
stochastic processes {a,(y); y G Ia}n=l and {B,(y); y E Ia},=l (cf., e.g., Koml6s,
Major and Tusnfidy, 1975b; Theorem 1.16 in M. Cs6rg~, 1981a; Theorem 2.3.1
in Gaenssler and Stute, 1979 and references therein; Borovkov, 1978; and
review of the latter paper by M. Cs6rg~, 1981, M R 81j; 60044; the latter two to
be interpreted in terms of {a,(y); y G Ia}~=l and {B,(y); y ~ Id}n= 1 instead of
the there considered partial sum and Wiener processes).
When the distribution function F of (1.6) is not a product of its marginals for
all x E R d (d/> 2), then strong and weak approximations of/3, can be described
in terms of the following Gaussian processes associated with the distribution
function F(x) (x E R d, d >! 2).
Brownian bridge Bv associated with F on R a (d~>2): A separable d-
I n v a r i a n c e principles f o r e m p i r i c a l p r o c e s s e s 439
K~(x, O) = O,
lira KF(X, . . . . . xa, t) = 0 (i = 1 . . . . . d),
is a Brownian bridge process on rid associated with G on rid, and the Brownian
bridge process B r associated with F on R e can be represented via (2.12) and
(2.13) as
{By(x); x G R e} = { W G ( L ( x ) ) - G ( L ( x ) ) W e ( 1 . . . . ,1); x ~ R e}
and (2.14)
{BF(L-I(y)); y E U} = {Be(y); y ~ Ia}.
{KF(X, t); X E R ~, t ~ 0 }
= {We(L(x), t)- G(L(x))Wo(1 ..... 1, t); x E R a, r 1>0}
and (2.16)
{KF(L-~(y), t); y E I a, t >~0} = {Ke(y, t); y E I d, t ~> 0}.
W e note that if F ~ o~0 or d = 1, then the latter Kiefer processes K e ( ' , ) and
K s ( ' , ) coincide with our originally defined Kiefer process K ( - , . ) on I d x R l+.
T h e same is true concerning our originally defined Brownian bridge B, and the
Brownian bridges B e and BF in the context of F E o~0. N o t e also that in general/3n
of (1.16) can be written as
process/3, of (1.6) or, equivalently, that of/3,(L -~) of (2.17) is (for more recent
information we refer to Borisov, 1982).
for h = 1/(5000d2), where Cl, C2 are positive constants depending only on F and
d.
COROLLARY2.4. (2.18) in turn implies
REMARK 2.1. We note that Philipp and Pinzur (1980) state only (2.19) in their
Theorem 1. S. Cs6rg~ (1981b) noted that going through their proof one can see
that they had in fact proved the somewhat stronger (2.18).
REMARK 2.2. Theorem 2.3 and Corollary 2.4 are best available in the sense
that in them there are no assumptions made on F. The a.s. rates of convergence
of Corollary 2.2 are of course better than that (2.19), but in the former F is
assumed to be uniformly distributed on I d. While in case of d = 1 the latter
assumption is not a restriction (for d = 1 and F continuous we have (1.9), and if
F is arbitrary in the latter case, then (2.1) and (2.7) remain true (cf. Remark 1
in S. Cs6rg~, 1981a)), for d/> 2 it is. Cs6rg~ and R~v6sz (1975b) actually proved
(2.18) with the better rate n (a+l)/(2a+4)log2 n (cf. (2.8)) replacing its present rate of
n (1/2)-z (A = 1/(5000d2)), but only for a class of d-variate distribution functions
satisfying a rather strict regularity condition.
Clearly, for all fixed t > 0,
The result of (2.21) was first proved by Dudley (1966) (cf. also Neuhaus,
1971; Straf, 1971; Bickel and Wichura, 1971; Theorem 2.1.3 in Gaenssler and
Stute, 1979, and discussion of the latter theorem therein). The result of (2.22)
was first proved by Bickel and Wichura (1971) (cf. Theorems 2.1.4 and 2.1.5 in
Gaenssler and Stute, 1979; see also Neuhaus and Sen, 1977).
A common property of the quoted results so far is that they are uniform
approximations of measures over intervals only (of I d or those of R a mapped
onto Id). Concerning now the more general problem of approximating the
empirical measure process 13, of (1.4) by appropriate Gaussian measure pro-
cesses, the question is over how rich a class of subsets ~ C ~ of (R, ~ ) (cf.
paragraph one of Section 1) could we possibly have theorems like, for example,
Theorems 2.1, 2.2 and 2.3. Let (R, ~, p.) = (I d, ~, A), A the uniform Lebesgue
measure on I d, i.e., X1 . . . . , X, (n = 1, 2 . . . . ) are distributed as in Theorems 2.1
and 2.2. In" this case write t~,(B) = nl/2(A,,(B)- A(B)), B ~ ~, instead of/3, of
(1.4). Then, while it is true (cf. Philipp, 1973) that
where ~ = qg(2) is the class of convex sets of I 2, it is also known that the law of
the iterated logarithm (2.23) fails for c = CO(d)' the class of convex sets of I d
(d t>3). The latter negative result for d = 3 was only recently proved by
Dudley (1982) (for a discussion of previous results see e.g. Gaenssler and Stute
1979). In spite of (2.23) dimension two (d = 2) is also critical, for Dudley (1982)
showed also that if c is the collection of lower layers in 12 (a lower layer in I 2
is a set B such that if (x, y) E B, u ~<x and v ~<y, then (u, v) E B), then the law
of the iterated logarithm (2.23) fails again (for previous mostly negative results
and references along these lines for higher dimensions we refer to Stute, 1977;
and Gaenssler and Stute, 1979). The same kind of negative results hold true
concerning the problem of central limit theorem for an(B) (cf. Dudley, 1979), i.e.,
concerning empirical measure processes o n I d o r R d, for the central limit theorem
as well as law of the iterated logarithm the critical dimension is 2 for the lower
layers and 3 for the convex sets. Hence any extension of results like those of
Theorems 2.1, 2.2 and Corollaries 2.1, 2.2 in terms of uniform distances over a
class of sets c~ of i d , other than the intervals already considered, can only be true
for somewhat restricted classes ~ C ~. R6v6sz (1976a,b) extended (2.6) and (2.10)
over sets in I d, defined by differentiability conditions. For example, instead of (2.6)
we have (R6v6sz, 1976a)
Invariance principles [or empirical processes 443
where Y is the class of those Borel sets of 12 which have twice differen-
tiable boundaries, and {B.(B); B ~ ~}]=1 respectively {K(B, n); B E ~g,
n/> 1} are Gaussian measure processes with mean zero and covariance
function EBn(B)B.(D)= A(B C) D ) - A(B)A(D) (n = 1, 2 . . . . ) respectively
EK(B, n)K(D, m) = (n A m)(h(B CI D ) - A(B)A(D)) for all B, D ~ ~. Similar
extensions hold true over sets in I d (d >~2) with differentiable boundaries (cf.
R6v6sz, 1976b; Ibero, 1979a,b).
A common feature of the results of Theorem 2.2, Corollary 2.2 and that of
the first one of these types by Kiefer (1972) is not only that they improve (they
imply for instance functional laws of iterated logarithm (cf., e.g., Section 5.1 in
Cs6rg~ and R6v6sz, 1981a)) and are conceptually simpler than the original
weak convergence result of Donsker (1952) on empirical distribution functions,
but they also avoid the problem of measurability and topology caused by the
fact that D[0, 1]a endowed with the supremum norm is not separable (cf., e.g.,
Billingsley, 1968, p. 153). This idea of proving a.s. or in probability invariance
principles h ia Kiefer (1972) also works for distribution functions on R d (cf.
Theorem 2.3 and its predecessors by R6v6sz (1976a, Theorem 3), and Cs6rg~
and R6v6sz (1975b, Theorem 1)) and, as we have just seen in (2.24) and (2.25),
for uniform distances over sets of U, defined by differentiability conditions (see
also R6v6sz, 1976b; Ibero, 1979a,b). Recently Dudley and Philipp (1981) used
the same idea to reformulate and strengthen the results of Dudley (1978,
1981a,b), Kuelbs (1976) on empirical measure processes while removing their
previously assumed measurability conditions. They do this by proving in-
variance principles for sums of not necessarily measurable random elements
with values in a not necessarily separable Banach space and by showing that
empirical measure processes fit easily into the latter setup. We refer for
example to Theorems 1.5 and 7.1 in Dudley and Philipp (1981) which can be
viewed as far reaching generalizations (with slower but adequate rates of
convergence) of Theorems 2.2, 2.3 and their Corollaries 2.2, 2.4, and also that of
(2.25), in terms of Kiefer Measures {K,,(B, n); B E ~, n ~> 1} associated with
probability measures/x on (R, N) over a subclass ~ (of some generality) of N.
The strong and weak approximations of the empirical characteristic function
Cn of (1.23) can be accomplished, on R ~ in terms of Gaussian processes built on
Kiefer and Brownian bridge processes (cf. S. Cs6rgS, 1981a) and on R d (d 1> 2)
in terms of Gaussian processes built on Kiefer and Brownian bridge processes
associated with F on R d (cf. S. Cs6rg6, 1981b). For further references we refer
to the just mentioned two papers of S. Cs6rgd.
444 Mikl6s CsOrg/~
W2,d ) I ] dF(i)(xi) = d
i=1 i=1
(3.1)
Cs6rg6 and Stach6 (1979) gave a recursion formula for the exact distribution
function V,~I of the rv OJ~l. The latter in principle is applicable to tabulating
Vn,1 exactly for any given n. Naturally, much work has already been done to
compile tables for Vn.1. A survey and comparison of these Can be found in Knott
(1974), whose results prove to be the most accurate so far. All these results and
tables are based on some kind of an approximation of V,~I. As to higher
Invariance principles for empirical processes 445
where to} = fie B2(y) dy, {B(y); y E I d} a Brownian bridge, and dy = I-I/d=1dyi
from now on.
For the sake of describing the speed of convergence of the distribution
functions {V~,d}~=l to the distribution function Vd of o9~ (cf. (3.3)) we define
S. Cs6rgd (1976) showed that a,,,1= O ( n -1/2 log n) and, on the basis of his complete
asymptotic expansion for the Laplace transform of the rv w21 (cf. (3.1)), he
conjectured that A,,1 is of order 1/n. Indeed, the latter turned out to be correct (cf.
Corollary 1:An,1 = o(n-1), in Cotterill and M. Cs6rg6, 1982), and it can be deduced
from the ground breaking work of G6tze (1979). Actually the latter work when
combined with Dugue (1969), and Bhattacharya and Ghosh (1978) implies (cf.
Section 2 in Cotterill and M. Cs6rg6, 1982) an asymptotic expansion of arbitrary
order for the distribution function V,,d of (3.2) and also that
As mentioned already, for the sake of computing the value of ~O2,d for a
sample, /40 of (1.10) will have to be completely specified. An alternate route to
testing for H0 of (1.10) can be based on the empirical process t, = T , ( L -1) of
(1.13) which will not require the specification of the marginals of F under H0,
i.e., it will work also when H0 of (1.10) is a composite statistical hypothesis.
For the sake of describing the latter approach due to Hoeffding (1948), and
Blum, Kiefer and Rosenblatt (1961), we define the sequence of Gaussian
processes {T(")(y); y E Id}n~l by
d
{T(")(y); y C I a} = {Bn(y)- 2 B.(1 . . . . . 1, Yi, 1. . . . ,1) l-[ Y~;
i=1 j#i
y =(yl ..... yd)~- Id(d~>2)} (3.6)
where {B(y); y E I d (d >~2)} is a Brownian bridge. Thus T(-) has mean zero
and covariance function p ( - , - ) of (3.8), and weak convergence of t, to the
Gaussian process T of (3.17) on the Skorohod space D[0, 1]d follows by (3.14)
say. Also, a Corollary 2.3 type weak convergence of [n.]~/2t[,l(.)/n 1/2 to T ( - , - )
of (3.7) on D[0, 1]d+l follows by (3.15).
Blum, Kiefer and Rosenblatt (1961) proposed the following Cram6r-von Mises
type test statistic for/40 of (1.10):
d
C"'d= fa T2(x) I~dF(i)(xi)= ~Idt~(y)dy' d>~2" (3.18)
d i=1
448 Mikl6s Cs6rgd
As far as we know the rates of convergence in (3.21) are the only ones available
so far.
Concerning tables for the distribution function F~, for d = 2, Blum, Kiefer
and Rosenblatt (1961) obtained the characteristic function of the distribution
function Ca of the rv Ca and tabulated its distribution via numerical inversion
of the said characteristic function. The statistic C,,d of (3.18) itself cannot be
computed unless F E ~0 of /40 of (1.10) is completely specified. Hoeffding
(1948), and Blum, Kiefer and Rosenblatt (1961) suggested, as critical region for
H0 of (1.10) when it is viewed as a composite statistical hypothesis, large values
of
or those of
d
These two statistics are equivalent to C,,d in that both converge in dis-
tribution to the rv Ca. This was already noted by Blum, Kiefer and Rosenblatt
(1961), and for a detailed proof of this statement we refer to Section 4 in
Cotterill and Cs6rg6 (1980). Recently D e W e t (1980) studied a version of (3.23)
in the case of d = 2 with some nonnegative weight functions multiplying the
integrand T 2 of E',,d. Koziol and Nemec (1979) studied ~7,,d of (3.22) and its
lnvariance principles ]:or empirical processes 449
THEOREM 4.1 (Cs6rg~ and R6v6sz, 1978). Let X1, 2 2. . . . be i.i.d, rv with a
continuous distribution function F and assume
O) is twice differentiable on (a, b), where a = s u p { x : F ( x ) = 0}, b =
inf{x: F ( x ) = 1}, - ~ ~< a < b ~< +0%
(ii) F ' ( x ) = f ( x ) > 0 on (a, b),
(iii) for some y > 0 we have
If'(O(Y))l ~<
sup y ( 1 - y) ~(o(y)) r.
O<y<l
450 Mikl6s Cs6rg~
THEOREM 4.2 (Cs6rg6 and R6vdsz, 1975c, 1978). For an i.i.d, sequence of rv
X1, X2 . . . . there exists a probability space with a sequence of Brownian bridges
{B,} on it such that
D
p, (-)---* B(.) (4.5)
P
sup [(u.(y)- B,(y))/q(y)] ~ 0 (4.8)
1/(n+ l)~y~n/(n+ l)
g(y)f(O(y/a))/f(Q(y))~ as y ~ 0 (4.10)
and (symmetrically)
g ( y ) f ( Q ( 1 - y)/A)/f(Q(1 - y ) ) ~ ~ as y ~ 0 (4.11)
for each A ~> 1 (note that in Stute's (1982) case g = 1/h on account of q --- 1).
Shorack (1982) announced the latter result with q and g as in (4.7) (for a proof
we may, for example, refer to M. Cs6rg~, 1983, Corollary 5.3.2). All the
afore-mentioned results concerning (4.9) are contained in
THEOREM 4.3 (M. Cs6rg6, 1983, Theorem 5.3.1). Let a, b be as in Theorem 4.1
and assume that F has a continuous density function F' = f that is positive on
(a, b), the support of F. Let q be any given O'Reilly weight function with (4.7).
Then, as n-~ ~, with the sequence of Brownian bridges {Bn} of (4.4) we have
(4.9) under (4.7), provided that with g of the latter the following assumption also
holds true:
For any given 0 < 7l < 1 and e >O there exist 0 < c < 1 and no such that
452 Mikl6s CsOrg~
and similarly
All the afore quoted results concerning (4.9) can be put in terms of weak
convergence on D[0, 1], provided we redefine u, (and hence also p,) to be
equal to zero on [0, 1/(n + 1)) and (n/(n + 1), 1].
The sufficient conditions of Theorem 4.3 (cf. (4.12)) for the weak ap-
proximation of p, on [1/(n + 1), n/(n + 1)] are nearly necessary as well. For
convenience, a weight function q will be called an O'Reilly weight function
from now on if g = q/h satisfies (4.7). We have
P P
sup Ip,(y)/q(y)[---~O, sup Ip,(y)/q(y)[--~O. (4.15)
0~<y~l/n (n-1)/n~y<~l
If, on the other hand, for the given q and g there exists a sequence of rv
T, <~ 1/n so that the rv
n-1/a(f(O(T~))/f(O(O,,,,)))/q(T,), or
(4.16)
n -uz(f( Q (1 - %))/f( O (01- ,,.,)))/q (T~)
with r, ^ UI:, <~ 0,,,, <~ rn v Ul:n and (1 - ~-,) ^ U~:n ~< 01+rn,n ~ (1 -- ~'~) V U~:,, is
bounded a w a y from zero in probability as n -~ ~, then so will be also the rv of
(4.15).
for some A ~> 1 and all n ~> 1 with probability arbitrarily near to one by Remark
1 in Wellner (1978). Hence (4.17) implies the first condition of (4.12) and the
Stute (1982)-Shorack (1982) theorem follows from Theorem 4.3.
(2) (Taken from Cs6rg~, Cs6rg6, Horv~th and R6v6sz, 1982). For any
O'Reilly weight function q such that q ( y ) ~ 0 we have sup0<y~l/, 1/q(y)= ~.
Hence in case of the uniform quantile process u, we can choose {r,} of (4.16)
such that for any given constant K > 0 we have (1/nl/2q('&)) > K. Consequently,
for any q with q(0) = 0 and for any sequence of Brownian bridges {B,} we have
(3) (Taken from Cs6rg~, Cs6rg~, Horvfith and R6v6sz, 1982). As to the
454 Mikl6s Cs6rg[~
problem of having
Hence condition (4.12) of Theorem 4.3 is satisfied and consequently (4.9) holds
true with any O'Reilly weight function q under conditions (i), (ii), (iii) of
Theorem 4.1.
(5) (Taken from Cs6rg6, Cs6rg6, Horvfith and R6v6sz, 1982). Given the
conditions (i), (ii) of Theorem 4.1 and replacing its condition (iii) by requiring
the existence of the limits (cf. (vi) of Theorem 4.7)
where yl and ')/2 are real numbers, then it can be shown (cf. Theorem 3.A in
Parzen (1980), or page 7 of Seneta (1976), or Mason (1982)) that we have
riO(y)) = y"Ll(y) as y ~, O,
(4.20)
f(O(y)) = (1- y)r2L2(y) as y 1' 1,
where L1 and L2 are slowly varying functions at 0 resp. at 1. If we simply
I n v a r i a n c e principles f o r empirical processes 455
assume the forms of (4.20) for f ( O ) on the mils, then these are weaker
assumptions on f than that of Off), for then we do not require the existence of f'
on (a, b), the support of F. So let us assume that f ( O ) is as in (4.20) on the tails,
and consider its first statement (regarding the second one, similar conclusions
will hold true). It follows from Corollary on page 274 in Feller (1966) that for a
slowly varying function L1 we have: for any e > 0 there exist some positive
constants K1,/(2 and 0 < y0 < 1 such that we have
K l y ~ < LI(y) < K2y -e for all 0 < y <~ y0. (4.21)
' Ll(y) 1
sup su- f(O(y)) 1 =
sup sup Oy,. Ll(Oy,,,)g(y)
t/(,,+O~<y~<coy,v f(O(Oy,.))q(Y) 1/(n+l)~y~c Oy,n
It follows from (2.8) in M. Cs6rg6 (1983) that for both 7 + e > 0 and y + e < 0
we have
Hence choosing g(y) = y-~, 6 > 0 and then e > 0 of (4.21) so that e < 6/2, then
the first condition of (4.12) holds true by (4.22) and (4.23) combined. This
means that having assumed (4.20), which is weaker than (i), (ii), (iii) of
Theorem 4.1 combined, the statement of (4.9) holds only with g ( y ) = y-8 for
6 > 0, arbitrary otherwise (cf. the last sentence of our example (4)).
(6) We note that the conditions discussed in (4) and (5) for the validity of
(4.9) cannot, in general, insure also the validity of (4.17). Namely we have the
following (CsiSrg~, Cs6rg6, Horvfith and R6v6sz, 1982)
Next we mention some new strong approximations of p,. First we recall that
conditions (i), (ii), (iii) of Theorem 4.1 imply (4.1), and given also the tail
monitonicity assumptions of (iv), (v) we have also (4.2) and (4.3). We have just
seen in Examples (4) and (5) that conditions (i), (ii), (iii) of Theorem 4.1 alone,
456 Mikl6s Cs6rg~
or the somewhat weaker assumptions of (4.20) for the tail behaviour of f(Q),
imply (4.9) (in (4) without any further restrictions on q, while in (5) with g(y) of
the form y-8 (6 > 0) only). As to the possibility of extending the statement of
(4.1) over a wider range than [6,, 1 - 6,], 6, = 25n -1 log log n, but using only the
assumptions (i), (ii), (iii) of Theorem 4.1 and not those of its conditions (iv), (v)
which, when combined with (i), (ii), (iii), made (4.2) and (4.3) possible, we have
THEOREM 4.6 (Cs6rg6, Cs6rg6, Horvfith and R6v6sz, 1982). A s s u m e the con-
ditions (i), (ii), (iii) of Theorem 4.1. Then
t4.25)
where e > 0 is arbitrary, and y is as in condition (iii).
It is clear from Theorem 4.6 that, when proving (4.2) and (4.3) the conditions
(iv) and (v) of Theorem 4.1 come into play only because of the tail regions
[0, 1/(n + 1)), (n/(n + 1), 1]. Having replaced 6, of (1.8) by 1/(n + 1) in (4.25), we
have only paid the price of slightly weakened rates of convergence. While they
render (4.2) and (4.3) true, the extra conditions (iv) and (v) of Theorem 4.1 are
somewhat disjoint from that of (iii). Next we modify the latter somewhat for
the sake of seeking further insight into the effect of the tail behaviour of the
density-quantile function f ( O ) on a statment like (4.25). We are going to
formulate this over the interval [0, ] only and note that similar statements can
be made over [, 1].
provided that a < l + l / ( 2 [ y l ] ) . On the other hand, when 71<0, for a >
1 + 1/(2171[) there exists positive constants K = K ( a ) and A = A(a) such that
n~
n
7 sup
O~y~l
Io.(y)l (4.28)
COROLLARY 4.1 (Cs6rg6, Cs6rg6, Horvfith and R6v6sz, 1982). Assume the
conditions (i), (ii), (iii) of Theorem 4.1 and condition (vi) of Theorem 4.7. Then
if 3/1 > 0 we have (4.28), and if yl < 0 we have
Now the first statement of (4.29) follows from the latter combined with (4.26).
The second statement of (4.29) is by (4.17).
We note that Corollary 4.1 implies the non existence of LIL for p, over [0, 1]
under (i), (ii), (iii) and (vi) if 71 < 0. On the other hand it follows from Theorem
3 in Mason (1982) that if we replace the weight function f ( Q ) in n-ll2pn by y~,
e > O, then
for every e > 0, given the conditions (i), (ii), (iii) and (vi), i.e., under the latter
conditions we always have a Mason type Glivenko-Cantelli theorem for
(O.(y)- O(y)).
Summarizing the main features of the problem of strong approximation of p,
by u, over [0, 1] in general, we have seen so far that under the conditions (i),
(ii), (iii), (iv), (v) of Theorem 4.1 we have (4.2) and (4.3), on dropping the
conditions (iv) and (v) we have (4.25), and when we replace the conditions (iv),
(v) by that of (vi) we have (4.26) and (4.27). Now we give an example which will
amount to saying that for results like (4.2) and (4.3) neither the conditions (iv),
(v), nor the condition (vi) are necessary. This example is due to Parzen (1979,
page 116). Continuing the numbering of examples of this section, we now have
(7) Parzen's example (1979) (Result of (4.32) is quoted from Cs6rg6, CsOrg6,
Horvfith and R6v6sz, 1982): Let
Clearly then
i.e.,_~conditions (i), (ii), (iii) of Theorem 4.1 are satisfied. Hence by Theorem 4.6
we have (4.25) with y = 1 / ( 1 - C ) . On the other hand, as y ~ l, O ( y ) - * ~ and
f(O(y)) oscillates. Hence conditions (iv), (v) of Theorem 4.1 are not satisfied.
Also, as y --->1,
(1 f(o(y))
- y)j2(O(y))
oscillates, i.e., the right tail version limit requirement of condition (vi) is also not
satisfied. Nevertheless in case of this example we have
References
Aly, E.-E., Burke, M. D., Cs6rg6, M., Cs6rg6, S. and Horv~th, L. (1982). On the product-limit
empirical and quantile processes: A collection of four papers. Carleton Mathematical Lecture Note
No. 38, Carleton University, Ottawa.
Aly, E.-E. and Cs6rg6, M. (1981). Three papers on quantiles and the parameters estimated quantile
process. Carleton Mathematical Lecture Note No. 31, Carleton University, Ottawa.
Bahadur, R. R. (1966). A note on quantiles in large samples. Ann. Math. Statist. 37, 577-580.
B~irtfai, P. (1966). Die Bestimmung der zu einem wiederkehrenden Process geh6renden Ver-
teilungsfunktion aus den mit Fehlern behafteten Daten einer Einzigen Relation. Studia Sci.,
Math. Hung. 1, 161-168.
Invariance principles for empirical processes 459
Bhattaeharya, R. N. and Ghosh, J. K. (1978). On the validity of the formal Edgeworth expansion.
Ann. Statist. 6, 434--451.
Bickel, P. J. and Wichura, M. J. (1971). Convergence criteria for multiparameter stochastic
processes and some applications. Ann. Math. Statist. 42, 1656-1670.
Billingsley, P. (1968). Convergence of Probability Measures. Wiley, New York.
Blum, J. R., Kiefer, J. and Rosenblatt, M. (1961). Distribution free tests of independence based on
the sample distribution function. Ann. Math. Statist. 32, 485-498.
Borisov, I. S. (1982). An approximation of empirical fields. Coll. Math. Soc. J. Bolyai 32,
Nonparametric Statistical Inference, Budapest 1980.
Borovkov, A. A. (1978). Rate of convergence and large deviations in invariance principle. In:
Proceedings of the International Congress of Mathematicians (Helsinki, 1978), 725-731. Acad. Sci.
Fennica, Helsinki (1980).
Brillinger, D. L. (1969). An asymptotic representation of the sample distribution function. Bull.
Amer. Math. Soc. 75, 545-547.
Burke, M. D., Cs6rg~, M., Csfrg~, S. and R6v6sz, P. (1979). Approximations of the empirical
process whe n parameters are estimated. Ann. Probability 7, 790-810.
Cencov, N. N. (1956). Wiener random fields depending on several parameters. Dokl. Akadl Nank.
SSSR 106, 607-609.
Chibisov, D. (1964). Some theorems on the limiting behaviour of empirical distribution function.
Selected Translations Math. Statist. Probability 6, 147-156.
Cotterill, D. S. and Cs6rg~, M. (1980). On the limiting distribution of and critical values for the
Hoeffding-Blum-Kiefer-Rosenblatt independence criterion. Carleton Mathematical Lecture
Note No. 24, Carleton University, Ottawa.
Cotterill, D. S. and Cs6rg~, M. (1982). On the limiting distribution of and critical values for the
multivariate Cram6r-von Mises statistic. Ann. Statist. 10, 233-244.
Csfiki, E. (1977a). Investigations concerning the empirical distribution function. Magyar Tud.
Akad. Mat. Fiz. Oszt. Kozl. 23, 239-327. English transl, in Selected Transl. Math. Statist. and
Probability 15, 229-317. Amer. Math. Soc., Providence, R.I. 1981.
Csfiki, E. (1977b). The law of the iterated logarithm for normalized empirical distribution function.
Z. Wahrsch. Verw. Gebiete 38, 147-167.
Csfiki, E. (1982a), On the standardized empirical distribution function. Coll. Math. Soc. J. Bolyai
32, Nonparametric Statistical Inference, Budapest 1980.
Csfiki, E. (1982b). Empirical distribution function. This Volume.
Cs6rg~, M. (1979). Strong approximations of the Hoeffding, Bium, Kiefer, Rosenblatt empirical
process. J. Multivariate Anal. 9, 84--100.
Cs6rg~, M. (1981a). Gaussian processes, strong approximations: an interplay. Colloques Inter-
nationaux Du Centre National De La Recherche Scientifique No. 307, Aspects Statistiques JEt
Aspects Physiques Des Processus Gaussiens, Saint-Flour 22-29 juin 1980, 131-229, l~ditions du
CNRS, Paris 1981.
Cs6rg~, M. (1981b). On a test for goodness-of-fit based on the empirical probability measure of
Foutz and testing for exponentiality. In: D. Dugue et al., Eds., Analytical Methods in Probability
Theory, Lecture Notes in Mathematics 861, Springer, Berlin, pp. 25-34.
Cs6rg6, M. (1981). MR81j:60044.
Cs6rg~, M. (1983). Quantile Processes with Statistical Applicatiorls. Regional Conference Series in
Applied Mathematics, Philadelphia, SIAM (To appear).
Cs6rg~, M., Cs6rg~, S.,'Horvfith, L. and R~v6sz, P. (1982). On weak and strong approximations of
the quantile process. To appear in Proceedings of the Seventh Conference on Probability Theory
(Brasov, Aug. 2 9 - Sept. 4, 1982).
Cs6rg~, M. and R6v6sz, P. (1975a). A new method to prove Strassen type laws of invariance
principle, II. Z. Wahrsch. Verw. Gebiete 31, 261-269.
Cs6rg~, M. and R6v6sz, P. (1975b). A strong approximation of the multivariate empirical process.
Studia Sci. Math. Hungar. 10, 427--434.
Cs6rg6, M. and R6v6sz, P. (1975c). Some notes on the empirical distribution function the quantile
process. Coll. Math. Soc. J. Bolyai U, Limit Theorems of Probability Theory (P. R6v6sz, Ed.).
North-Holland, Amsterdam-London, pp. 59-7L
460 Mikl6s Csrrg~
Csrrgr, M. and Rrvrsz, P. (1978). Strong approximations of the quantile process. Ann. Statist. 6,
882-894.
Csrrgt3, M. and Rrv~sz, P. (1981a). Strong Approximations in Probability and Statistics. Academic
Press, New Y o r k - Akadrmiai Kiad6, Budapest.
Csrrg~3, M. and Rrv~sz, P. (1981b). Quantile processes and sums of weighted spacings for
composite goodness-of-fit. In: M. Cs/Srg~ et al., eds., Statistics and Related Topics. North-
Holland, Amsterdam, pp. 69-87.
Cs6rg~, S. (1976). On an asymptotic expansion for the von Mises to2 statistic. Acta Sci. Math.
(Szeged) 38, 45-67.
Csrrg~, S. (1980). On the quantogram of Kendall and Kent. J. AppL Probab.!7, 440-447.
Cs~irgt$, S. (1981a). Limit behaviour of the empirical characteristic function. Ann. Probab. 9,
13t)-144.
Csrrg~, S. (1981b). Multivariate empirical characteristic function. Z. Wahrsch. Verw. Gebiete 55,
203-229.
Csrrg~, S. (1982). On general quantile processes in weighted sup-norm metrics. Stoch. Process.
Appl. 12, 215-220.
Csrrg~, S. and Stacho, L. (1979). A step toward an asymptotic expansion for the Cramrr-von Mises
statistic. Coll. Math. Soc. J. Boyai 21, Analytic Function Methods in Probability Theory (B.
Gyires, Ed.), 53-65, North-Holland, Amsterdam.
Deheuvels, P. (1981). An asymptotic decomposition for multivariate distribution-free tests of
independence. J. Multivariate Anal. 11, 102-113.
DeWet, T. (1980). Cramrr-von Mises tests for independence. J. Multivariate Anal. 10, 38-50.
Doksum, Kjell (1974). Empirical probability plots and statistical inference for nonlinear models in
the two-sample case. Ann. Statist. 2, 267-277.
Doksum, K. A., Fenstad, G. and Aaberg, R. (1977). Plots and tests for symmetry. Biometrika 64,
473-487.
Doksum, K. A. and Sievers, G. L, (1976). Plotting with confidence: Graphical comparisons of two
populations. Biometrika 63, 421--434.
Doksum, K. A. and Yandell, B. S. (1982). Tests for exponentiality. This volume.
Donsker, M. (1952). Justification and extension of Doob's heuristic approach to the Kolmogorov-
Smirnov theorems. Ann. Math. Statist. 23, 277-283.
Dudley, R. M. (1966). Weak convergence of probabilities on nonseparable metric spaces and
empirical measures on Euclidean spaces. Ill. J. Math. 10, 109--126.
Dudley, R. M. (1978). Central limit theorems for empirical measures. Ann. Probab. 6, 899-929;
Correction (1979), ibid. 7, 909-911.
Dudley, R. M. (1979). Lower layers in R 2 and convex sets in R 3 are not GB classes. Probability in
Banach Spaces II, Lecture Notes in Mathematics 709. Springer, Berlin, pp. 97-102.
Dudley, R. M. (1981a). Donker classes of functions. In: M. Csrrg~ et al., eds., Statistics and
Related Topics. North-Holland, Amsterdam, pp. 341-352.
Dudley, R. M. (1981b). Vapnik-Cervonenkis Donsker classes of functions. Colloques Inter-
nationaux Du Centre National De La Recherche Scientifique No. 307, Aspects Statistiques JEt
Aspects Physiques Des Processus Gaussiens, Saint-Flour 22-29 juin 1980, 251-269, Editions du
CNRS, ParAs.
Dudley, R. M. (1982). Empirical and Poisson processes on classes of sets or functions too large for
central limit theorems. Preprint.
Dudley, R. M. and Philipp, W. (1981). Invariance principles for sums of Banach space valued
random elements and empirical processes. Preprint.
Dugue, D. (1969). Characteristic functions of random variables connected with Brownian motion
and of the von Mises multidimensional to~. In: P. R. Krishnaiah, ed., Multivariate Analf~sis 2.
Academic Press, New York, 289-301.
Durbin, J. (1970). Asymptotic distributions of some statistics based on the bivariate sample
distribution function. In: M. L. Puri, ed., Nonparametric Techniques in Statistical Inference.
Cambridge University Press, London, pp. 435--451.
Durbin, J. (1973a). Distribution Theory for Tests Based on the Sample Distribution Function.
Regressional Conference Series in Applied Mathematics 9, Philadelphia, SIAM.
lnvariance principles for empirical processes 461
Durbin, J. (1973b). Weak convergence of the sample distribution function when parameters are
estimated. Ann. Statist. 1, 279-290.
Durbin, J. (1976). Kolmogorov-Smirnov tests when parameters are estimated. In: P. Gaenssler and
P. Rrvrsz, eds., Empirical Distributions and Processes, Lecture Notes in Mathematics 566.
Springer, Berlin, pp. 33--44.
ErdOs, P. and Rrnyi, A. (1970). On a new law of large numbers. J. Analyse Math. 23, 103-111.
Feller, W. (1966). A n Introduction to Probability Theory and Its Applications, Vol. IL Wiley, New
York.
Gaenssler, P. and Stute, W. (1979). Empirical processes: a survey of results for independent and
identically distributed random variables. Ann. Probab. 7, 193-243.
G6tze, F. (1979). Asymptotic expansions for bivariate von Mises functionals. Z. Wahrsch. Verw.
Gebiete 50, 333-355.
Hoeffding, W. (1948). A nonparametric test of independence. Ann. Math. Statist. 19, 546-557.
Ibero, M. (1979a). Approximation forte du processus empirique fonctionnel multidimensionnel.
Bull. Sci. Math. 103, 409-422.
Ibero, M. (1979b). Intrgrale stochastique par rapport au processus empirique multidimensionell.
C.R. Acad. Sci. Paris Sdr. A 288, 165-168.
Kiefer, J. (1969). Old and new methods for studying order statistics and sample quantiles. In: Proc.
First International Conf. Nonparametric Inference. Cambridge University Press 1970, London, pp.
349-357.
Kiefer, J. (1970). Deviations between the sample quantile process and the sample D.F. In: M. L.
Puff, ed., Nonparametric Techniques in Statistical Inference. Cambridge University Press, Lon-
don, pp. 299-319.
Kiefer, J. (1972). Skorohod embedding of multivariate RV's and the sample D . F . Z . Wahrsch.
Verw. Gebiete 24, 1-35.
Knott, M. (1974). The distribution of the Cramrr-von Mises statistic for small sample sizes. J. Roy.
Statist. Soc. Set. B. 36, 430-438.
Koml6s, J., Major, P. and Tusn~dy, G. (1975a). An approximation of partial sums of independent
R.V.'s and the sample D . F . I . Z . Wahrsch. Verw. Gebiete 32, 111-131.
Koml6s, J., Major, P. and Tusn/tdy, G. (1975b). Weak convergence and embedding. Coll. Math.
Soc. J. Bolyai 11, Limit Theorems of Probability Theory (P. Rrvrsz, Ed.), North-Holland,
Amsterdam-London, pp. 149-165.
Koziol, J. A. and Nemec, A. F. (1979). On a Cramrr-von Mises type statistic for testing bivaffate
independence. Canad. J. Statist. 7, 43-52.
Kffvyakova, E. N., Martynov, G. V. and Tyurin Yu. N. (1977). On the distffbution of the tO E
statistic in the multidimensional case. Th. Probab. Appl. 22, 406--410.
Kuelbs, J. (1976). A strong convergence theorem for Banach space valued random variables. Ann.
Probab. 4, 744-771.
Martynov, G. V. (1978). Omega-Square Criteria (in Russian). Nauka, Moscow.
Mason, D. M. (1982). Some characterizations of almost sure bounds for weighted multidimensional
empirical distributions and a Glivenko--Cantelli theorem for sample quantiles. Z. Wahrsch.
Verw. Gebiete 59, 505-513.
Moore, D. S. and SpruiU, M. C. (1975). Unified large-sample theory of general chi-squared statistics
for tests. Ann. Statist. 4, 599-616.
Mtiller, D. W. (1970). On Glivenko-Cantelli convergence. Z. Wahrsch. Verve. Gebiete 16, 195-210.
Neuhaus, G. (1976). Weak convergence under contiguous alternatives of the empirical process
when parameters are estimated: The Dk approach. In: P. Gaenssler and P. Rrvrsz, eds.,
Empirical Distributions and Processes, Lecture Notes in Mathematics 566, Springer, Berlin, pp.
68-82.
Neuhaus, G. (1971). On weak convergence of stochastic processes with multidimensional time
parameter. Ann. Math. Statist. 42, 1285-1295.
Neuhaus, G. and Sen, P. K. (1977). Weak convergence of tail-sequence processes for sample
distributions and averages. Mitt. Math. Sere. Giessen 123, 25-35.
O'ReiUy, N. (1974). On the weak convergence of empirical processes in sup-norm metrics. Ann.
Probability 2, 624-651.
462 Mikl6s Cs6rgg
Parzen, E. (1979). Nonparametric statistical data modelling. J. Amer. Statist. Assoc. 58, 13--30.
Parzen, E. (1980). Quantile functions, convergence in quantile, and extreme value distribution
theory. Texas A & M University Technical Report No. B-3.
Philipp, W. (1973). Empirical distribution functions and uniform distribution mod 1. In: C. F.
Osgood, ed., Diophantine Approximation and its Applications. Academic Press, New York.
Philipp, W. and Pinzur, L. (1980). Almost sure approximation theorems for the multivariate
empirical process. Z. Wahrsch. Theorie Verw. Gebiete 54, 1-13.
Pyke, R. and Shorack, G. R. (1968). Weak convergence of the two-sample empirical process and a
new approach to Chenott-Savage theorems. Ann. Math. Statist. 39, 755-771.
Rrvrsz, P. (1976a). On strong approximation of the multidimensional empirical process. Ann.
Probab. 4, 729-743.
Rrvrsz, P. (1976b). Three theorems of multivariate empirical process. In: P. Gaenssler and P.
Rrvrsz, eds., Empirical Distributions and Processes, Lecture Notes in Mathematics 566. Springer,
Berlin, pp. 106-126.
Seneta, E. (1976). Regularly Varying Functions. In: Lecture Notes in Mathematics 508, Springer,
Berlin.
Shorack, G. R. (1979). Weak convergence of empirical and quantile processes in sup-norm metrics
via KMT-construction. Stoch. Process. Appl. 9, 95-98.
Shorack, G. R. (1982). Weak convergence of the general quantile process in l['/qH-metrics (pre-
liminary report). IMS Bull. 11, Abstract 82t-2.
Straf, M. L. (1971). Weak convergence of stochastic processes with several parameters. Proc. Sixth
Berkeley Symp. Math. Statist. Prob. 2, University of California Press, pp. 187-221.
Stute, W. (1977). Convergence rates for the isotrope discrepancy. Ann. Probab. 5, 707-723.
Stute, W. (1982). The oscillation behaviour of empirical processes. Ann. Probab. 10, 86-107.
Tusnfidy, G. (1977a). A remark on the approximation of the sample DF in the multidimensional
case. Periodica Math. Hungar. 8, 53-55.
Tusn~idy, G. (1977b). A Study of Statistical Hypotheses (in Hungarian). Candidatus Dissertation,
Hungarian Academy of Sciences.
Tusq~idy, G. (1977c). Strong invariance principles. In: J. R. Barra et al., eds., Recent Developments
in Statistics. North-Holland, Amsterdam, pp. 289-300.
Wellner, J. A. (1978). Limit theorems for the ratio of the empirical distribution function to the true
distribution function. Z. Wahrsch. Verw. Gebiete 45, 73-88.
Wichura, M. (1973). Some Strassen-type laws of the iterated logarithm for multiparameter
stochastic processes with independent increments. Ann. Probab. 1, 272-296.
Yeh, I. (1960). Wiener measure in space of functions of two variables. Trans. Amer. Math. Soc. 95,
433-450.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 t.)-1
Elsevier Science Publishers (1984) 463-485
Jana Jure~kovd
1. Introduction
2. Estimation of location
463
464 Jana Jure~kov(t
2 O(x,- t) = 0 (2.1)
i=1
,(x) = - f ( x ) f f ( x ) , x E R'
M-, L- and R-estimators 465
{x if Ix I ~< c, (2.4)
~b(x) = sign x if Ix[ > c,
for some c > 0. The pertaining M-estimators, called redescending, are studied
by Collins (1977), Portnoy (1977), Collins and Portnoy (1981); see also Huber
(1981) and Hampel, Rousseeuw and Ronchetti (1981).
F(x)+ F ( - x ) = l, x~ ~.
O) M , ( x l + c , - . . , x , + c ) = M , ( x l . . . . . x,)+c forx~R", c E R 1,
with c19being the d.f. of the standard normal distribution, e > O. Let M. be defined
as
P ( M . = M+.) = P ( M . = M ; ) = (2.7)
where M ; , M+. are defined in (2.3) and ~b is given in (2.4) with c > 0 satisfying
PROOV. The theorem is proved in Huber (1968); see also Huber (1969).
Further and more general finite-sample minimax results may be found in
H u b e r and Strassen (1973), Rieder (1977, 1980); see also Huber (1981).
~ / n ( M , - 0)-->
e N(0, o-2(~b,F)) as n ~ (2.12)
where
0-2(~, F ) = f 02(x)dF(x). ( f f ( x ) d 0 ( x ) ) -2 . (2.13)
The more we assume about ~b, the less we need to impose on F to achieve
the asymptotic normality of M,; for instance, if qJ is a step-function, then the
derivative of F should exist only in a neighborhood of the jump-points of ~b.
We see that, for qJ bounded, o-2(~b,F) is finite for a large class of
distributions. The characteristic SUpFe~O-2(qj,F) may be considered as a
measure of robustness of the M-estimator generated by ~b over the family 0%. If
o% is a neighborhood of a given distribution, for instance of the normal one,
there may exist an optimal ~b which minimizes SUpFE~ trZ(qJ, F). Let us illustrate
one of such minimax results (established by Huber (1964)) corresponding to the
case that 0% forms a special neighborhood of the normal distribution.
THEOREM 2.2 (Huber, 1964). Let 0%, be in the family of e-contaminated normal
distributions, i.e.,
where ill is the set of all symmetric distribution functions, e is a fixed number,
0 <~e < 1, and 49 is the standard normal d.f. Denote by ~Oo(x) the function defined
in (2.4) with c satisfying
dqO
2[(f*(c)/c)- 1 + q~(c)] = e l ( l - e), f*(x) = dx " (2.15)
Then
sup ~rZ(q,o,F) = inf sup o-2(~0,F ) (2.16)
and the supremum on the left-hand side of (2.16) is attained for the d.f. Fo with
the density
REMARK 1. The distribution (2.17) is the least informative one in ~~, i.e.
I(Fo) = inf{I(F): F E ~ } ; the M-estimator generated by qJ0 is the maximum
likelihood estimator for F0.
where R+i(Oo) is the rank of IX~ - 00] among ]Xa- 00[. . . . , IX, - 001 and q~+(t)=
q~((t+ 1)/2), 0 < t < l , where q~(t) is nondecreasing and square-integrable
function, q~(1-t)=-~0(t), 0<t<l. The statistic S . ( X - t ) is then
nonincreasing in t and attains positive as well as negative values with
probability 1 and Eo,S,(X - 00) = 0. The R-estimator of 0 is then defined as a
solution of the equation S , ( X - t) = 0; more precisely, it is defined as
M-, L- and R-estimators 469
1 R -, + R +)
R, = ~( (2.21)
where
R ; = sup{t: S,(X - t) > 0}, R + = inf{t: S,(X - t) < 0}. (2.22)
The R-estimators of location, which are the inversions of the signed-rank tests,
were suggested by Hodges and Lehmann (1963). Only some single R-estimators
could be given a simple explicit form: besides the sample median (which is the
inversion of the sign test), the most well-known is the R-estimator
corresponding to the Wilcoxon one-sample test (usually called H o d g e s -
Lehmann's estimator); it can be written as
R. = m e d { X @ - : l<~i,j<-n}. (2.23)
REMARK. The properties (i), (iii) and (iv) are analogous to these of M-
estimators. The value e in (iv) is independent of F. The property (ii) is the only
one which we miss in the case of M-estimators. On the other hand, R-
estimators do not have the finite-sample minimax property of Huber's M-
estimator (see Theorem 2.1).
X/n(R, - 0)-~
~ N(0, o-2(~0,F)) (2.25)
with
tr2(~, F) = fo' ~2(t) dt " ( f ~o(F(x))f'(x) dx ) -2 . (2.26)
then or2(@,F ) = 1/1(F); it means that the class of R-estimators also contains an
asymptotically efficient element. Similarly as in the case of M-estimators, we
are interested in the behavior of supF~O'2(q~,F) over some family o% of
distributions, e.g. over the family , ~ of contaminated normal distributions
(2.14). Then (cf. Jaeckel, 1971), if we put q~0(t)= q~to(t), 0 < t < 1, with f0 being
the least informative distribution (2.17), i.e.,
-c if t<a,
~Oo(t)= @-l((t- e)/(1- e)) if a ~< t ~ 1 - a , (2.28)
c if t > 1 - o~,
of 0 is defined as
L, = ~ CniXn:i (2.29)
i=l
This class of estimators covers the sample m e a n as well as the sample median.
T h e L-estimators are computionally m o r e appealing than M- and R-
estimators. If we wish to get a robust L-estimator, insensitive to the e x t r e m e
observations, we must put c,~ = 0 for i <~ k, and i >f n - k, + 1 with a p r o p e r k,.
Typical e x a m p l e s of such estimators are the a - t r i m m e d m e a n ,
.-[n~l
L, = (1/(n-2)[nal) ~ X,:, (2.311
i=[na]+l
and the a - W i n s o r i z e d m e a n ,
n-[hal
L , = - 1 { [ na]X.:i,~l+ X.:i + [na]X . . . . t,~l+~} ; (2.32)
n i=[nal+l
L. = 1 J X . : i + 2 ajX.:~pj~ (2.33)
rt i=1 j=l
THEOREM 2.3. Let X1, X2, . . be the sequence of independent observations from
the d.f. F ( x - O) such that F ( x ) + F ( - x ) = 1, x E R 1. Let J ( u ) be a function such
that J(u) = J(1 - u), 0 < u < 1 and fd J ( u ) du = 1. Then, under the assumptions
(i) J ( u ) = O for O < u < a and 1 - a < u < l , is bounded and satisfies a
Lipschitz condition of order > (except possibly a finite number of points of F -1
measure 0);
(ii) f IF(x)(1 - F(x))[ m dx < oo and
t"
tr2(J, F ) = J J J(F(x))J(F(y))[F(min(x, y ) ) - F(x)F(y)] dx dy
(2.34)
is positive.
Then the estimator
L"=li~__lJ-n
_ n:i (2.35)
satisfies
If Ln is of the form (2.33) and the second component does not vanish, then,
under the assumptions of Theorem 2.3, ~v/n(L, - 0) is asymptotically normally
M-, L- and R-estimators 473
3. Estimation of regression
X~ = C.O + E (3.1)
P
8i(t) = X~ - ~ ci#i, i = 1. . . . , n , (3.2)
j=l
3.1. M-estimators
The M-estimator M. of 0 is defined as a solution of the system of equations
with respect to tl ..... tp. If there are more solutions of (3.3), then Mn may be
defined as that the nearest to some proper preliminary consistent estimator of
6. If F has an absolutely continuous density f and we put ~O(x)= - f ' ( x ) / f ( x ) ,
x E R 1, we get the m.l.e, of O; M, coincides with the l.s.e, if ~b(x) = x, x E R 1.
Similarly as in the location case, Mn is translation-equivariant but generally
not scale-equivariant, so that, unless the scale of F is supposed to be known, M~
should be supplemented by an appropriate estimator of scale.
The asymptotic behavior of M, as n ~ ~ was studied by Relies (1968), H u b e r
M-, L- and R-estimators 475
(1972, 1973), Yohai and Maronna (1979), among others. Under the assumptions
on ~Oand on F analogous to those in the location case (besides the assumption
of symmetry of F), it was shown that, as n ~ , ~ / 2 ( M n - O) is asymp-
totically p-dimensionally normally distributed with expectation 0 and with the
covariance matrix o'2(~b,F)lp with o-2(~b,F) given in (2.13) and ~ , = C ' C , ; the
matrix ~ , is assumed to be positive and of the rank p for n t> no. We see that,
the sequence {C,} being fixed, the efficiency properties of M, depend only on
the constant o-2(~0,F) and are analogous to these in the location case. This
further implies that the asymptotic minimax property of M-estimators over the
family ~ , of e-contaminated normal distributions (see Section 2.1.2), extends
to the linear model (3.1).
Huber (1973) considered the asymptotic behavior of M, in the case that
p ~ oo simultaneously with n. An extension of M-estimators to the multivariate
linear model and its asymptotic behavior was studied by Maronna (1976) and
Carroll (1978). M-estimators of regression parameters with random design
matrix were studied by Maronna, Bustos and Yohai (1979). Bahadur type
representation of M-estimators in the linear model was considered by Jure6k-
ova and Sen (1981a,b).
3.2. R-estimators
R-estimators of regression parameters are inversions of linear rank tests of
regression. The general rank test of the hypothesis H : 0 = 00 in the model (3.1)
is based on the vector of statistics
i=1
where R,i(Oo) is the rank of the residual 6i(0o) among 61(0o) . . . . . 6,(00) and
(t) is a nondecreasing square-integrable score function, 0 < t < 1. Denote
$ . ( t ) = ( S , l ( t ) , . . . , S,p(t))'; then, under 0 = 00, EooS,(Oo) = 0 and analogously
as in the location case, we may define the R-estimator of 00 as any solution of
the system of 'equations'
S,j(t)'=- O, ] = 1 . . . . . p , (3.5)
with respect to t.
The statistics (3.4) are invariant to the translation, so that they are not able
to estimate the main additive effect (i.e., the component 0r for which co = 1,
i = 1 , . . . , n). The main additive effect should be estimated with the aid of the
signed rank statistics on the same line as the location parameter (el. Jure~kov~i,
1971b).
Adichie (1967), following the ideas of Hodges and Lehmann suggested an
estimator of (01, 02) in the regression model X~ = 01 + 02cj + Ej, i = 1 . . . . . n,
based on the Wilcoxon tests and derived its asymptotic distribution. Jure~kovfi
(1971a), Koul (1972) and Jaeckel (1972) then extended the procedure to the
476 Jana Jure~kov6
p-parameter regression and to the general linear rank tests. The three respec-
tive R-estimators are asymptotically equivalent and thus they have the same
asymptotic distributions and efficiencies. The estimators differ in the way how
they describe the solution of (3.5). Jure6kovfi (1971a) suggested the estimator
R, as any solution of the minimization problem
n
with respect to t; ft, = (1/n)E~'=l (i/(n + 1)). The idea is that (3.7) could be
considered as a measure of the dispersion of the residuals 6i(t), i = 1. . . . . n,
instead of the proper variance of the residuals which is used in the method of
least squares. Jaeckel proved the asymptotic equivalence of the solution of (3.7)
and of (3.6), respectively, as n ~ ~.
Koul (1971) suggested the R-estimator as minimizing, instead of (3.6) and
(3.7), an appropriate quadratic form in the statistics S,j(t), j = 1. . . . . p, with
respect to t. All three estimators are asymptotically equivalent, as n ~ ~.
3.3. L-estimators
While being computionally very appealing in the location case, the L-
estimators do not have any straightforward extension to the linear model. Let
us mention some of the attempts which appeared in the literature.
Koenker and Bassett (1978) extended the concept of quantiles to the linear
model. For a fixed a, 0 < a < 1, put
with respect to t = (q . . . . . tp)'. They proved that the asymptotic behavior of the
regression quantiles is similar to that of the standard sample quantiles and
suggested the following a-trimmed least squares estimator:
Remove X/from the sample if 6i(T.(a))< 0 (the i-th residual from T.(a) is
negative) or if 6 i ( T . ( 1 - a ) ) > O , i = 1 . . . . . n; 0 < a < ; and calculate the
least-squares estimator using the remaining observations.
The resulting estimator L* was later studied by Ruppert and Carroll (1980)
who proved that X1,/2(L*- 0),~,/z is asymptotically normally distributed with
the expectation 0 and with the covariance matrix o'2(a, F)I v where o'2(a, F) is
the asymptotic variance of the a-trimmed mean in the location ease. Jure~kovfi
(1983) proved that L* is asymptotically equivalent, in probability, to the Huber
estimator of 0 generated by the function ~b of (2.4) with c = F-1(1- a). The
regression quantiles seem to provide a basis for the extension of various
L-estimators from the location to the regression model.
Ruppert and Carroll (1980) also suggested another extension of the a-
trimmed mean to the linear model. Starting with some reasonable preliminary
estimator L0, one calculates the residuals 6i(Lo) fromL0, i = 1. . . . . n, and
removes the observations corresponding to [na] smallest and [na] largest
residuals. The estimator L** is then defined as the least-squares estimator
calculated from the remaining observations. The asymptotic behavior of L**
depends on L0 and generally is not similar to that of the trimmed mean; L** is
asymptotically equivalent to L*, provided L0 = (T,(a) + T,(1 - a)).
Bickel (1973) proposed a general class of one-step L-estimators of 0 depend-
ing on a preliminary estimate of 0. The estimators have the best possible
efficiency properties, i.e. analogous to those of the corresponding location
L-estimators but they are computationally complex and are not invariant under
a reparametrization of the vector space spanned by the columns of C,.
asymptotically as the root of the equation. This idea was applied by Kraft and
van Eeden (1972a,b) to the R-estimators of location and regression, respec-
tively. Bickel (1975) studied the one-step versions of the M-estimators in the
linear model.
Let us first describe the one-step version of the M-estimator. Let M, be the
M-estimator of 0 in the linear model (3.1), defined as the solution of the system
of equations (3.3). Assume that the design matrix C, satisfies the condition
n - I C ' C , - ~ , ~ as n ~ ~ where ~; is a positive p p matrix. Then, provided F
has an absolutely continuous density f, I ( F ) < oo and ~Ohas bounded variation
on any compact interval,
for 1 ~<j, k ~<p, where ,~* = [Orjk]j,k=l,..., p is a positive matrix. Then, provided ~o
M-, L- and R-estimators 479
and V n I I R , - R,][---~0
' p as n ~ ~. However, y is generally unknown; Kraft and
van Eeden (1972b) suggested to replace y in (4.11) by J'd (~(t)-ff)Zdt, ff =
J'd q~(t)dt. The resulting estimator (say, R") is generally not asymptotically
equivalent to R, ; it could be proved (cf. Humak, 1983) that ~ / n H R , - R'.'II& 0
as n ~ oo if and only if either both R, and R" are asymptotically efficient (i.e.,
where ~0(t)= -f'(F-l(t))/f(F-l(t))) or if R, ~ n d thus also R") is asymptotically
equivalent to the preliminary estimator 0,. In order to get an estimator
asymptotically equivalent to R,, we should replace 3' in (4.11) by an ap-
propriate estimator "/,, similarly as in the case of M-estimator. One of such
possible estimators is
q/. = n-1/21[t2- till -1" II,~*-l(Sn (Sn -- tl-1/2t2) -- Sn(Sn -- n-1/2tl))ll, (4.12)
We have seen that the three groups of estimators, though being defined in
different ways, follow the same idea: to cut-off the influence of outliers and to
diminish the sensitivity to the long-tailed distributions. It turns out that these
three classes of estimators are even nearer than one would expect; in fact, they
become asymptotically equivalent as n ~ oo.
The asymptotic relations of M-R-L-estimators were studied by Jaeckel
(1971), Bickel and Lehmann (1975), Jure~kovfi (1977, 1978, 1981), Hugkovfi and
Jure~kovfi (1981), among others. Let us briefly illustrate some of the results on
the location submodel.
Let X1, X2 . . . . be the sequence of independent observations, identically
distributed according to the distribution function F ( x - O ) such that
480 JanaJure~kov6
for almost all x E R ~. The relation (5.1) means that, given the distribution F,
there exists an M-estimator to every R-estimator (and vice versa) such that
both estimators are asymptotically equivalent. Being dependent on the un-
known d.f. F, the relation (5.1) does not enable to determine the value of the
M-estimator once we have calculated the value of R-estimator; it rather
indicates which type of M-estimators belongs to a given type of R-estimators
etc.
Let L, be the L-estimator (2.34) generated by the function J ( t ) such that
J ( t ) = J(1 - t)/> 0, 0 < t < 1. Then, under some smoothness conditions on J and
F, V ' n ( L . - M . ) = o p ( l ) as n ~ o o for the M-estimator M. generated by the
function
1
qt(x) =
f0 J ( t ) ( I [ F ( x ) <~ t] - t) dF-l(t), xE R 1. (5.2)
if x < F - l ( a ) ,
0(x) = x if F - l ( a ) <- x <~ F-l(1 - a ) , (5.3)
F-l(1 - a ) if x > F-l(1 - a ) .
-1 _ a
References
Adichie, J. N. (1967). Estimate of regression parameters based on rank tests. Ann. Math. Statist.
38, 894-904.
Andrews, D. F., Bickel, P. J., Hampel, F. R., Huber, P. J., Rogers, W. H. and Tukey, J. W. (1972).
Robust Estimates of Location. Survey and Advances. Princeton University Press.
Antille, A. (1974). A linearized version of the Hodges-Lehmann estimator. Ann. Statist. 2,
1308-1313.
Azencott, R., Birgr, L., Costa, V., Dacunha-Castelle, D., Deniau, C., Deshayes, J., Huber, C.,
Jolivaldt, P., Oppenheim, G., Picard, D., Trrcourt, P. and Viano, C. (1977). Thforie de la
robustesse et estimation d'un parametre. Astdrisque 43-44.
Beran, R. (1974). Asymptotically efficient adaptive rank estimates in location models. Ann. Statist.
2, 63-74.
Beran, R. (1977a). Robust location estimates. Ann. Statist. 5, 431-444.
Beran, R. (1977b). Minimum Hellinger distance estimates for parametric models. Ann. Statist. 5,
445-463.
Beran, R. (1978). An efficient and robust adaptive estimator of location. Ann. Statist. 6, 292-313.
Bickel, P. J. (1965). On some robust estimates of location. Ann. Math. Statist. 36, 847-858.
Bickel, P. J. (1967). Some contributions to the theory of order statistics. Proc. 5th Berkeley Syrup. 1,
575-591.
Bickel, P. J. (1975). One-step Huber estimates in the linear model. J. Amer. Statist. Assoc. 70,
428-434.
Bickel, P. J. (1976). Another look at robustness: A review of reviews and some new developments.
Scand. Journ. of Statistics 3, 145-168.
Bickel, P. J. and Lehmann, E. L. (1975). Descriptive statistics for nonparametric model. I.
Introduction, II. Location. Ann. Statist. 3, 1038-1044 and 1045-1069.
Birnbaum, A. and Laska, E. (1967). Optimal robustness: A general method, with applications to
linear estimates of location. J. Amer. Statist. Assoc. 62, 1230-1240.
Bjerve, S. (1977). Error bounds for linear combinations of order statistics. Ann. Statist. 5, 357-369.
Boos, D. D. (1979). A differential for L-statistics. Ann. Statist. 7, 955-959.
Boos, D. and Serfling, R. J. (1979). On Berry-Essen rates for statistical functions, with application
to L-estimates. Technical Rep., Dept. Statist., Florida State University.
Boos, D. and Serfling, R. J. (1980). A note on differentials and the CLT and LIL for statistical
functions, with application to M-estimates. Ann. Statist. 8, 618-624.
Carroll, R. J. (1978). On the asymptotic distribution of multivariate M-estimates. Journ. Multivar.
Analysis 8, 361-371.
Chernoff, H., Gastwith, J. L. and Johns, M. V. (1967). Asymptotic distribution of linear com-
binations of order statistics, with application to estimation. Ann. Math. Statist. 38, 52-72.
Collins, J. R. (1976). Robust estimation of a location parameter in the presence of asymmetry. App.
Statist. 25, 228-237.
Collins, J. R. (1977). Upper bounds on asymptotic variance of M-estimators of location. Ann.
Statist. 5, 646-657.
Collins, S. L. and Portnoy, J. R. (1981). Maximizing the variance of M-estimators using the
generalized method of moment spaces. Ann. Statist. 9, 567-577.
Dutter, R. (1975a). Robust regression: Different approaches to numerical solution and algorithms.
Res. Report No. 6/ETH Zfirich.
Dutter, R. (1975b). Numerical solution of robust regression problems: Computional aspects, a
comparison. Res. Report No. 7, ETH Ziirich.
482 Jana Jure~kovd
Kraft, C. and van Eeden, C. (1972b). Linearized rank estimates and signed rank estimates for the
general linear hypothesis. Ann. Math. Statist. 43, 42-57.
Krasker, W. S. and Welsch, R. E. (1982). Efficient bounded-influence regression estimation. Journ.
Amen. Statist. Assoc. 77, 595-604.
Launer, R. L. and Wilkinson, G. N., eds, (1979). Robustness in Statistics. Academic Press, New
York.
Maronna, R. A. (1976). Robust M-estimates of multivariate location and scatter. Ann. Statist. 4,
51-67.
Maronna, R., Bustos, O. and Yohai, V. (1979). Bias- and efficiency robustness of general
M-estimators for regression with random carriers. In: T. Gasser and M. Rosenblatt, eds.,
Smoothing Techniques for Curve Estimation, 91-116. Lecture Notes in Math. 757. Springer,
Berlin.
Millar, W. S. (1981). Robust estimation via minimum distance methods. Z. Wahrsch. Verw. Geb.
55, 73-89.
Miura, R. (1981). Adaptive confidence intervals for a location parameter. The Keiei Kenkyu XXXI,
197-218.
Nowak, H. and Zentgraf, R., eds. (1980). Robust Verfahren. Medizinische lnformatik und Statistik
No. 20. Springer, Berlin.
Portnoy, S. L. (1977). Robust estimation in dependent situations. Ann. Statist. 5, 22-43.
Raoult, J. P. ed. (1981). Statistique Non Param~trique Asymptotique. Lecture Notes in Math. 821,
Springer, Berlin.
Relies, D. A. (1968). Robust regression by modified least squares. Ph.D. Thesis, New York.
Relies, D. A. and Rogers, W. H. (1977). Statisticians are fairly robust estimators of location. J.
Amer. Statist. Assoc. 72, 107-111.
Rey, W. J. J. (1978). Robust Statistical Methods. Lecture Notes in Math. 690. Springer, Berlin.
Rieder, H. (1977). Least favorable pairs for special capacities. Ann. Statist. 5, 909-921.
Rieder, H. (1980). Estimates derived from robust tests. Ann. Statist. 8, 106-115.
Rocke, D. M. and Downs, G. W. (1981). Estimating the variances of estimators of location:
Influence curve, jackknife and bootstrap. Comm. Statist. B 10, 221-248.
Rocke, D. M., Downs, G. W. and Rocke, A. J. (1982). Are robust estimators really necessary?
Technometrics 24, 95-101.
Rousseeuw, P. J. (1981). A new infinitesimal approach to robust estimation. Z. Wahrsch. Verw.
Geb. 56, 127-132.
Ruppert, D. and Carrol, R. J. (1980). Trimmed least squares estimation in the linear model. J.
Amer. Statist. Assoc. 75, 828-838.
Sacks, J. and Ylvisaker, D. (1982). L- and R-estimation and the minimax property. Ann. Statist. 10,
643-645.
Sen, P. K. (1977). On Wiener process embedding for linear combinations of order statistics.
Sankhya Ser. A 39, 138-143.
Sen, P. K. (1978). An invariance principle for linear combinations of order statistics. Z. Wahrsch.
Verw. Geb. 42, 327-340.
Sen, P. K. (1980). On nonparametric sequential point estimation of location based on general
rank order statistics. Sankhya Ser. A 42, 202-218.
Sen, P. K. (1981). Sequential Nonparametrics. lnvariance Principles and Statistical Inference. Wiley,
New York.
Serfting, R. J. (1980). Approximation Theorems of Mathematical Statistics. Wiley , New York.
Shorack, G. R. (1969). Asymptotic normality of linear combinations of functions of order statistics.
Ann. Math. Statist. 40, 2041-2050.
Shorack, G. R. (1972). Functions of order statistics. Ann. Math. Statist. 43, 412-427.
Shulenin, V. P. (1978). On the stability of a class of estimators of Hodges-Lehmann's type (in
Russian). Proc. Conf. on Theory of Coding and Inform. Trans. VI, 147-151 (Vilnius, USSR).
Shulenin, V. P. (1980). Minimax estimators and numerical characteristics of their stability (in
Russian). Math. Statistics and its Applications 32--47, University of Tomsk.
Sievers, G. L. (1978). Estimation of location: A large deviation comparison. Ann. Statist. 6,
610-618.
M - , L- and R-estimators 485
Stigler, S. M. (1969). Linear functions of order statistics. Ann. Math. Statist. 40, 770-788.
Stigler, S. M. (1974). Linear functions of order statistics with smooth weight functions. Ann. Statist.
2, 676-693.
Stigler, S. M. (1977). Do robust estimators work with real data? Ann. Statist. 5, 1055-1098.
Torgenson, E. N. (1971). A counterexample on translation invariant estimators. Ann. Math. Statist.
42, 1450-1451.
van Eeden, C. (1970). Efficiency-robust estimation of location. Ann. Math. Statist. 41, 172-181.
van Zwet, W. R. (1980). A strong law for linear functions of order statistics. Ann. Probab. 8,
986-990.
Wegman, E. J. and Carroll, R. J. (1977). A Monte Carlo study of robust estimation of location.
Comm. Statist. A 6, 795-812.
Wellner, J. A. (1977a). A Glivenko-Cantelli-theorem and strong laws of large numbers for
functions of order statistics. Ann. Statist. 5, 473-480; correction note Ann. Statist. 6, 1394.
Wellner, J. A. (1977b). A law of iterated logarithm for functions of order statistics. Ann. Statist. 5,
481-494.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics. Vol. 4 r,)~,)
/...a/.a
Elsevier Science Publishers (1984)487-514
1. Introduction
There are certain basic estimation problems for which procedures based on a
prefixed sample size may not workout, and for valid solutions, some multi-stage
or sequential procedures are needed. There are other situations where the
observations are gathered sequentially, so that for the desired inferential
purpose, a stopping rule needs to be adapted along with the estimation rule. A
sequential estimation procedure is defined by a stopping rule and an estimation
rule; this characterization applies to both the point and interval estimation
problems. For these two problems, the associated stopping rules may or may not
be isomorphic, but the estimation rules are generally different. Note that if
(J2, ~ , P ) be a probability space and { ~ ; n/> 1} be an increasing sequence of
sub-sigma fields of ~, then a measurable function N (= N(t0)) taking values in N
(= { 1 , . . . , ~}) is called a stopping rule if {N = n} ~ ~ , Vn i> 1 and P { N = ~} = O.
In a sequential scheme, sampling is curtailed at the n-th stage if N = n, and, based
on the observations gathered up to the n-th stage, an estimation rule is then
employed for the desired inference on the parameter(s) of interes{. In such a
sequential scheme, if the stopping and estimation rules are based on appropriate
(sequence of) statistics without making explicit assumptions on the form of the
underlying distribution(s) of the random variables, then the resultant procedures
are termed nonparametric sequential estimation procedures.
We shall discuss mainly the developments on nonparametric methods in the
two basic sequential estimation problems: (i) minimum risk point estimation of
a parameter, and (ii) confidence interval for a parameter with a prefixed
coverage probability and having a width bounded by some prespecified positive
constant, where in either case, the parameter is interpreted as a functional of
the underlying distribution function. If the form of the underlying distribution
is not specified or if it involves some unknown algebraic constants (appearing
as parameters), then the minimum sample size needed to achieve the desired
goal (for either of the problems) cannot be determined in advance, so that an
estimation procedure based on a prefixed sample size may not workout. In
such a case, multi-stage or sequential procedures can be adapted with success,
487
488 Pranab Kumar Sen
and these will be considered here. In passing, we may remark that the two or
multi-stage procedures are the precursors of the sequential ones; these provide
the desired solutions, but may not be fully efficient, even in some convenient
asymptotic setup. We shall review these briefly as we proceed along. In the
developments to follow, the asymptotic theory plays a vital role: It enables one
to encompass a broad class of statistics in a broad class of problems, and these
sequential procedures usually retain their efficiency in some asymptotic setup
(viz., the cost of sampling or the width of the confidence interval is made to
converge to 0). Though the asymptotic theory may not be strictly applicable in
a nonasymptotic case, yet the asymptotic solutions provide reasonably good
approximations for the 'moderate case' as well. Further, in the asymptotic case,
the solutions can be derived in a unified manner, whereas in the nonasymptotic
case, exact solutions may either demand more stringent regularity conditions or
depend on the specific problem at hand.
In Section 2, we consider the minimum risk sequential point estimation
problem. We motivate the procedure through a parametric approach and then
present the nonparametric generalizations. Section 3 is devoted to the study of
the bounded-width sequential confidence interval problem, where also the
procedure is motivated through a parametric case. Section 4 deals with the
properties of the stopping times for either of these problems. In Section 5,
nonparametric and parametric procedures are compared in the light of the
asymptotic efficiency results. Some general discussions a r e made in the con-
cluding section. For the dual sequential testing problems, we may refer to Chapter
28 due to M/iller-Funk.
Then, for the sequence {T,} of estimators and corresponding to the risk
function in (2.2), T* is the minimum risk estimator of 0. (Later on, we shall
comment on the choice of g(.) and c(n) in this context.) In actual practice, for a
nontrivial g(.), generally ,/2 depends not only on n but also on some other
(unknown) parameter(s) (functional) of the d . f . F . For example, when the Xi
are the real valued r.v. and we take 0 = J" x d F ( x ) = mean (Ix) of the d.f. F,
and let T, = J~, = n -1 ~7=1X~, n I> 1 and g(y) = ay 2, y E E, for some a > 0, then
y2 = an-lo-2 depends on n as well as o.2 = J" (x - Ix)2 dF(x), which we need to
assume to be finite. If c ( n ) - C o + cn for some Co, c (>0), then
so that by (2.5) and (2.6), n * ( n * - 1 ) < ~ ac-lo-2< n*(n*+ 1). Thus, n* depends
on both (c, a) and o-2, and hence, no fixed-sample size (n) can lead to the
minimum risk estimation (MRE) simultaneously for all o- (>0). A similar case
arises if we let g ( y ) = a ] y l b and c ( n ) = Co+ cnd for some b > 0, d > 0. In this
setup, for normal F, n* can be explicitly obtained in terms of a, b, d, Co, c and
o-2, while for nonnormal F, the solution will depend on F through the
functional Vb = f[X] b dF(x). Hence, in any case, M R E depends on the unknown
o- or some other functional of the d.f. F, and a prefixed sample size may fail to
yield the M R E when y2 is not completely specified. For this reason, we take
recourse to multi-stage or sequential procedures.
T o motivate the sequential procedures, we go back to the normal mean
problem, stated earlier, and take g ( y ) = ay 2, a > 0 , y E E . Let $2, =
(n - 1)-a Y~I'=I(Xi - 32,) 2, n I> 2, be the sequence of sample variances. For some
initial sample size no (>~2), we define a stopping variable N (=Arc) by letting
and c o n s i d e r the two-stage estimator (Stein, 1945) T n = JqN. Note that for
490 Pranab Kumar Sen
normal F, {)~,, n/> 1} and {S 2, n t> 2} are independent and hence, using the
definition of n* following (2.6), we obtain that for c(n)=-cn, c > 0, the relative
risk of the two-stage procedure with respect to the optimal procedure (if o-
were known) is given by
where we take, for simplicity, n * = (a/c)l/2o ". Since (x + 1/x)>~ 1 Vx i> 0, (2.8)
exceeds one, unless N = n*, with probability 1. On the other hand, ( n 0 - 1)S2%/o-2
has chi-square distribution with n o - 1 degrees of freedom, and hence, (N/n*)has
a nondegenerate distribution, so that (2.8) exceeds one, for every fixed c (>0) and
n0. Thus, for any given no (I>2) and c > 0, the two-stage procedure in (2.7) fails to
be a MRE. It was observed by Mukhopadhyay (1980) that if we allow n0 (= no(c))
to depend on c (>0), in such a way that as c ,l. O, n o ( c ) ~ but cn~(c)~O, then
, P
writing n* as n*, we have N d N c --~ 1, as c ~ 0, and hence, by some standard steps,
(2.8) converges to 1, as c ~ 0. Thus, the modified two-stage procedure is
asymptotically (c ~ 0) MRE. For nonnormal distributions, ) ~ and IN = n] may
not be independent, and the above simple arguments may not hold.
Basically, in a two-stage procedure, one does not update S 2 (see (2.7)), and
hence, the M R E property may not hold. Based on updated versions of {$2~},
sequential procedures for the normal mean problem, were considered by
Robbins (1959), Starr (1966), Starr and Woodroofe (1969), and others. The first
nonparametric attempt (where F is of unspecified form) is due to Ghosh and
Mukhopadhyay (1979); their regularity conditions were relaxed by Chow and
Yu (1981). Sen and Ghosh (1981) extended the theory of asymptotic M R E to a
general class of estimable parameters based on U-statistics. Sen (1980b) has
developed the theory also for the rank based estimators in the location
problem, while Jure~kovfi and Sen (1982) have considered robust non-
parametric procedures based on M- and L-estimators of location and estima-
tors of their variation. These will be systematically reviewed here.
We consider first (asymptotically) risk-efficient sequential point estimation of
location of a (symmetric) distribution. Procedures based on rank statistics,
M-statistics and linear combinations of order statistics (L-statistics) will be
discussed here. Consider the model as in (2.1) through (2.4), where 0 stands for
the location parameter of a d.f. F0(x) = F(x - 0), x E E, where F is symmetric
about 0. The form of the d.f. F is not assumed to be specified. For simplicity, in
(2.1), we take g ( y ) = ay 2 and c ( n ) = cn, where a > 0 and c > 0 are given
constants. Later on, we shall discuss briefly the other cases of Ln in (2.1).
For the statistic Tn, defined as in before (2.1), we assume that there exists an
no (~>1), such that for every n ~>no, 2 = n E ( T _ t r ) 2 exists , and 24._>2 as
n--> oo, where 0 < tr < oo. In this case, the risk function R~ in (2,2) is therefore
given by
Nonparametric sequential estimation 491
Note that for T, = )~,, $2, = 6-2, n ~> no = 2, (2.12) closely resembles (2.7), with
the notable difference that the first-sample estimator S,~ is replace d by S,,
n >I no, and this updating of the estimator is expected to enhance the efficiency
of the sequential procedure over the two-stage procedure. Basically, the main
objective is to show that under appropriate regularity conditions, as c ~, 0,
R*-R.oc(c)=O(c ) as c $ O, (2.15)
492 Pranab Kumar Sen
These are referred to as the second order asymptotic efficiency results. Secondly,
we may also like to study the behavior of Nc as c $ 0. Specifically, we may like
to show that as c + 0,
0 ~
(n)-~/2(Nc - no)---" N(O, ,,~), (2.17)
for some finite u (0 < v < ~). This result is classically known as the asymptotic
normality of the stopping time. This provides useful information on the excess of
Nc over n o when c is small.
With these objectives in mind, we introduce now the R-, M- and L-
estimators of location; these have already been discussed in the nonsequential
case by Jure6kovfi in Chapter 21.
For every n (>/1), consider a signed-rank statistic
tl
S~ = ~'~ si gn X~a.(R.i)
+ + (2.18)
i=l
where U , j < . . . < U,. are the ordered r.v. of a sample of size n from the
uniform (0, 1) d.f. and 4,+(u) = 4,(1 + u)/2, 0 < u < 1, 4, being a nondecreasing,
skew-symmetric function. Suppose that we write Sn in (2.18) as S(Xn), X,--
(XI . . . . . 32,). If we replace X, by X, - a l , , a real, 1, = (1 . . . . . 1) and recom-
pute the signed-rank statistic; we denote the same by S~(a). Then S,(a) is "~ in
a, and the rank-based (R-) estimator of 0 is defined by
Then, we have
Thus, if we let
then, we m a y note that for any a d o p t e d score function 49, A~ is known, while ~b as
well as fo~ 49(u)O(u) d u remain u n k n o w n . T o estimate O'(R),2w e n o t e that for every
n (-->2) and every a E (0, 1), there exists an a . (~<2) and an S,.~, such that
and let
D ( R ) = 2S,,,,/n~,
, [~(R)
tr., - O^(R)'~.
L.,, (2.30)
Finally, we let
n
2 _--
O'n(R) A~/(D~))2, A ~ = --n ~.= {a+(i)}2 " (2.31)
then, it follows from Sen (1980b) that for every k > 0, there exists a positive
integer nok, such that
holds for some 6 < (4+ 2r) -1, ~-> 0, where k is a generic constant, then for
every k < 2(l + r),
where Z has the standard normal distribution. Now, (2.35) ensures that
nE(O,(R)-- 0) 2 = o',(R)2 exists for all n/> n02 and cr2,(m~ ~r~R) as n ~ ~. As such,
we may proceed as in (2.9) through (2.12), and consider the stopping number
If (2.34) holds for some ~ < (4 + 27) -1 where -c > 1 + 2h, with h being defined in
(2.36), then under (2.22) and (2.32), for the proposed sequential rank procedure,
the asymptotic risk efficiency in (2.14) is attained.
For various properties of 0.(M), we may again refer to Chapter 21 (by JureEk-
Nonparametric sequential estimation 495
Thus, if we let
O.{M)= o-(Jy(~),
= 2 (2.44)
Again, as in the case of R-estimators, we let (for some prefixed a : 0 < a < 1),
~CM)
L,n =
sup{t: n-VZW,(t) > ~'~/2S,(M)} , (2.46)
~CM)
U,n = inf{t: n-V2W.(t) <--~'~12S,~M)}" (2.47)
and further, as n ~ ~,
(2k)! 2~
Eo(X/-~lO.M)- Oly k ~ ~ (o'~M)) , (2.50)
where h (>0) is any arbitrary positive number. Under the regularity conditions
mentioned before, for the sequential M-procedure in (2.51) (with the sequen-
tial estimator 0Nc(M)~M)),the asymptotic risk efficiency in (2.14) is attained.
For the location parameter 0, an L-estimator 0,(z) is typically of the form
where X n : l ~ ' " ~ X n : n are the order statistics corresponding to the r.v.'s
X1 . . . . . X, (ties neglected, with probability 1, by the assumed continuity of F )
and the c,~ are suitable constants. By virtue of the assumed symmetry of F, we
would have ideally
where for some smooth function J = {J(t), 0 ~< t ~< 1}, we assume that
Let then
O.L = -~ { F ( x A y ) - - F ( x ) F ( y ) } J ( F ( x ) ) J ( F ( y ) ) dx dy . (2.56)
Parallel to the case of the M-procedure, we assume here that for some
e: (0 < e < ), J,(t) = 0 V 0 ~< t ~< e and 1 - e <~ t ~< e, while J ( t ) is of bounded
variation on [0, 1]. H e r e (2.22) may be replaced by a weaker assumption that
EoIV-dI6.(L).,.,.
t*lt (2k_LkX.
-+ 2 k k ! tr (L) " (2.60)
that c,i = O, i <~[ne] and i >~n - [ne] (in the L-procedure) can be omitted. For
all the three procedures, in addition to (2.14), we may claim that with n o being
defined by (2.10) and Nc by (2.12),
0 1/2
(n~) (TNc - 0)/0---~'N(0, 1) as c $ 0. (2.62)
Asymptotic normality of the stopping time (viz. (2.17)) has also been
established by Jure~kovfi and Sen (1982) for both the L- and M-procedures in
(2.51) and (2.61). In this context, it may be remarked that if in (2.40), ~:2 is
absent, then in (2.17), the order (nO)-1/2 is achieved, while if ~:2 is present, then
(n) - m has to be replaced by (n) -1/4, indicating a slower rate of convergence.
Verification of (2.15) and/or (2.16) remains as open problems. Also, general-
izations of these procedures to general linear models pose harder problems,
and are being investigated now.
Asymptotically risk efficient sequential point estimation theory has been
considered in a more general setup by Sen and Ghosh (1981). They considered
the case of general estimable parameters (which are functionals of the underly-
ing d.f.) and provided solutions based on U-statistics and their estimated
variances. For these U-statistics, we refer to Chapter 8 (by Ghosh).
Corresponding to an estimable parameter O ( F ) = f ' . ' f ~ b ( X l . . . . . Xm)
d F ( x l ) " " dF(x,,), where the kernel 4) of degree m (1>1) is symmetric in its m
arguments, we define the U-statistic U, by
U, =
(n) l ~] ~b(Xi1. . . . . X~m), n ~> m . (2.63)
1~i1<." <im <-n
(2.64)
where the ~'i are functionals of the d.f. F (0 ~< ~'1 < " " " < srm); for their expres-
sions, we may refer to Chapter 8. The case of 0 = / z ---f x d F ( x ) is a special
one, where m = 1, @(x) = x and ~'l = 0-2 = f x 2 d F ( x ) - / z 2. The case of 0-2 is
also a special one, where q~(x, y ) = l ( x - y)2, rn = 2, and ~rl and ~r2 are defined
in terms of the 4th moment of X. As such, for such U-statistics, we are
confronted with the same problem as in (2.1) through (2.5), where n* in (2.5)
and R,. in (2.2) depends on the unknown ~ ' b - . . , ~',,- If we let o-z=
nV(U,-O(F)), n~m, where we assume that E ~ : < % then 0-2=
m2~1 + O(n-~), and the following estimator of 0-2 is due to Sen (1960). Let
and let
where h (>0) is arbitrary. The desired sequential estimator is then UNc. In this
case, (2.10) simplifies to (when ~1 > 0)
n - mc-1/2~1 as c $ 0, (2.68)
while (2.11) reduces to 2 c n - 2 m ( c ~ l ) 1/2. Unlike the case of the R-, L- and
M-procedures, here, instead of (2.32), one needs to assume appropriate
moment conditions on 4', though (2.22) may not be necessary. O(F) is termed
stationary of order 0 if ~'~> 0. Then, we have the following results due to Sen
and Ghosh (1981):
and in (2.67), h ~ (0, 6z/2(2 + 6)), then for the sequential procedure based on
(2.67), the asymptotic risk efficiency in (2.14) holds. Actually, without any
restriction on h (>0), E N c / n ~ 1 as c ~ O, when (2.69) holds. If (2.69) be
replaced by El~bl4+a < 0% for some 6 > 0 and in (2.67), h is restrictefl to (, o~)
then, as c $ O,
Thus, for the case of U-statistics, (2.14) as well as (2.17) hold under quite
general conditions. There is some interest in studying (2.15)-(2.16) fot these
procedures too. For the special case of m = 1 (where nU, is a sum of
i.i.d.r.v.'s), some of these results have recently been studied by Chow and
Martinsek (1982). Their technique encounters considerable difficulties when
m > 1, and these are being investigated now.
In the developments so far, we have restricted ourselves to the case of the
squared error loss ( g ( y ) = y2) and proportional cost ( c ( n ) - cn). Parallel solu-
tions for some other choices of g(.) and c(n) can be worked out in similar ways.
500 Pranab Kumar Sen
Basically, one needs to assume (2.3) and (2.4), and further, to approximate y~
by an estimable function of n. For example, if g ( y ) ~ - [ y [ , then y~ in (2.4) may
be approximated by n-1/2(2/Tr)mo., where o.2 is the asymptotic variance of
n m ( T , - 0). So that, an estimator of o., as in (2.9)-(2.13), may be employed to
define a stopping rule, and showing that for such a rule, (2.14) holds for the
modified risk function too. In any case, the stopping rule depends on the choice
of g(.) and c(.), and hence, adherence to practicality in their choice is a good
criterion.
Multiparameter extensions of the theory have also been worked out by
several workers. For example, if T, is a q-vector (for some q >/1) estimating 0,
also a q-vector, then in (2.1), one m a y t a k e g(llT. - 011)= Ei(T. - O)'A(T. - 0)},
where A is some positive semi-definite matrix. Actually, if we let
n O~ { - I T ~ ( A F ) } I / 2 . (2.74)
i.e., the width of the confidence interval is bounded from above by a pre-
assigned number 2d. Since, in general, the (joint) distribution of (~JL,, 0u,,) may
depend on the underlying F, which is either of unspecified form or involves
u n k n o w n parameters, it may not be possible to determine a value of n, for
which (3.1) and (3.2) hold simultaneously for all such F. For this reason, o n e
may take recourse to sequential schemes. To motivate such sequential schemes,
we again consider the simple normal mean problem (as in after (2.5)). If F is
normal with mean 0 and variance ~r2, then if o- were known, one would, by
letting
obtain that for n = na, 0L,, = J~, - d, 0v,, = J(, + d, (3.1) and (3.2) hold. But na
actually depends on ~r, and hence, if o- were unknown, the solution nd cannot
satisfy (3.1) and (3.2), for all o-. Dantzig (1940) proved the nonexistence of
fixed-sample size confidence intervals for 0 for which both (3.1) and (3.2) hold,
for all cr. Stein (1945) overcame this problem, by considering a two-stage
procedure. Parallel to (2.7), define
N = m a x { n 0 , [t,o-l,~S,o/d
2 2 2] + 1} (3.4)
where $20 is the sample variance computed from the initial sample of size no,
and for Student's t-statistic with n 0 - 1 degrees of freedom, t~0_l,~ is the upper
502 Pranab Kumar Sen
50a % point. If now we consider the interval [J~u - d, J~N + d], then it is easy
to verify that both (3.1) and (3.2) hold. However, the validity of (3.1) depends
very crucially on the underlying d.f. F being normal. Moreover, the r.v. N may
be stochastically much larger than na in (3.3), unless no is chosen large. Nearly
twenty years later on, Chow and Robbins (1965) considered a sequential
procedure, which actually uses updated versions of $2, in (3.4), and the validity
and efficiency are thereby extended to a broader class of distributions.
Generalizations of the Chow-Robbins procedure in various nonparametric
setups have been considered by a host of workers; a general account of these
developments is given in Sen (1981, Chapter 10). A characteristic feature of the
nonparametric procedures is that the confidence intervals are obtained by
inversion of suitable rank statistics, so that the stopping rules may be defined
more naturally in terms of the width. Also, the various other situations, robust
statistics have been employed, and these extend the C h o w - R o b b i n s theory to a
much wider setup. As in Section 2, we consider first the location model, and
then, we will proceed on to other problems as well.
First, we consider the sequential rank procedure. Here, we take F = Fo,
where Fo(x) = F ( x - 0), -oo < x < % F symmetric about 0, and 0 is the location
parameter (median) of F. We define the signed-rank statistic S, as in (2.18)-
(2.19), and for every n (~>2), we define ~(R')
L,n
and t~(g~
V U, n
as in (2.28)-(2.29). Note
that by (2.7)-(2.9), for every n,
L , N "d , , , '
(3.8)
where by definition [in (3.7)], the width of the interval in (3.8) is ~<2d, so that
(3.2) holds. The crucial point is to verify that (3.1) holds in some sense, and to
show that such a procedure is efficient in some sense too. In this context, we
take shelter in an asymptotic setup, where, in (3.2), we allow d ~ 0. Towards
this, we have the following results:
Under essentially the same set of regularity conditions (on the score function
and the d.f.) as in Section 2, N(am is a nonincreasing function of d (>0), it is
finite a.s., E ( N ( f ~) < ~ for all d > 0, lima ~0 N(dR) + ~ a.s., and lima J,0 E ( N ~ f ~) =
=
Nonparametric sequential estimation 503
+~. Further
where R.i = rank of X~ among X 1. . . . , X., for i = 1 . . . . . n, and the scores a~)
(=Ech(U.i) or ck(i/(n + 1)), i = 1. . . . . n are defined as in after (2.19). We write
L. = L.(X.), and, if in (3.13), we replace X. by X . - b e . , b real, c. =
(Cl. . . . . c.)', then the resulting statistic is denoted by L.(b). Note that for ~b/~,
L.(b) is ~ in b E E, and L.(/3) has the same distribution as of L.(0) when
H0:/3 = 0 holds, and the later has mean 0 and variance C.An, 2 2 where C2. =
El'=1 (ci- g~,)2 and AZ, = (n - 1) -1 {~n=l a~(i)- n-l(ET=la,(i))2}. Further, under
Ho, L,(O) has a distribution independent of F. As such, parallel to (2.20), the
R-estimator of/3 may be defined by
Thus, if we let
/~(g)
L, n
= sup{b: Ln(b) > L~)~} (3.17)
/~(R)
U,n
= inf{b: L,(b) < L(,~.)~}
,
(3.18)
[/3(LR)N~R),/3~,,~R,]. (3.19)
where s(a) is '1" in a (>0) with s(1) = 1, then the stopping n u m b e r N(an) satisfies all
the properties m e n t i o n e d before (3.9), and further, (3.9)-(3.11) hold with
(
nd= O-~ A2T2/2/d2 (foI q~(t./)O(/A) du f) (3.21)
and O-~(y)= inf{x: O(x)>~ y}. Thus, the asymptotic consistency and efficiency
both hold for the sequential p r o c e d u r e in (3.19). For the least square estimator,
parallel results are due to Gleser (1965).
Let us next consider procedures based on L-statistics. Parallel to (2.52), we
consider a L-statistic of the form
/1
where the X , :~ are defined as there, g(.) is a suitable function and the score
function J , ( . ) ~ J ( . ) on (0, 1), for some smooth J. Parallel to (3.22), the
Nonparametric sequential estimation 505
population counterpart is
tx = fe J(F(x))g(x) d F ( x ) . (3.23)
Let then
and, in (2.58), we replace the X , : i by g(Xn:i) and denote the resulting statistic
by Or^2,(L).Then, under fairly general regularity conditions on J(.) and g(.), as
/'1,--->00
If we let b(u) = g(F-l(u)), 0 < u < 1, assume that b(u) is of bounded varia-
tion on [e, 1 - e] V 0 < e <, and that some generic constant K ( 0 < k <o~), for
every 0 < u < 1,
all the properties mentioned before (3.9) hold and also (3.9)-(3.11) hold with
ha, defined by (3.27). For details of the proofs, we may refer to Sen (1981,
Chapter 10). Whereas, in the rank statistics case, we use distribution-free
506 Pranab K u m a r Sen
confidence intervals to define the stopping number, in this case, (3.25) provides
only an asymptotically distribution-free (ADF) setup.
We consider next the sequential M-procedure. Let {X~, i/> 1} be a sequence
of independent r.v. with d.f.'s {F~, i ~> 1}, where
the c,- are known regression constants (not all equal to 0), a is an unknown
parameter and the d.f. F satisfies the regularity conditions of Section 2. The
location model is a special case where ci = 1 Vi ~> 1. Analogous to (2.37), we
define here
where the score function sc is monotone, skew-symmetric and satisfies the same
regularity conditions as in after (2.39). Parallel to (2.38), we define here
Further, we write C~, = E~'=l c 2, and for some prefixed a (0 < a < 1), we define
zi (M)
L,n = sup{t: W . ( t ) > C.S.(M)r~/2} (3.35)
zl (~t)
U,n = inf{t: W~(t) < - C~S.(M)T~/2} " (3.36)
Then, following Jure~kovfi and Sen (1981b), we may consider the stopping
number
where no is some initial sample size, and the (sequential) confidence interval for
^ (M) ^ (M)
A Is then (A L,N~(d)' A U~NM(d)),
We assume "ihat o'~o, defined by (2.41) is finite and strictly positive and
defining s as in (2.40) (but, without necessarily assuming that ~: is a constant
outside a compact interval), we further assume that (i) f (sq(x)) 2 dF(x) < ~, (ii)
as t ~ 0, f e {~(x + t)-~[(x)}2dF(x)--)O, (iii) at the points of jumps of ~2, fit is
bounded and (iv) max{c~/C~: 1~< i ~< n}->0, as n~oo. Then, it follows from
Nonparametric sequential estimation 507
Jure~kovfi and Sen (1981b) that for this sequential M-procedure, the properties
mentioned before (3.9) hold, and also (3.9)-(3.11) hold with n~ defined by
where the unknown parameter & is defined as in (2.64). We define our stopping
variable by
where the ( x i - / z ) 2 i.i.d.r.v, with mean o-2. Hence, in this case, we may proceed
as in Theorem 10.2.1 of Sen (1981) and show that E N ( d ) < ~ Vd > 0 and
EN(d)/na ~ 1 as d $ 0. However, for general m >~ 1, this may require a slightly
more stringent condition that E{sup,~, 0Sz.}<% and for this it suffices to
assume that E{~b21og 4)2}< ~ or E[4~[r < ~ for some r > 2. Under either con-
dition, for Nu(d), the properties listed before (3.9) hold, and also (3.9)-(3.11)
hold with rid, defined by (3.40).
A natural generalization of this problem is the sequential confidence region
for 0, a vector of unknown parameters, where instead of (3.1), we need to
construct a closed (and possibly convex) region I, such that P{O ~ In} > 1 - a,
and instead of (3.2), we like to have the property that the maximum diameter
of I, is ~<2d, for some d > 0. These are discussed in detail in Section 10.2.5 of
508 PranabKumarSen
Sen (1981). The (joint) asymptotic normality of the estimates of 0 and the
strong consistency property of their variance-covariance estimators are used in
this context. Basically, in (3.41) (or in other appropriate places), S 2 is to be
replaced by the largest characteristic root of Sn, the estimated covariance
matrix, and ~']/2 by X2,, the upper 100c~ % point of the chi square distribution
with r degrees of freedom, where r is the dimension of 0. The regularity
conditions are essentially the same. For some specific problems of special
interest, we may refer to Ghosh and Sen (1973) and Sen and Ghosh (1973b).
For both the sequential point and interval procedures in Sections 2 and 3, a
variety of stopping numbers has been considered. In the point estimation
problem, the main emphasis has been laid on (2.14), while in the interval
estimation problem, the main theme was to show that Nd/nd ~ 1 a.s. or in 1st
mean, as d $ 0. These may be regarded as the first order asymptotic efficiency
results of the sequential procedures. In (2.15)-(2.16) we have sketched the
second order asymptotic efficiency results in the context of the sequential point
estimation problem. One of the problems with the sequential confidence
intervals is that the procedures considered may not satisfy (3.1); we have only
the asymptotic equality (to 1 - a ) as d ~, 0. Thus, it may be quite appropriate
to put this question: for any given d (>0), it is possible to have a procedure for
which (3.1) holds and if so, then what is the order of magnitude of E N d - nd?
For normal population, this problem has been considered by Simons (1968) and
others. For the nonparametric problems, though some studies have been made
on ENd - na, a complete or satisfactory answer to this question is still unavail-
able. However, in the majority of the cases, (2.17) or its parallel form in the
interval estimation problem has been studied under suitable regularity con-
ditions.
For U-statistics or related von Mises' functionals, the asymptotic normality
of the stopping time in (2.17), for Nc defined by (2.67), has already been
considered in (2.70)-(2.71). For the sequential interval estimation problem, for
Nv(d) in (3.41), the same result holds whenever Eq~4< ~.
For the sequential M-procedures, for both the point and interval estimation
problems, the asymptotic normality of the stopping time has been studied by
Jure(zkovfi and Sen (1981b, 1982). It has been observed that in either case,
rt~a(Nd - - r i d ) (or nca(Nc - nc) ) has asymptotically a normal distribution, under
quite general regularity conditions, where referred to (2.40),
Consider the sequential point estimation problem first. Let T be the set of
sequences {T,} of estimators which are asymptotically normally distributed
such that the minimum risk Rn~(c) in (2.11) exists and satisfies
and this agrees with the conventional measure of A.R.E. in the nonsequential
case. Also, note that for any asymptotically unbiased and normally distributed
510 Pranab Kumar Sen
estimator {T,},
where F ( E ~ ) is the true d.f. Therefore, for two competing sequential interval
procedures (corresponding to a common d (>0) and a coverage probability
1 - a), equating the expected sample sizes (up to the ratio being asymptotically
(as d $ 0) equal to l), we arrive at the same measure of A.R.E. as in (5.4).
Hence, what has been discussed following (5.4) also pertains to the confidence
interval problem.
In the sequential point estimation problem, the minimum risk as well as the
expected sample size depend very much on the form of the loss function (viz.,
g(x) and c(n) in (2.1)). For example, if instead of g(x)= x 2, we choose
g(x) = Ixl, then in (2.10)-(2.11) we would have for small c (>0),
positive quantity, g(x) = O-tlxl or O-2x 2 has also been some other workers (viz.,
Chow and Martinsek, 1982). It seems desirable to work out the general case
with some bowl-shaped loss function.
Mukhopadhyay (1980) has shown that for the normal mean problem, a
Stein-type two-stage procedure where the initial sample size no (= n0c or nod)
depends on c (or d), such that as c (or d) $ 0, n0c ~ oo but cmno~ ~ 0 (or nod ~
but d2nod~O) has also the first order asymptotic efficiency in (2.14) or (3.11),
though it does not have the second order efficiency (i.e., finite regret as c
(or d) $ 0). The characteristic is shared by two-stage nonparametric procedures
too. This raises the question whether or not the Second order efficiency can be
attained by a three or multi-stage procedure. If the answer is in the affirmative,
then much of the labour involved in a genuine sequential procedure may be
avoided by adapting a multi-stage or group-sequential procedure.
Mostly relating to the parametric cases, some attempts have been made to
obtain some asymptotic expansions for E N ~ - n o (or E N d - nd), so that some
idea of the regrets may be gathered. However, in most of the nonparametric
problems, one encounters nonlinear statistics, and such expansions may be
quite involved. More work is needed in this area. For both the point and
interval estimation problems, in the nonparametric case, the theory has mostly
been justified on an asymptotic ground where c (or d) is made to converge to
0. Though these approximations work out quite well for small values of c or d,
they may depend on the statistics used and also on the underlying distributions.
Therefore, there remains good scope for numerical studies on the adequacy of
the asymptotic theory for moderate values of c or d.
For both the problems in Sections 2 and 3, the sequential procedures are
based on some well defined stopping times. In some situations, one may face
the problem of providing a confidence sequence for a parameter, where there
may be any role for a stopping number. W e may conceive of a sequence
{Xi; i i> 1} of independent r.v.'s defined on a common probability space, and we
desire to form a sequence {J,} of (confidence) intervals, such that for some
parameter 0 and positive integer m,
where {e,} is so chosen that under H0:0 = 0, [Sn] <~ Aconl/2en a.s., as n ~ ~. We
may for example, let en = (2 log log n) m, for which the above holds, though in
(6.2), a precise order of a,, (even for large m) may be difficult to obtain.
512 Pranab Kumar Sen
References
Anscombe, F. J. (1952). Large sample theory of sequential estimation. Proc. Cambridge Phil. Soc.
48, 600-607.
Carroll, R. J. (1977). On the asymptotic normality of stopping times based on robust estimators.
Sankhygt, Ser. A 39, 355-377.
Chatterjee, S. K. (1977). Sequential inference procedures of Stein's type for a class of multivariate
regression problems. Ann. Math. Statist. 33, 1039-1064.
Chow, Y. S. and Martinsek, A. T. (1982). Bounded regret of a sequential procedure for estimation
of the mean. Ann. Statist. 10, 909-914.
Chow, Y. S. and Robbins, H. (1965). On the asymptotic theory of fixed-width sequential confidence
intervals for the mean. Ann. Math. Statist. 36, 457-462.
Chow, Y. S. and Yu, K. F. (1981). The performance of a sequential procedure for the estimation of
the mean. Ann. Statist. 9, 184-188.
Dantzig, G. B. (1940). On the non-existence of tests of "Student's hypothesis" having power
function independent of o'. Ann. Math. Statist. 11, 186-192.
Darling, D. A. and Robbins, H. (1967). Confidence sequences for mean, variances and median.
Proc. Nat. Acad. Soc. USA 58, 66-68.
Gardiner, J. C. and Sen, P. K. (1979). Asymptotic normality of a variance estimator of a linear
combination of a function of order statistics. Zeit. Wahrsch. Verw. Geb. 50, 205-221.
Geertsema, J. C. (1970). Sequential confidence intervals based on rank tests. Ann. Math. Statist. 41,
1016-1026.
Ghosh, M. and Mukhopadhyay, N. (1979). Sequential point estimation of the mean when the
distribution is unspecified. Comm. Statist: Th. Methods, Ser. A. 8, 637-652.
Ghosh, M. and Mukhopadhyay, N. (1981). Consistency and asymptotic efficiency of two-stage and
sequential estimation procedures. Sankhyd Ser. A . 43, 220-227.
Ghosh, M. and Sen, P. K. (1971). Sequential confidence intervals for the regression coefficient
based on Kendall's tall. Calcutta Statist. Assoc. Bull. 20, 23-36.
Ghosh, M. and Sen, P. K. (1972). On bounded length confidence intervals for the regression
coefficient based on a class of rank statistics. Sankhy& Ser. A 34, 33-52.
Ghosh, M. and Sen, P. K. (1973). On some sequential simultaneous confidence intervals pro-
cedures. Ann. lnst. Statist. Math. 25, 123-134.
Ghosh, M., Sinha, P. K. and Mukhopadhyay, NI (1976). Multivariate sequential point
estimation. J. Multiv. Anal. 6, 281-294.
Nonparametric sequential estimation 513
Gleser, L. J. (1965). On the asymptotic theory of fixed-size sequential confidence bounds for linear
regression parameters. Ann. Math. Statist. 36, 463-467.
Hoeffding, W. (1948). A class of statistics with asymptotically normal distribution. Ann. Math.
Statist. 19, 293-325.
Huber, P. J. (1964). Robust estimation of a location parameter. Ann. Math. Statist. 35, 73-101.
Huber, P. J. (1981). Robust Statistics. Wiley, New York.
Hugkovfi, M. (1982). On bounded length sequential confidence interval for parameters in regression
model based on ranks. Coll. Nonparametric Infer., Janos Bolyai Math. Soc. 32, 435--463.
Jure~kov/t, J. (1977). Asymptotic relations of M-estimates and R-estimates in linear regression
models. Ann. Statist. 5, 664-672.
Jure~kovfi, J. and Sen, P. K. (1981a). Invariance principles for some stochastic processes related to
M-estimators and their role in sequential statistical inference. Sankhy& Set. A. 43, 190-210.
Jure~kovfi, J. and Sen, P. K. (1981b). Sequential procedures based on M-estimators with dis-
continuous score functions. J. Statist. Plann. Inference 5, 253-266.
Jure~kovgt, J. and Sen, P. K. (1982). M-estimators and L-estimators of location: Uniform inter-
grability and asymptotically risk-efficient sequential versions. Sequen. Anal. 1, 27-56.
Koul, H. L. (1969). Asymptotic behaviour of Wilcoxon type confidence regions in multiple
regression. Ann. Math. Statist. 40, 1950-1979.
Lai, T. L. and Siegmund, D. (1977). A nonlinear renewal theory with applications to sequential
analysis. Ann. Statist. 5, 946-954.
Lai, T. L. and Siegmund, D. (1979). A nonlinear renewal theory with applications to sequential
analysis, II. Ann. Statist. 7, 60-76.
Miller, R. G. Jr. and Sen, P. K. (1972). Weak convergence of U-statistics and von Mises'
differentiable statistical functions. Ann. Math. Statist. 43, 31-41.
Mukhopadhyay, N. (1980). Consistent and asymptotically efficient two-stage procedure to construct
fixed-width confidence interval for the mean. Mertika 27, 281-284.
Robbins, H. (1959). Sequential estimation of the mean of a normal population. In: Probability and
Statistics (H. Cramef vol.). Almquist and Wicksell, Uppsala, pp. 235-245.
Sen, P. K. (1960). On some convergence properties of U-statistics. Calcutta Statist. Assoc. Bull. 10,
1-18.
Sen, P. K. (1977). Some invariance principles relating to jackknifing and their role in sequential
analysis. Ann. Statist. 5, 319-329.
Sen, P. K. (1978). An invariance principle for linear combinations of order statistics. Z. Wahrsch.
Verw. Geb. 42, 327-340.
Sen, P. K. (1980a). Nonparametric Simultaneous inference for some MANOVA models. In: P. R.
Krishnaiah, ed., Handbook of Statistics, Vol. 1. North-Holland, Amsterdam, pp. 673-702.
Sen, P. K. (1980b). On nonparametric sequential point estimation of location based on general rank
statistics. Sankhygt Set. A 42, 201-220.
Sen, P. K. (1981). Sequential Nonparametrics: Invariance principles and statistical inference. Wiley,
New York.
Sen, P. K. (1983). Sequential R-estimation of location in the general Behrens-Fisher model. Sequen.
Anal. 2, 311-335.
Sen, P. K. (1984). On sequential nonparametric estimation of multivariate location. Proc. Third Prague
Conf. Asymp. Meth., to appear.
Sen, P. K. and Ghosh, M. (1971). On bounded length sequential confidence intervals based on
one-sample rank order statistics. Ann. Math. Statist. 42, 189-203.
Sen, P. K. and Ghosh, M. (1973a). A law of iterated logarithm for one-sample rank order statistics
and some applications. Ann. Statist. 1, 568-576.
Sen, P. K. and Ghosh, M. (1973b). Asymptotic properties of some sequential nonparametric
estimators in some multivariate linear models. In: P. R. Krishnaiah, ed., Multivariate Analysis
III. Academic Press, New York, pp. 299-316.
Sen, P. K. and Ghosh, M. (1981). Sequential point estimation of estimable parameters based on
U-statistics. Sankhy~ Set. A 43.
514 Pranab Kumar Sen
Simons, G. (1978). The cost of not knowing the variance when making a fixed-width confidence
interval for the mean. Ann. Math. Statist. 39, 1946-1952.
Sproule, R. N. (1974). Asymptotic properties of U-statistics. Trans. Amer. Math. Soc. 199, 55-64.
Starr, N. (1966). On the asymptotic efficiency of a sequential point estimation. Ann. Math. Statist.
37, 1173-1185.
Starr, N. and Woodroofe, M. (1969). Remarks on sequential point estimation. Proc. Nat. Acad. Sci.
USA 63, 285-288.
Starr, N. and Woodroofe, M. (1972). Further remarks on sequential point estimation: the exponen-
tial case. Ann. Math. Statist. 43, 1147-1154.
Stein, C. (1945). A two-sample test for a linear hypothesis whose power function is independent of
~r. Ann. Math. Statist. 16, 243-258.
Williams, G. W. and Sen, P. K. (1973). Asymptotically optimal sequential estimation of regular
functionals of several distributions based on generalized U-statistics. J. Multivar. A n a l 3,
469-482.
Williams, G. W. and Sen, P. K. (1974). On bounded maximum width sequential confidence
ellipsoids based on generalized U-statistics. J. Multivar. A n a l 4, 453-468.
Woodroofe, M. (1977). Second order approximations for sequential point and interval estimation.
Ann. Statist. 5, 984-995.
Woodroofe, M. (1982). Nonlinear Renewal Theory in Sequential Analysis. S.I.A.M. Publication,
Philadelphia.
P. R. Kfishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 e),~
J..4 ~.J
Elsevier Science Publishers (1984) 515-529
Stochastic Approximation
Vdclav Dupa6
x,+, = x , - (M'(x,))-'M(x,).
Xt+l = X t q- a M ( x t ) , (1.1)
where a is a constant.
Assume that the graph of the function M lies between two lines of negative
slopes, passing through the point [0; 0]. M o r e precisely, assume that
for all x ~ R and some positive constants K1,/2. Then the approximations xt
determined by (1.1) tend to the solution 0 for every initial value x l E R ,
provided that 0 < a <2/K2. The rate of convergence is at least that of a
geometrical sequence with the quotient
q = m a x ( 1 - aKl, a K 2 - 1).
Of course, the same holds true if the graph of M lies between two lines of
positive slopes, and if the plus in (1.1) is replaced by a minus. An analogous
remark applies throughout the paper.
515
516 V{tclav Dupa6
3. Convergence theorems
further assume
further assume
The rate of mean square convergence can be then found, under the assump-
tions of Theorem 3.3:
Hence the choice a = 1 and 2 K a > 1 is optimal from this point of view.
The following result, due to R6v6sz (1973), concerns the rate of decrease of
large deviation probabilities: Assume
Then we have
5. Asymptotic normality
THEOREM 5.1 (Fabian, 1968). Let the sequence Xt, t >->_1, defined by (3.1) with
at = at -1, tend to 0 with probability 1. Additionally assume
a2~2
0,2--~a_--i). (5.9)
Conditions ensuring the asymptotic normality of the Xt's are not very severe;
they are the differentiability of the regression function at 0 and a sort of
boundedness and continuity of the covariance matrix of the errors, together
with the usual Lindeberg-type condition.
6. An adaptive procedure
1 ~Y~-Y';
Wt = t - 1 2ci ' (6.1)
i=1
(We use the symbol [y]d for max(c, min(y, d)); c < d.) The modified estimate is
finally inserted in the iteration scheme,
x,+l=x,+~Y,.
1
(6.3)
there exists an everywhere continuous and bounded second derivative M"(x), and
Apparently, the latter asymptotic variance is half its minimal possible value.
The explanation is that the adaptive procedure needs twice as many obser-
vations as the standard Robbins-Monro procedure. So we have to compare the
precision of the t-th approximation of the adaptive procedure and the 2t-th
approximation of the standard procedure; both of them coincide then.
Recently, Lai and Robbins (1979) have shown that at least for identically
distributed random errors, there is no need of two observations at each step for
estimating M'(O); one observation at each step suffices, as well as the standard
least squares estimate based on all observations up to the current one.
7. Asymptotic efficiency
At the same time, we consider only such transformations g, for which both the
transformed regression function
Xt+l = X t + t g ( Y t ) ,
where Yt = M(Xt) + e(t + 1), with 2Kga > 1, retains the convergence property
limt~=Xt = 0 with probability 1, and tl/a(Xt - O) is asymptotically normally
distributed with parameters (0, a2tr2/(2mga- 1)). The asymptotic variance can
again be minimized by choosing a = 1/mg or by making use of the adaptive
procedure, the minimal value (for fixed g) of the asymptotic variance being
o- 2g / m 2g.
If the function go = - f ' / f belongs to the class ~q, then go realizes the
ming~o-2/m 2, which equals 1/(m2I(f)); Anbar (1973).
The above property of the g0-transformed Robbins-Monro procedure can be
called its asymptotic efficiency, in the following sense: If the regression function
is linear, M(x) = - m ( x - 0), then 1/(m2I(f)) is exactly the Cramrr-Rao lower
bound for variances of regular unbiased estimates of 0.
Stochastic approximation 523
If, for instance, the underlying density f is that of the normal (0, 0 "2)
distribution, then the optimal transformation go is the identity and the cor-
responding minimal asymptotic variance is 0-2/m2. If f is the density of the
double exponential distribution, again with 0 mean and variance 0-2, then the
transformation go(y) = sgn y is optimal, the corresponding minimal asymptotic
variance being 0-2/(2m2).
If the density f is specified only as a member of the class of e-contaminated
normal distributions, then the optimal transformation (this time in the minimax
sense with respect to the mentioned class) is
y for lyl ~ K ,
go(Y) = sgn y for [y[ > K ,
where lit = M(Xt) + e(t + 1, Xt). Then we have limt_,~.X, = 0 with probability 1.
524 V6clav Dupo2
Also the mean square convergence and the asymptotic normality hold true
for the procedure (8.3) under the same additional assumptions as in Sections 3
and 5.
Calculating projections at each step of the procedure might be uncomfort-
able. For other possibilities see Nevel'son and Has'minskil (1972/76, Chapter
7), Dupa~ and Fiala (1983).
Xt+I = X * + a , Y * , (9.1)
where
X* = (1 + ~)Xt,
1 Y* = M ( t + 1, X * ) + e(t + 1, X*) (9.2)
( M ( x + cei) - M ( x - ce,))/(2c) ,
Xt+l = X~ + a, Yt,
THEOREM 10.1 Let the second order derivatives of M exist and be (globally)
Lipschitz continuous. Let there exist a positive definite matrix C such that
further assume
t . 1 t
lim 1 ~ eA(k) = O, h m - ~ eb(k)= O, (11.1)
t-~ t k= 1 t~ t k= 1
E(~/jT)x = E(r/~:),
provided that the covariance matrix of the random vector ~ is positive definite
and that Er/2< +~. Denoting
-E(~{~ T) = A , -E(rl~) = b,
_ ~ T = A(t), -~7~ = b(t),
528 Vdclav Dupa~
1
X t + 1 -~- X t - ? ( ~ t ~ T X t - T]t~t) (11.6)
1
X,+l = x , - ? ( ( x , , ~,)- ~,)~,,
so that we don't actually need matrix multiplication when using the iterative
formula.
References
Anbar, D. (1973). On optimal estimation methods using stochastic approximation procedures. Ann.
Statist. 1, 1175-1184.
Dupa~, V. (1957). On the Kiefer-Wolfowitz approximation method. (In Czech). t~asopis P~st.
matem. 82, 47-75.
Dupa~, V. (1965). A dynamic stochastic approximation. Ann. Math. Statist. 36, 1695-1702.
Dupa~, V. and Fiala T. (1983). Stochastic approximation on a bounded convex set. In: Proceedings
of the Conference "Mathematical Learning Models-Theory and Algorithms", Bad Honnef, May
3-7, 1982. Lecture Notes in Statistics 20. Springer, Berlin, pp. 26--32.
Fabian, V. (1968). On asymptotic normality in stochastic approximation. Ann. Math. Statist.
39, 1327-1332.
Gy6rfi, L. (1980). Stochastic approximation from ergodic sample for linear regression. Z. Wahrsch.
Verw. Geb. 54, 47-55.
Kiefer, J. and Wolfowitz, J. (1952). Stochastic estimation of the maximum of a regression function.
Ann. Math. Statist. 23, 462--466.
Lai, T. L. and Robbins, H. (1979). Adaptive design and stochastic approximation. Ann. Statist. 7,
1196-1221.
Ljung, L. (1977). Analysis of recursive stochastic algorithms. I E E E Trans. Aurora. Control 22,
551-575.
Nevel'son, M. B. and Has'minskii, R. Z. (1972/76). Stochasti~eskaja Approksimacija i Rekurrentnoe
Ocenivanie. Nauka, Moscow. English translation: Stochastic Approximation and Recursive Esti-
mation. Translations of Math. Monographs, vol. 47, American Math. Society, Providence.
Nevel'son, M. B. and Has'minskii, R. Z. (1973). An adaptive Robbins-Monro procedure. Aurora.
Rem. Contr. 34, 1594--1607.
Pantel, M. (1979). Adaptive Verfahren der stochastischen Approximation. Dept. of Mathem. Univ.
of Essen. Thesis.
Stochastic approximation 529
R6v6sz, P. (1973). Robbins-Monro procedure in a Hilbert space and its application in the theory of
learning processes I. Stud. Sci. Math. Hungar. 8, 351-398.
Robbins, H. and Monro, S. (1951). A stochastic approximation method. Ann. Math. Statist. 22,
400-407.
Ruppert, D. (1979). A new dynamic stochastic approximation procedure. Ann. Statist. 7, 1179-
1195.
Venter, J. H. (1967). An extension of the Robbins-Monro procedure. Ann. Math. Statist. 38,
181-190.
Wasan, M. T. (1969). Stochastic Approximation. Cambridge Univ. Press.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 ~')h
Elsevier Science Publishers (1984) 531-549
Density Estimation
P. R~v~sz
1. Introduction
Let X~, 322,... be a sequence of i.i.d.r.v.'s with density function f(x). Let
K,(a, b) be the n u m b e r of elements of the sample lying in the interval [a, b),
i.e.
K,(a, b) = ~ I[a,b)(Xi)
i=1
where
{~ ifa<~x<b'
I[a'b)(X ) = otherwise,
is the indicator function of the interval [a, b). Then probability P(a <~X~ < b)
= fb, f ( t ) d t can be estimated by the relative frequency n-lK,(a, b) (if n is big
enough) and the value of a continuous f ( x ) (a <- x < b) can be estimated by
(b - a) -1 fba f(t) dt (if the interval is short enough), i.e.
(Here the sign - does not stand for any precise mathematical statement; it only
indicates an intuitive near equality.)
H e n c e an empirical density function f , ( x ) in the interval [a, b) can be defined
as
first error is small if the interval is short while the second one is small if the
interval [a, b) contains a large enough number of elements of the sample, that
is when [a, b) is not too short. Hence one of the main problems is how to find a
good compromise between these two opposing tendencies.
Now we give a very general definition of the empirical density function and
later on we show how the most frequently used empirical densities can be
obtained from this general concept.
GENERAL DEFINITION (F61des and R6v6sz, 1974; Walter and Blum, 1979). Sup-
pose that f ( x ) is vanishing outside the interval -oo~< C < D <~ +oo and let
~ = {~Ok(X,y)} be a sequence of Borel-measurable functions defined on the
square (A, B) 2 where (C, D ) C (A, B). Then an empirical density function can
be defined as follows:
i=1
" <x_,(n)<xo(n)<xl(n)<...
x i + l ( n ) - xi(n) = h, (i = O, +_1, + - 2 , . . . )
if "~i
"~(")~<
~ X ~"
~ Ac(")
, i + 1 where f ~ ) is defined by (3).
3. Kernel-type empirical density (Rosenblatt 1956; Parzen 1962). Let h ( x ) b e
an arbitrary density function vanishing outside an interval (A, B) D (C, D). Let
{h,} be a sequence of positive numbers tending to 0 as n ~ oo and define 0 by
The density function h of this definition is called kernel or window and h, is called
window-width.
4. Orthogonal expansion ((~encov 1962; van Ryzin 1966; Schwartz 1967). Let
= {(pk(x)}~=l be a complete orthonormal sequence defined on (A, B) and
define @ by
In
,.(x, y) = ~; ,pj(x),pj(y)
j=l
Now we turn to some further widely used definitions of the empirical density
function which are not special cases of the general definition (1).
5. A sequential definition. By a little modification of the kernel-type
definition of the empirical density Wolverton and Wagner (1969) (see also
Rejt6 and R6v6sz, 1973) introduced the following definition:
xi(n):Xi*b+l (i = 0, 1,2 . . . . . [ _ ~ 1 ])
where X T < X ~ < . . . < X * are the ordered statistics of the sample
X1, X2. . . . . X,, and {b,} is an increasing sequence of positive integers tending
to infinity. Then the empirical density function can be defined by
if *
X ibn+l * l)bn+ 1 ,
~ X < X(i+
L(x) = t b.
+l- Xi*
0 if x < X1 or x ~ X(t(,_~)/b,]+~)o" .
where x ( - ~ < x < +~) is any fixed real number and fn is any empirical density.
A much more sophisticated and useful problem is the study of the global
distances. Among the possible global distances we present here the three most
popular ones.
2. Uniform distance
3. L2-distance
~f] ~.(x) dx = vn
4. Ll-distance
+~ ~,(x) dx = w,.
Hence our problem is to minimize these distances. Since so,, u,, v,, w, are
random variables we have several different possibilities to give a precise
meaning of the expresion 'to minimize'. A possible way to minimize the
1. Second moment. Et2, where t, can be equal to any of so,, u,, v,, w,.
From a practical point of view one of the most important questions is to
find
2. Limit theorems. Here the question is to find numerical sequences {a,} and
{b,} (b, > 0) such that the sequence {b~l(t, - a,)} should have a nondegenerate
limit distribution. (Here t, is again any of so,, un, v,, w,). In this case the
goodness of the estimator f, can be characterized by the largeness of b,. If b, is
'small' the estimator f, is 'close' to f.
Largely from the theoretical point of view there is importance attached to
3. Strong theorems. Here the problem is to find numerical sequences {an}
536 P. Rdv~sz
and {b~} (b~ > 0 ) such that one of the following relation should hold with
probability one:
lim b:l(tn - an) = 0, limsup b:l(tn - an) --- 1, lim b)l(t~ - an) = 1.
n--~ n.-~ n--~
f ( x ) - (b - a ) -1 f ( t ) dt = f ( x ) - E f n ( x ) ,
Eft(x)
is the random error. The study of its properties requires the rich apparatus of
the limit theorems of probability theory. In general the 'distance' between f
and fn is estimated by the sum of the distances between f and Efn and between
Efn and fn.
As an example in case of the General Definition of the Introduction we
present some details of the evaluation of the bias. Clearly in this case
f(x) - O,(x, y ) d F ( y ) = f ( x ) -
f2 ~b,(x, y ) f ( y ) dy.
This difference can be evaluated by the method of singular integrals (see e.g.,
Density estimation 537
Alexits, 1961). O n e can easily see that the concrete examples (given in the
Introduction) of the G e n e r a l Definition p r o d u c e asymptotically unbiased esti-
mators u n d e r very mild conditions
A p p l y i n g this general m e t h o d in s o m e concrete cases the following results
can be obtained:
1. Histogram (R6v6sz, 1972). A s s u m e that f(x) is uniformly continuous on an
interval -o~ <~ A < B <~ + ~ . T h e n for any e > 0 we have
(ii) / xX (x) dx = 0 .
J-=
Also assume that f(x) has a b o u n d e d second derivative on an interval -oo ~<
A < B ~< + ~ . T h e n for any e > 0 we have
sup I E L - f l : O(l:3).
A p p l y i n g special but not very different m e t h o d s o n e can see that the result
f o r m u l a t e d for the kernel-type empirical density remains true in the same f o r m
in the case of o u r sequential definition.
4. N e g a t i v e r e s u l t s
In the case of density estimation one cannot obtain such a complete result. This
fact is shown by the following example; let the density function f(x) be defined
as follows:
N o t e that for any n the sample )(1, X2, . , Xn belongs to a finite interval and
that for any adequate definition of the empirical density fn (x), liml+~= f, (x) = 0 a.s.
(for any fixed n). Therefore we obtain
The reader can see that in this example the reason for the nonconsistency is the
bias, not the random error. Realizing that the reason for the bias is the
irregular (nonsmooth) behaviour of the density function one cannot hope that
without a condition about the smoothness of f ( x ) one can find a uniformly
consistent estimator.
A m o r e surprising negative example was given by D e v r o y e (1982). H e has
shown that for any sequence {f,} of empirical densities and for any sequence
{an} of positive numbers tending to 0, among the relatively smooth densities
one can find an f such that
5. O n the r a n d o m error
After studying the bias Efn - f we have to study the r a n d o m error fn - Efn..
We list the known results in the most important cases.
Density estimation 539
O 1
E f t . - E l . ) a = O(1.n-~).
6. T h e m e a n - s q u a r e error
V~ = E ( f . - f)2 = O ( h 2 + n-~ ) ( A + e ~ x ~ B - e) .
V.=E(f~-f)2=O(h4+n~) (A+e<~x<~B-e).
4. Orthogonal expansion
Having these formulas one can see that V, takes its smallest possible value if
h , - n -1/3 in c a s e 1 ,
h , - n -1/5 in cases 2 and 3,
1, ~ n TM in case 4
540 P. R~v~sz
where the sign - means that the corresponding terms are equal to each other
asymptotically.
Making use of these choices we get
12 = E f t , - f)z dx
Then, the cell width h, which minimizes I, is o!n -1/3 + O(n-m), and at such h,'s,
12 = fin-2/3 + O(n-X).
Density estimation 541
In the practical application of this result we have again the problem that the
value of K is unknown, unless we estimate the value of fg__(f"(x))2dx from the
sample.
Another problem which comes into the picture is the optimal choice of A.
Epanechnikov (1969) proved that we can minimize i2 by the choice
if Ixl ~ 5112,
otherwise.
8. Limit theorems
where N(O, 1) is the standard normal law and A z= J'~ AZ(x)dx. Similar results
can be obtained using other definitions of the empirical density.
A much more complicated problem is to evaluate the limit distributions of
the different global distances. In order to obtain such theorems in general we
prove that the process f,(x)-f(x) (or its normalized version) can be ap-
proximated by a sequence of suitable Gaussian processes. To evaluate the limit
distributions of the corresponding functionals of these Gaussian processes is
somewhat easier.
At the first stage we give results for the L 2 distance v, = f c ( f , ( x ) - / ( x ) ) 2 dx.
8.a. Limit theorems of v, (Cs6rg6 and R6v6sz, 1981). H e r e we only mention
some results regarding to a few concrete definitions of the Introduction.
1. Histogram. Suppose that f vanishes outside a finite interval - o o < C <
D < + ~ and has a bounded derivative there. Then
h. fD
Jc f2(x)dx)
\-1/2
[nh,v,
__
I].~N(O,1)
provided that n-lhn 3/2log2n ~ 0 and hS/2n-->O. For example if h, = n -~ then it
is assumed that 2 < a < 2.
3. Kernel-type empirical density (Bickel and Rosenblatt, 1973). Suppose that
f(x) vanishes outside a finite interval -o~ < C < D < +oo and has a bounded
second derivative there. Also assume that h vanishes outside a finite interval
(A, B) with varxe(A,m A(X) = Const. and-fAB xA(x) dx = 0. Then
h~l/2o--l[nh,v, - A21--~
~ N(0, 1)
where
O'2---f~f2(x)dx fBa ( f ; A(x + y)A(x)dx)2dY
and
A 2= h2(x) dx
1)
l i m p { f , ( x ) - { A " - f ~ ] v2 ( z )
,~ \ nh, ] \(2a log n) ~/~ ~"d, <~f(x)
{An_~.~l/2( z + ) }
<~f,(x)+\ nhn ] \ ( 2 a l o g n ) v2 d. f o r a l l 0 ~ x ~ < l
= e x p ( - 2e -z ) ,
6.a. Nearest neighbour idea in the kernel-type definition (Cs6rg~ and R6v6sz,
1982). Let k, = [n"] with < a < 4 and define the r.v.'s
where XT < X~ < - < X * are the order statistics of the sample X1, X2 . . . . . X,
and
where
b(u) = (2 log u) 1/2+ (2 log u) -1/2 log(2~r)-~/212--~-1~ ~AB A(x)A"(x) dx]l/2.
REMARK. In the practical application of the results of this Section, the hardest
problem is the choice of the parameters of h, or l,. In Section 5 we presented
some ideas about the optimal choice of these parameters. Here we must realize
that the theorems of the present section are not valid when the parameter of
the estimator is chosen in the optimal way. Hence applying these results we can
get a confidence band only whenever our estimator is certainly not the optimal
one and we cannot evaluate the confidence band in the case when the estimator
is as close to the unknown f as it is possible.
Beside the two special definitions studied in this Section, similar theorems
can be obtained using other definitions. The most widely investigated case is
'that of the histogram (Smirnov, 1944; Tumanjan, 1955; Woodroofe, 1967;
R6v6sz, 1972). The best result in the histogram-case was obtained by Freedman
and Diaconis (1981). They studied the limiting joint distribution of the location
and size of the maximum deviation between the histogram and the underlying
density.
9. Strong theorems
Under very mild regularity conditions one can prove that the estimators
introduced in Section 1 are uniformly strongly consistent. That is, a number of
theorems say that
log4 n -o 0, nh~
nh. ,7 % h. ",a O, nhl log h~ 1 - - - ' - ~ 0
log h~ 1
.
( nh ) ~/2
limsup 2 log log n If, (x) - f(x)] = C(f(x)) m
where the constant C depends on the underlying definition. For example using
the kernel-type empirical density C = (J")t2(x) dx) m.
10. M u l t i v a r i a t e d e n s i t y
i=1
Assume that f(x) is vanishing outside a bounded, convex open set ~. Let
546 P. Rdv~sz
where
W (x) = { u : Ix - ul
( I x - u ] is the Euclidean distance). Assume further that there exist e > 0 and
,~ > 0 such that
f(x) >i ~3 if x E 5~
l[nhL 2
N o,
where
o2= 2 f ( f A(x+ y)A(x)dx)2dy f ~(x)dx
Mises limit theorems will not be valid anymore. However, most of the limit
theorems of Section 8 remain valid after the mentioned replacing (Bickel and
Rosenblatt, 1973). Hence the results of Section 8 can be used for composite
goodness-of-fit problems.
(4) Some parameters, especially the mode of the distribution can be easily
estimated via the empirical density. In fact the mode of the empirical density is
a good estimator of the real mode (Parzen, 1962; Chernoff, 1964; Eddy, 1982).
Po = qa j G f a ( x ) d x + q 2 ~o'f2(x)dx.
In the case when qa, fa, f2 are unknown, instead of Do(x) one can use the
decision function
where /x, = pa + p2 + ' ' " + Pn, f]n) (resp. f(2")) is an empirical density function
based on the nonzero elements of the sequence {piX(i)}i"=l (resp.
{(1 - P-i ]~"ex
rc(z)~(,)
i J i = l /~
" Using the decision function D (") instead of D Othe probability of
misclassification P, will be larger than Po. However, one can prove (Wolverton
and Wagner, 1969; Rejt6 and R6v6sz, 1973) that the probability that P, is
much larger than P0 is very small if n is big enough.
548 P. R~v~sz
References
Silverman, B. W. (1978). Weak and strong uniform consistency of the kernel estimate and its
derivatives. Ann. Statist. 6, 177-184.
Smirnov, N. N. (1944). Approximate laws of distribution of random variables from empirical data
(in Russian). Uspehi Mat. Nauk 10, 179-206.
Tumansan, S. H. (1955). On the maximal deviation of the empirical density of a distribution (in
Russian). Nauru. Trudy Erevensk. Univ. 48, 3-48.
van Ryzin, I. (1966). Bayes risk consistency of classification procedures using density estimation.
Sankhya Ser. A. 28, 261-270.
Walter, G. and Blum, I. (1979). Probability density estimation using delta sequences. Ann. Statist.
7, 328-340.
Wegman, E. J. and Davies, H. I. (1979). Remarks on some recursive estimators of a probability
density. Ann. Statist. 7, 316-317.
Wertz, W. (1978). Statistical density estimation - A survey.
Wertz, W. and Schneider, B. (1979). Statistical density estimation: a bibliography. Int. Stat. Review
47, 155-175.
Wolverton, C. T. and Wagner, T. I. (1969). Asymptotically optimal discriminant functions for
pattern classification. I E E E Transactions on Information Theory 15, 258-266.
Woodroofe, M. (1966). On the maximum deviation of the sample density. Ann. Math. Statist. 38,
475--481.
P. R. Krishnaiahand P. K. Sen, eds., Handbook of Statistics, Vol. 4 r~
Z.dq~'
O Elsevier SciencePublishers (1984) 551-578
Censored Data
A s i t P. B a s u
2. Types of censoring
551
552 Asit P. Basu
items put on test. Assume X{s to be independent and identically distributed all
having a c o m m o n distribution function F(x). F is usually assumed to be
absolutely continuous. W e may want to terminate the test before complete
information on all n items is available for several reasons. The underlying test
may be a destructive one so that items on test cannot be reused or, because of
time and or cost constraint, we cannot afford to wait indefinitely for all items to
fail.
In survival analysis, often we do not have complete control on the experi-
ment. Patients may enter a hospital or clinic at arbitrary points of time for
treatment and leave (before completion of treatment), or die from a cause
different from the one under investigation. In many cases we may be forced to
terminate the experiment at a given time (end of budget year, say) and try to
develop appropriate inference procedure based on the available data. Depend-
ing on the nature of the test, we usually are led to the following types of
censored data.
(a) Type I censoring. H e r e we assume n items are put on test and we
terminate our test at ~i predetermined time T, so that complete information on
the first k ordered observations
and the remaining observations are known to be greater than X(,). H e r e r (or
r/n) is a fixed constant.
(c) Arbitrary censoring and Random censoring. In (a) and (b) it is assumed
that all items are put on test simultaneously (or that the ordered observations
are available). There may be situations, however, where all items cannot be
tested simultaneously. For example each item may require certain time to set
up the test and it may not be feasible to install all items simultaneously for test.
Similar situations may arise in clinical trials where patients enter a clinic for
treatment at different points of time. A possible situation is illustrated in
Figure 1.
Let Xi denote the survival time for patient i (i = 1, 2 . . . . . n) where all the
patients are being treated for the same disease, say cancer. The study begins at
Censored data 553
X~ (XI< ~)
Patient 1
Patient 2
XE>T2
Patient 3 loss
X3>T3
Patient 4 death
X4>T4
0 T
Start of E n d of
study study
Fig. 1.
time 0 and ends at time T. Since the patients are entering the study at different
points of time, the i-th patient can be observed for a given period, say T~. Thus
X~ is observed if X~ ~< T~. Otherwise, it is censored. In the picture above
321 < Tt. However 322 > T2 so that X2 is not totally observed. For both patients
3 and 4, X / > Ti (i = 3, 4). X3 is censored because the patient withdrew from the
study (is lost so far as the current study is concerned), X4 is censored because
here the cause of death is different from cancer (say heart disease). However, no
distinction is made between these two causes of censoring.
In general, the i-th patient is observed up to time T~ and we observe
min(X/, Ti). We also know whether Xi <~ T/ (uncensored) or X / > 7]i (censored).
In many studies T~'s are considered as given constants. In this case we say we
have arbitrarily censored data. Note, Type I censoring is a special case of
arbitrary censoring, where Ti = T (i = 1, 2 . . . . . n).
Sometimes it is convenient to regard the T~'s to be random also, in that case
we call the above censoring to be random censoring. In this case X and T are
assumed independent. The assumption of randomness may be quite reasonable
in many cases. Note, in this case we are essentially observing the minimum of
two random variables X and T along with an indicator denoting which of the
two is the minimum. In such a case we say we observe the identified minimum
and use it to draw inference about the distribution function F(x). Such a
problem is called the problem of competing risks and many nonparametric
problems can be handled using the techniques developed there. See Basu
(1981), Basu and Ghosh (1980) and Basu and Klein (1982) for a bibliography of
this related area.
(d) Progressive censoring. In the literature several versions of progressive
censoring have been considered. Here the goal is to further reduce the testing
time. In connection of parametric theory, the following version is usually used.
See Klein and Basu (1981).
554 Asit P. Basu
3.1. K a p l a n - M e i e r ( K M ) estimator
Considerable results for censored data have also been obtained based on the
appropriate modification of F , ( x ) . Substantial results have been obtained
specially for random and arbitrarily censored data. H e r e the most important and
widely studied result is the product limit (PL) estimator of the survival function
S ( x ) proposed by Kaplan and Meier (1958). Although this estimator can be
defined for both arbitrarily censored and randomly censored data, for sim-
plicity we shall consider the randomly censored model. In the case of randomly
censored data we observe the following pair
where X~'s are observations on the variable under interest (lifetimes or survival
times), T / s are censoring variables, U i -- m i n ( X i, T/),
Censored data 555
and
6~={~ if U~ is uncensored (X/~< T/),
if U/is censored (X~ > Ti),
(i = 1, 2 . . . . , n). Let U(1 ) ~< U(2) ~< " " " ~< U(r) d e n o t e the o r d e r e d u's and 6[~1be the
value of 6~ associated with u(0. That is
{~ if u(0 is u n c e n s o r e d ,
6[~1= otherwise.
Let U~l)< U~2)< " " < U ( r ) d e n o t e the distinct values of u~, and let 6~] be the value
of 6i associated with U~0. T h e K M estimator is given by
Item n o . ( i ) 1 2 3 4 5 6 7 8 9 10
ui 4 90 55 15 20 35 9 9 45 100
6i 1 0 1 1 0 0 1 0 1 1
u~0 4 9 9 15 20 35 45 55 90 100
6til 1 1 0 1 0 0 1 1 0 1
Note, if the last item is censored the estimator would remain undefined for
values >100. In that case, although it is b o u n d e d by 8(100) and 0, one
sometimes uses the convention that S(x) = 0 for x 1> uc, ).
556 Asit P. Basu
THEOREM 3.1. I f F and H are continuous and T < Ta with G( T) < 1, then the
process
Z ~ ( t ) = X/n{S~(t)- S(t)}, O~
< t ~< T,
The theorem remains true also for the case of arbitrary censoring.
which is well known as the Greenwood's formula for the variance. Peterson
(1977) has proven the strong consistency of K M E whereas F61des and Rejt6
(1981) have established the strong uniform consistency of the KME.
Asymptotic confidence bands for S(t) = F(t) have been obtained by Hall and
Wellner (1980) for the randomly censored model based on KME.
Let
K(t) = C(t){1 + C(t)} -1, C,(t) = n ~ ( N - i ) - ~ ( N - i + 1)-16,
(i:xi<0
and / ( , ( t ) = {1 + C,(t)}-L Let B denote a Brownian bridge process on [0, 1],
and for 0 < a < 1, 0<~h <oo set
Censored d a t a 557
= 1 - 243[A{a(1 - 2a)}-1/2]+ 2 ~] ( - 1 ) k
i=1
THEOREM 3.2. I f T < T~ so that G ( T ) < 1 and F and H are continuous, then,
as n ~ oo,
Pr{S(t) ~< S.(t) + h D . ( t ) for all 0 <~ t <~ T } ~ G+a(A) > G+(A),
P r { ~ . ( t ) - hD.(t) ~< S ( t ) ~ S . ( t ) + A D . ( t ) for all 0 ~< t ~< T}
Ga(A) > G(A),
where
D , ( t ) = n-1/2S,(t)/g,n(t), a = K(T) = C(T)/(1 + C(T)) and
G+(A) = G T ( A ) .
Gillespie and Fisher (1979) and Nair (1981) also provide alternative large
sample confidence bands based on the KME. However, unlike the H a l l -
Wellner bands their bands do not reduce to the standard bands in the
uncensored case.
In case of Type I and Type II censored sample, confidence bands have also
been obtained by Barr and Davidson (1973), Koziol and Byar (1975), Dufour
and Maag (1978). One can, of course, carry out the corresponding goodness of
fit test based on the modified Kolmogorov-Smirnov (KS) statistic. This has
been discussed further in Koziol and Byar (1975).
Similar modifications of the modified C r a m 6 r - V o n Mises statistics for Type
II censored data have been considered by Petitt and Stephens (1978).
Dvoretzky, Kiefer and Wolfowitz (1956) have shown that the sample dis-
tribution function is asymptotically minimax over a wide class of loss functions.
Wellner (1982) gives similar asymptotic minimax properties of the K M E under
a wide variety of loss functions.
There have been further extensions related to the K a p l a n - M e i e r estimator.
Turnbull (1974) has considered the problem when observations are doubly-
censored. That is, he considers the case when some of the observations are
censored on the right and some on the left. Meier (1975) has extended the
theory to more general censoring mechanisms. Campbell (1981) and Langberg
558 Asit P. Basu
and Shaked (1982) have considered bivariate extensions of the survival function
for censored data.
Life test data often are grouped and presented in the form of a life table as used
in actuarial methods. A Bayesian approach, which is weakly nonparametric, to
analyze these data have been considered by Lochner and Basu (1969, 1972,
1976) using a Dirichlet prior. Ferguson (1973) has refined this process by
proposing the Dirichlet process prior on the space of distribution functions. For
randomly censored data Susarla and Van Ryzin (1976) obtain an estimate of the
distribution function using the Dirichlet prior proposed by Ferguson. Ferguson
and Phadia (1979) have further extended the results of Susarla and Van Ryzin
to the class of prior distributions 'neutral to the right'. In reliability context
Dykstra and Laud (1981) define a stochastic process whose sample paths are
assumed to be increasing hazard rates or failure rates r(x), where r(x)=
f(x)/F(x). The posterior distribution of hazard rates is obtained and the Bayes
estimates of F(x) are obtained for both exact and censored data.
4. Two-sample problems
against the alternative that they are different. The two cases of primary interest
are the location alternative
1, if Wi is from F ,
Zi = 0, otherwise.
where the a~'s are given numbers. Thus in the particular case of the Wilcoxon
statistic a~ = i. See Gibbons (1971), Hfijek and Sidfik (1967) and Puri and Sen
(1970) for detailed discussion of various properties of TN.
Suppose, instead of a complete sample, we have a Type II (or Type I)
censored sample. That is, we want to make a decision based on (at most) the
first r of the combined set of N observations.
Initially, many of the tests proposed were of ad hoc nature. For example, in
case of Type II censoring, Sobel (1957) proposed the statistic
r
V~m = ~2 ( n m i - mnl) = m n
_L (..,_n,) (4.2)
iffil i=1 \ m n / '
Vr N
N 2 - ~ liZi,
i=1
where
Ii = (2i - N - 1)/2N, 1 ~< i ~< r,
= r]2N, r+l<-i<-N.
Thus Vr is a linear rank statistic and is related to the Wilcoxon rank test. Basu
(1968) and Lochner (1968) considered a more general approach for modifying a
statistic
N
TIV = ~ , aiZi
i=1
by
N
TNr = a,Z, + X a,Eo(Z, [ Zl . . . . . Zr) (4.4)
1 r+l
560 Asit P. Basu
Rao, Savage and Sobel (1960) consider a number of other censoring schemes,
each of which can be considered variations of Type I or Type II censoring. For
each such variation one can consider suitable rank tests. Thus Young (1970) has
considered the case when one continues observing until m, observations from
F ( x ) are obtained. Halperin and Ware (1974) have considered a variation of
the Wilcoxon statistic when one waits until m r X ' s or n r Y ' s have been obtained.
Spurrier (1981) has considered the case when m r X ' s and n , Y ' s are obtained.
Let the first r ordered observations -oo < W1 < W2 < < W~ < oo be given and
let the Zi's be defined as before. A useful result to study rank statistics for
censored data is,
P ( Z 1 = zl, Z 2 = z2 . . . . . Zr = Zr)
P ( Z 1 = z1 . . . . . Z r = z,)= m - m~ m "
Censored data 561
LEMMA4.1 (Basu, Ghosh and Sen). Let T1, T2 be two (possibly vector valued)
statistics, and for each O, let Lo(T1, T2), LI~(T1) and Lao(T2) by respectively the
joint and marginal densities of T1 and T2 (with respect to some o'-finite measure
tz ). Then
As a corollary we have
are the test statistics corresponding to the LMP tests based on (TI, T2) and T1
alone. A variant of this is due to Sen (1981).
As an application, consider the two sample rank tests for 14o against HE
based on a Type II censored sample, considered by Sobel (1957) and Basu
(1967a, 1968). Here
T ~ = ~ a i z i + n - n~r ( i = l ~=Nr~+
1a~) (4.10)
where TN = EN=IaiZi is the LMP rank test when complete sample is available.
Similarly one can obtain the LMP for scalar alternative.
For the Type I censored sample, considered by Halperin (1960), one similarly
obtains the LMP test which is conditional given r(T) = r. Basu, Ghosh and Sen
(1982) show that, under certain general conditions the two statistics cor-
562 Asit P. Basu
TN, has been considered by Mantel (1966), Thomas (1975), Cox (1972) and Peto
and Peto (1972). Note, if there is no censoring the above reduces to the Savage
test based on exponential scores.
Chatterjee and Sen (1973) have considered the two-sample problem under
progressive censoring, under the more general alternative hypothesis of
regression. Here the alternative hypothesis is given by
Censoreddata 563
where TNk is of the form (4.4). See Sen (1981) for a review of two sample and
other nonparametric tests under progressive censoring.
Let X(1)< X(2) < ' " < X(m) and Y(1)< Y(2)< " " < Y(,) be two independent
ordered samples from two populations, F(x) and G(y) respectively. The
two-sample Kolmogorov-Smirnov test for testing
where Fro(x) and G,(x) are the empirical distributions of the two independent
samples. There have been several modifications of this test corresponding to
the different types of censoring. Tsao (1954) and Conover (1967) studied the
following censoring plans.
Plan I. Continue testing until X(r) is observed. Here the statistic proposed is
Plan 2. Continue testing until X(r) and Y(r) are observed. Here the statistic
proposed is
Plan 3. Continue testing until either X(r) or Y(r) is observed. Here the test
proposed is
Tsao (1954) has provided tables of percentage points for Tr and T'r. Conover
(1967) has studied the properties of these.
Each of the above modifications were for Type II censoring. Similar
modifications for Type I censoring has been considered by Koziol and Byar
(1975). Suppose tests are continued until times T1 (for X population) and T2
(for Y population). Let T = min(T1, T2). Then the statistic considered is
DT = sup (4.18)
- oo<x <T
where Sl(t) = 1 - F(t) and S2(t) = 1 - G(t) are the respective survival functions.
In context of survival studies the random variables of interest are the survival
times (or times to death).
For convenience, let us charge the notation and let T~j denote the j-th
survival time from i-th population (j = 1, 2 . . . . . N~, i = 1, 2). Let 7~j denote the
censoring time corresponding to the j-th observation in i-th sample, so that we
observe
Denote the survival distributions for the censoring variables by C1(') and C2(" )
respectively. The cumulative hazard function fli(t) corresponding to a survival
function Si(t) is given by f l i ( t ) = - I n Si(t). Similarly, define a i ( t ) = - I n H/(t).
Combine all the N1 + N2 observations. Let {tj.,j = 1, 2 , . . . , m} be the distinct
observations, {Tj, i = 1, 2 , . . . , d} the distinct death (survival times) and {cj, i =
1, 2 . . . . . c} the distinct censoring times (m ~ d + c).
As in Section 3, let N/(t) be the number of individuals from sample i at risk
at time t. Similarly, let Di(t) and Li(t) represent the number of deaths and
censorships at time t from some i. Define T=max{tj:Nl(tj)N2(tj)>O}. The
fli(t)'s and a~(t)'s are estimated using Nelson's (1969) estimators and are given
by
Censored data 565
Die-1
i,(t) = ~'. {N~(Tj)- k}-1 ,
7)~<t k=0
(4.20)
/,(t) = ~'~ L,~'-'- {N~(~)- D,(~.)- k}-~ .
Fleming et al. (1980) have given an algorithm for computing the above
statistics and size and power of the test for small and moderate sample sizes
have been computed using Monte-Carlo simulations.
The problem of estimating the ratio of scale parameters in the two-sample
case with arbitrary right censorship has been considered by Padgett and Wei
(1982). To this end, they considered a two-sample version of the Cram6r-Von
Mises statistic.
5. K-sample problem
against the alternative hypothesis that they are different. Two important cases
are
Basu (1967b) considers the following test statistics for the above hypothesis
based on a Type II censored sample. That is, when the first r ordered
observations among all the N = E~ n~ observations are available. Let
Then to test (5.1) against (5.2), the statistic B~N), which is a generalization of the
Kruskal-Wallis statistic, is proposed. Here
Similarly, for testing (5.1) against the ordered alternative (5.3), the statistic
proposed is
( i , j = 1,2 . . . . . k ; i < j ) .
~i ~ = ifi~'j.
Breslow (1970) proposed the statistic S for testing (5.1) against (5.2) under
the assumption that all censoring variables have the same distribution and
studied its properties. Patel and Hoel (1973) and Patel (1975) proposed similar
statistics for generalization of the Jonckheere test and the Friedman test
respectively. Woolson and Lachenbruch (1981) have also considered extension
of the Friedman test.
6. Regression
where fl is the vector of regression coefficients and )t0(t) is the hazard rate
when x = 0. The above model is equivalent to the assumption of Lehmann
alternatives
Since )t0(t) is unknown, Cox (1972, 1975) suggests a partial likelihood approach.
Let R ( t ) denote the risk set, the units that are still in test at time t- and let r~
denote the censoring variable, so that we observe Y~ = min(T~, r~) and 8~ =
I ( T i < - r i ) and x~= (Xil . . . . . xlp), where I is the indicator function (i =
1, 2 . . . . . n). Let Yo) < Y(2)< " " < Y(,) be the ordered observations, assumed
all distinct. Then the partial likelihood is
Off = O. (6.4)
E ( T I x ) = a + xl3, (6.5)
where a is the intercept and fl is the vector of regression coefficients for the
independent variables x.
Miller (1976) suggests minimizing the sum of squares
Y=(Yl,...,Y,)', x=(xij), i = 1 , 2 . . . . . n, j = l , 2 . . . . . p,
n
and
W(13k)= diagonal matrix ((wi(lJk))).
Then
fik+l = [ ( X - X w ) w ( [ $ k ) ( X - J~w)]-x(X - X W ) w ( I J k ) y . (6.7)
y~ is replaced by
and then the usual least squares normal equations are solved. Buckley-James
estimators are also computed iteratively.
Koul, Susarla and Van Ryzin (1981) propose an estimator based on the
following relationship. Assume that the censoring distributions G(t, x ) are
independent of xi. That is let G(t, xi) = G ( t ) for all x~. Then
and then the usual least square estimator normal equations are solved.
Unlike the Miller and Buckley-James estimators, Koul-Susarla-van Ryzin
estimator is explicitly defined and does not require iterative m e t h o d s - w h i c h
may not always converge. Large sample methods are established more
rigorously here. However, the assumption on the censoring distribution is not
considered realistic in many cases.
Miller and Halpern (1981) have compared the above four regression
methods.
570 Asit P. Basu
7. Independence
1
~n=r~(-ffn-_--~ ~ a,(l,l') ifi>r,j>s.
~" I ~" J l~r+l l'=s+l
The above test, based on scores, generalizes the Spearman rank correlation
coefficient.
For arbitrary censored samples, generalization of Kendall's ~- and Spear-
man's p have been considered by several. Brown, Hollander and Korwar (1974)
discuss modifications of Kendall's ~- and apply them to data consisting of
survival time of Stanford heart transplant patients. Latta (1977b) modifies
Spearman's p and applies it to the same data for a comparison to the statistics
of Brown et al. (1974). Weier and Basu (1980) consider several other
modifications where each censored observation is replaced by its conditional
expectation. Thus if X~ is censored at time ui, it is replaced by E~(X~ I X~ > ui)
where Ep denotes the expected value under the Kaplan-Meier estimate.
Censored data 571
8. Other N P m e t h o d s
The above classification rule has been proven to be consistent and the asymp-
totic relative efficiency of this rule with respect to a classification rule for the
normal distribution is computed.
~t
~ff(x) dx <-tz exp(-t//.t) for t~>0. (8.4)
If the reversed inequality is true, F and _P are said to have harmonic new worse
than used in expectation (HNWUE). Klefsj6 (1982) has studied different
properties of the H N B U E and H N W U E classes. The following chain of
implication holds between the six classes.
A V~ _
00 = A W -
(W v,):
and
t~ = log 0q/log(VJV~). (8.6)
For each pair (i, j), we obtain estimator 00 using the method of Padgett and
Wei (1982), and all these are combined to obtain a 'pooled' estimate & of c~,
where
E;=, E~=j+, (log Oik)log(Vk/Vj)
a Ej=I E~=j+~ (log(Vk/Vj)) 2 (8.7)
We next rescale all the variables so that the rescaled value of all follow the
same distribution F0, by using the transformation
Zl.l ~-- ( Wj/ Vo)& Tj, . (8.8)
We can then use (Zjt, 8jr) to obtain the Kaplan-Meier estimator of F0.
The above procedure is further extended by Basu and Ebrahimi (1982) to
include the cases of competing risks.
References
Anderson, P. K., Borgan, O., Gill, R. and Keiding, N. (1982). Linear nonparametric tests for
comparison of counting processes, with applications to censored survival data. Inter. Statistics
Inst. Rev. 50, 219-258.
574 Asit P. Basu
Bailey, K. R. (1979). The general maximum likelihood approach to the Cox regression model.
Ph.D. Thesis, University of Chicago, Chicago, Illinois.
Barr, D. R. and Davison, T. (1973). A Kolmogorov-Smirnov test for censored samples. Tech-
nometrics 15, 739-757.
Basu, A. P. (1967a). On the large sample properties of a generalized Wilcoxon-Mann-Whitney
statistic. Ann. Math. Statistics 38, 905-915.
Basu, A. P. (1967b). On two k-sample rank tests for censored data. Ann. Math. Statistics 38,
1520-1535.
Basu, A. P. (1968). On a generalized Savage statistic with applications to life testing. Ann. Math.
Statistics 39, 1591-1604.
Basu, A. P. (1977). A generalized Wilcoxon-Mann-Whitney statistic with some applications in
reliability. In: Tsokos and Shimi, eds., The Theory and Applications of Reliability Vol. 1.
Academic Press, New York, pp. 131-149.
Basu, A. P. (1981). Identifiability problems in the theory of competing and complementary risks - A
survey. In: Taillie et al., eds., Statistical Distributions in Scientific Work 5. Reidel, Dordrecht,
Holland, pp. 335-348.
Basu, A. P. and Ebrahimi, N. (1982a). Nonparametric accelerated life testing. Technical Report
No. 120, Department of Statistics, University of Missouri-Columbia. I E E E Trans. Reliability 31,
432-435.
Basu, A. P. and Ebrahimi, N. (1982b). Testing whether survival function is harmonic new better
than used in expectation. Technical Report No. 122, University of Missouri-Columbia.
Basu, A. P., Ebrahimi, N. and Klefsj6, B. (1983). Multivariate harmonic new better than used in
expectation distributions. Skand. J. Statistics 10, 19-25.
Basu, A. P. and Ghosh, J. K. (1980). Identifiability of distributions under competing risks and
complementary risks models. Communications in Statistics Theory Method A 9, 1515-1525.
Basu, A. P., Ghosh, J. K. and Sen, P. K. (1982). A unified way of deriving LMP rank tests from
censored data. Technical Report No. 118, University of Missouri-Columbia.
Basu, A. P. and Klein, John (1982). Some recent results in competing risks theory. Survival Analysis,
IMS Monograph 2, 21-29.
Block, H. and Basu, A. P. (1974). A continuous bivariate exponential extension. Jour. Amer. Stat.
Assn. 69, 1031-1037.
Bresiow, N. (1970). A generalized Kruskal-Wallis test for comparing k samples subject to unequal
patterns of censorship. Biometrika 57, 579-593.
Breslow, N. E. and Crowley, J. (1974). A large sample study of the life table and product limit
estimates under random censorship. Ann. Statistics 2, 437-453.
Brown, W. B., Jr., Hollander, M. and Korwar, R. M. (1974). Nonparametric tests of independence
for censored data with applications to heart transplant studies. In: Proschan and Serfling,
eds., Reliability and Biometry: Statistical Analysis of Life Length. SIAM, Philadelphia,
pp. 327-354.
Buckley, J. and James, I. (1979). Linear regression with censored data. Biometrika 66, 429-436.
Campbell, G. (1981). Nonparametric bivariate estimation with randomly censored data. Biometrika
68, 417-422.
Chatterjee, S. K. and Sen, P. K. (1973). Nonparametric testing under progressive censoring.
Calcutta Statistical Association Bulletin 22, 13-50.
Conover, W. J. (1967). The distribution function of Tsao's truncated Smirnov tests. Ann. Math.
Statistics. 38, 1208-1215.
Conover, W. J. (1980). Practical Nonparametric Statistics. Wiley, New York.
Cox, D. R. (1972). Regression models and life tables (with discussion). J. Roy. Stat. Soc. Set. B 34,
187-220.
Cox, D. R. (1975). Partial likelihood. Biometrika 62, 269-276.
Crowley, J. and Thomas, D. R. (1975). Large sample theory for the log rank test. Technical report
No. 415, University of Wisconsin-Madison, Madison, Wisconsin.
Cs6rg~, S. and Horvrlth, L. (1981). On the Koziol-Green model for random censorship. Biometrika
68, 391-401.
Cuzick, J. (1982). Rank tests for association with right censored data. Biometrika 69, 351-364.
Censored data 575
Doksum, K. (1972). Decision theory for some nonparametric models. Proc. Sixth Berk. Symp.
Math. Stat. Prob. 1, 331-341.
Dufour, R. and Maag, U. R. (1978). Distribution results for modified Kolmogorov-Smirnov
statistics for truncated or censored samples. Technometrics 20, 29-32.
Durbin, J. (1973). Distribution Theory for Tests Based on the Sample Distribution Function. SIAM,
Philadelphia.
Dvoretzky, A., Kiefer, J. and Wolfowitz, J. (1956). Asymptotic minimax character of the sample
distribution function and of the classical multinomial estimator. Ann. Math. Statistics 27,
642-669.
Dykstra, R. L. (1982). Maximum likelihood estimation of the survival functions of stochastically
ordered random variables. Jour. Am. Star. Assn. 77, 621-628.
Dykstra, R. and Laud, P. (1981). A Bayesian nonparametric approach to reliability. Annals of
Statistics 9, 356-367.
Efron, B. (1967). The two sample problem with censored data. In: Proc. Fifth Berkeley Symposium
in Mathematical Statistics, IV. Prentice-Hall, New York, 831-853.
Efron, B. (1977). The efficiency of Cox's likelihood function for censored data. J. Amer. Star.
Assoc. 72, 557-565.
Feltz, C. J. (1982). Nonparametric maximum likelihood estimation of stochastically ordered
survival functions. Ph.D. Thesis, University of Missouri-Columbia.
Ferguson, T. S. (1973). A Bayesian analysis of some nonparametric problems. Ann. Statistics 1,
209-230.
Ferguson, T. S. and Phadia, E. G. (1979). Bayesian nonparametric estimation based on censored
data. Ann. Statistics 7, 163-186.
Fleming, T. R., O'Fallon, J. R., O'Brien, P. C. and Harrington, D. P. (1980). Modified
Kolmogorov-Smirnov test procedures with application to arbitrarily right censored data.
Biometrics 36, 607-625.
Frldes, A. and RejtS, L. (1981). Strong uniform consistency for nonparametric survival curve
estimators from randomly censored data. Ann. Statistics 9, 122-129.
FSldes, A. and Rejtr, L. (1981). A LIL type result for the product limit estimator. Z. Wahrsch. Verw.
Geb. 56, 75--86.
Gail, M. H. and Ware, J. H. (1979). Comparing observed life table data with a known survival
curve in the presence of random censorship. Biometrics 35, 385-391.
Gastwirth, J. L, (1965). Asymptotically most powerful rank tests for the two-sample problem with
censored data. Ann. Math. Statistics 36, 1243-1247.
Gehan, Edmund A. (1965a). A generalized Wilcoxon test for comparing arbitrarily singly-censored
samples. Biometrika 52, 203-223.
Gehan, Edmund A. (1965b). A generalized two-sample Wilcoxon test for doubly censored data.
Biometrika 52, 650-652.
Gibbons, J. D. (1971). Nonparametric Statistical Inference. McGraw-Hill, New York.
Gilbert, J. P. (1962). Random censorship. Ph.D. Thesis, University of Chicago.
Gill, R. D. (1981). Large sample behavior of the product limit estimator on the whole line.
Mathematical Centre Report, Amsterdam.
Gillespie, M. J. and Fisher, L. (1979). Confidence bands for the Kaplan-Meier survival curve
estimate. Ann. Statistics 7, 920-924.
H~jek, J. and Sid~ik, Z. (1967). Theory of Rank Tests. Academic Press, New York.
Hall, W. J. and Wellner, J. A. (1980). Confidence bands for a survival curve from censored data.
Biometrika 67, 133-143.
Halperin, M. (1960). Extension of the Wilcoxon-Mann-Whitney test to samples censored at the
same fixed point. J. Amer. Statist. Assoc. 55, 125-138.
Halperin, M. and Ware, J. (1974). Early decision in a censored Wilcoxon two-sample test for
accumulating survival data. J. Amer. Statist. Assoc. 69, 414-422.
Halperin, M., Ware, J. H. and Wu, M. (1980). Conditional distribution-free tests for the two-
sample problem in the presence of right censoring. J. Amer. Statist. Assoc. 75, 638-645.
Hollander, Myles and Proschan, F. (1972). Testing whether new is better than used. Ann. Math.
Statistics 43, 1136-1146.
576 Asit P. Basu
Hollander, M. and Proschan, F. (1979). Testing to determine the underlying distribution using
randomly censored data. Biometrics 35, 393-401.
Hollander, M. and Proschan, F. (1982). Nonparametric concepts and methods in reliability. Tech-
nical Report No. 77, Stanford University.
Hyde, J. (1977). Testing survival under right censoring and left truncation. Biometrika 64, 225-230.
Joe, H. and Proschan, F. (1981). Estimating a decreasing mean residual life distribution from
complete or incomplete data. Scand. J. Statist. (to appear).
Johansen, S. (1978). The product limit estimator as maximum likelihood estimator. Scand. J. Statist.
5, 195-199.
Johnson, R. C. E. and Johnson, N. L. (1980). Survival Models and Data Analysis. Wiley, New York.
Johnson, R. A. and Mehrotra (1972). Locally most powerful rank tests for the two-sample problem
with censored data. Ann. Math. Statistics 43, 823-831.
Kalbfleisch, J. D. and Prentice, R. L. (1980). The Statistical anal Analysis of Failure Time Data.
Wiley, New York.
Kaplan, E. L. and Meier, P. (1958). Nonparametric estimation from incomplete observations. J.
Amer. Statist. Assoc. 53, 457-481.
Kiefer, J. and Wolfowitz, J. (1956). Consistency of the maximum likelihood estimator in the
presence of infinitely many incidental parameters. Ann. Math. Statistics 27, 887-906.
Kitchin, J. (1980). A new method for estimating life distributions from incomplete data. Ph.D.
Dissertation, Florida State University.
Kitchin, J., Langberg, A. and Proschan, F. (1980). A new method for estimating life distributions
from incomplete data. Florida State University, Department of Statistics Technical Report No.
M-548.
Klein, J. P. and Basu, A. P. (198l). Weibull accelerated life tests when there are competing causes
of failure. Commun. Statist. Theor. Meth. A 10, 2073-2100.
Klefsj6, B. (1982). The HNBUE and HNWUE classes of life distributions. Naval Research
Logistics Quarterly 29, 331-344.
Koul, H. L. and Susarla, V. (1980). Testing for new better than used in expectation with incomplete
data. J. Amer. Statist. Assoc. 75, 952-956.
Koul, H. L., Susarla, V. and Ryzin, J. Van. (1981). Regression analysis with randomly right-
censored data. Ann. Statistics 6, 1276-1288.
Koziol, J. A. (1980). Goodness-of-fit tests for randomly censored data. Biometrika 67, 693-696.
Koziol, J. A. and Byar, D. P. (1975). Percentage points of the asymptotic distributions of one and
two sample K - S statistics for truncated or censored data. Technometrics 17, 507-510.
Koziol, J. A. and Green, S. B. (1976). A Cramer-von Mises statistic for randomly censored data.
Biometrika 63, 465-474.
Langberg, N. A. and Shaked, M. (1982). On the identifiability of multivariate life distribution
functions. Ann. Prob. 10, 773-779.
Latta, R. B. (1977a). Generalized Wilcoxon statistics for the two sample problem with censored
data. Biometrika 64, 633-635.
Latta, R. B. (1977b). Rank tests for censored data. Technical Report No. 112, University of
Kentucky.
Lochner, R. H. (1968). On some two-sample problems in life testing. Ph.D. Dissertation, University
of Wisconsin, Madison.
Lochner, R. H. and Basu, A. P. (1969). Bayesian analysis of the time truncated samples. Technical
Report No. 201, Department of Statistics, University of Wisconsin.
Lochner, R. H. and Basu, A. P. (1972). Bayesian analysis of the two-sample problem with
incomplete data. J. Amer. Stat. Assoc. 67, 432-438.
Lochner, R. H. and Basu, A. P. (1976). A Bayesian test for increasing failure rate. In: Tsokos and
Shimi, eds., The Theory and Applications of Reliability, Vol. I. Academic Press, New York, pp.
67-83.
Mantel, N. (1966). Evaluation of the survival data and two new rank order statistics arising in its
consideration. Cancer Chemotherapy Reports 50, 163-170.
Mantel, N. (1967). Ranking procedures for arbitrarily restricted observation. Biometrics 23, 65-78.
Censored data 577
Mehrotra, K. G., Johnson, R. A. and Bhattacharyya, G. (1977). Locally most powerful rank tests
for multicensored data. Communications in Statistics - Theory and Methods A 6, 459-469.
Meier, P. (1975). Estimation of a distribution function from incomplete observations. In: J. Gani,
ed., Perspectives in Probability and Statistics. Academic Press, New York, 67-68.
Miller, R. G. (1976). Least squares regression with censored data. Biometrika 63, 449-464.
Miller, R. G. Jr. (1981). Survival Analysis. Wiley, New York.
Miller, R. G. and Halpern, J. (1981). Regression with censored data. Stanford University Division
of Biostatistics Technical Report No. 66. Biometrika 69, 521-531.
Nair, V. N. (1981). Plots and tests for goodness of fit with randomly censored data. Biometrika 68,
99-103.
Nelson, W. (1969). Hazard Plotting for incomplete failure data. Journal of Quality Technology 1,
27-52.
Oakes, D. (1977). The asymptotic information in censored survival data. Biometrika 64, 441-448.
Oakes, D. (1982). A concordance test for independence in the presence of censoring. Biometrics 38,
451-455.
O'Brien, P. C. (1978). A nonparametric test for association for censored data. Biometrics 34,
243-250.
Padgett, W. J. and Wei, L. J. (1980). Maximum likelihood estimation of a distribution function with
increasing failure rate based on censored observations. Biometrika 67, 470-474.
Padgett, W. J. and Wei, L. J. (1982). Estimation of the ratio of scale parameters in the two-sample
problem with arbitrary right censorship. Biometrika 69, 252-256.
Patel, K. M. (1975). A generalized Friedman test for randomized block designs when observations
are subject to arbitrary right censorship. Communications in Statistics 4, 389-395.
Patel, K. M. and Hoel, D. G. (1973). A generalized Jonckheere k-sample test against ordered
alternatives when observations are subject to arbitrary right censorship. Communications in
Statistics 2, 373-380.
Peterson, A. V. (1977). Expressing the Kaplan-Meier estimator as a function of empirical
subsurvival functions. J. Amen. Statist. Assoc. 72, 854-858.
Peto, R. and Peto, J. (1972). Asymptotically efficient rank invariant test procedures (with dis-
cussion). J. Roy. Stat. Soc. A 135, 185-206.
Pettitt, A. N. (1976). Cramrr-von Mises statistics for testing normality with censored samples.
Biometrika 63, 475-481.
Pettitt, A. N. and Stephens, M. A. (1976). Modified Cramrr-von Mises statistics for censored data.
Biometrika 63, 291-298.
Phadia, E. G. (1973). Minimax estimation of a cumulative distribution function. Ann. Statistics 1,
1149-1157.
Prentice, R. L. (1978). Linear rank tests with right censored data. Biornetrika 65, 167-179.
Puri, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis. Wiley, New
York.
Rao, U. V. R., Savage, I. R. and Sobel, M. (1960). Contributions to the theory of rank order
statistics: Two sample censored case. Ann. Math. Statistics 31, 415-426.
Rolski, T. (1975). Mean residual life. Bulletin of the International Statistical Institute 46, 266-270.
Savage, I. R. (1956). Contributions to the theory of rank statistics: Two sample case. Ann. Math.
Statistic 27, 590-616.
Sen, P. K. (1981). Sequential nonparametrics. Wiley, New York.
Sen, P. K. (1981). On invariance principles for LMP conditional test statistics. Calcutta Statist.
Assoc. Bull. 30, 41-56.
Shaked, M. (1979). An estimator for generalized hazard rate function. Communications in Statistics,
Theory and Methods, pp. 17-33.
Shaked, M. and Singpurwalla, N. D. (1982). Nonparametric estimation and goodness of fit test-
ing of hypothesis for distribution in accelerated life testing. I E E E Trans. Reliability 31,
69-74.
Shaked, M., Zimmer, W. J. and Ball, C. A. (1979). A nonparametric approach to accelerated life
testing. J. Amen Statist. Assoc. 74, 696-699.
578 Asit P. Basu
Shirahata, S. (1975). Locally most powerful rank tests for independence with censored data. Annals
of Statistics 3, 241-245.
Sobel, M. (1957). On a generalized Wilcoxon statistic for life testing. In: Proc. Working Conference
on the Theory of Reliability. New York University and the RCA, pp. 8-13.
Spurrier, J. D. (1981). Comparison of two independent life tests subject to type II censoring.
Technical report No. 78, Department of Mathematics and Statistics, The University of South
Carolina, Columbia, South Carolina.
Susarla, V. and Van Ryzin, J. (1976). Nonparametric Bayesian estimation of survival curves from
incomplete observations. J. Amer. Stat. Assoc. 71, 897-902.
Tarone, R. E. and Ware, J. (1977). On distribution-free tests for equality of survival distributions.
Biometrika 64, 156-160.
Thomas, D. R. (1975). On a generalized Savage statistic for comparing two arbitrarily censored
samples. Technical Report, Department of Statistics, Oregon State University.
Tsao, C. K. (1954). An extension of Massey's distribution of the maximum deviation between
two-sample cumulative step functions. Ann. Math. Statistics 25, 587-592.
Tsiatis, A. A. (1981). A large sample study of Cox's regression model. Ann. Statistics 9, 93-108.
Turnbull, B. W. (1974). Nonparametric estimation of a survivorship function with doubly censored
data. J. Amer. Statist. Assoc. 69, 169-173.
Turnbull, B. W. and Weiss, L. (1978). A likelihood ratio statistic for testing goodness of fit with
randomly censored data. Biometrics 34, 367-375.
Wei, L. J. (1980). A generalized Gehan and Gilbert test for paired observations that are subject to
arbitrary right censorship. 3. Amer. Statist. Assoc. 75, 364-367.
Weier, D. R. and Basu, A. P. (1980a). An investigation of Kendall's ~- modified for censored data
with applications. Journal of Statistical Planning and Inference 4, 381-390.
Weier, D. R. and Basu, A. P. (1980b). Testing for independence in multivariate exponential
distributions. Australian Journal of Statistics 22, 276-288.
Weier, D. R. and Basu, A. P. (1981). On tests of independence under bivariate exponential models.
In: Taillie et al., eds., Statistical Distributions under Sciennfic Work 5. Reidel, Dordrccht, Holland,
pp. 169-180.
Wellner, J. A. (1982). Asymptotic optimality of the product limit estimator. Ann. Statistics 10,
595-602.
Wilks, S. S. (1962). Mathematical Statistics. Wiley, New York.
Woolson, R. F. and Lachenbruch, P. A. (1980). Rank tests for censored matched pairs. Biometrika
67, 597-606.
Woolson, R. F. and Lachenbruch, P. A. (1981). Rank tests for censored randomized block designs.
Biometrika 68, 427-435.
Young, D. H. (1970). Consideration of power for some two-sample tests with censoring based on a
given order statistic. Biometrika 57, 595-604.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 t~
Elsevier Science Publishers (1984) 579-611
1. Introduction
This research was supported in part by National Science Foundation Grant MCS-8102349. Brian
Yandell is now at the departments of Statistics and Horticulture, University 6f Wisconsin, Madison.
579
580 Kjell A. Doksum and Brian S. Yandell
developed. In Section 5, tests based on the total time on test transform are
considered, while in Section 6 the nonparametric optimality of the total time on
test statistic is developed.
Some of the common distance type statistics are discussed in Section 7 and
graphical methods based on Q - Q plots and the total time on test transforms
are given in Section 8. Section 9 gives some tests designed to detect 'New
Better than Used' alternatives.
The rest of the paper concerns testing for exponentiality in the presence of
right censoring. Censoring arises in many practical problems when individuals
Under study cannot be observed until failure. Section 10 presents several
common types of censoring and details the notation used for this part.
Estimates and their properties are reviewed in Section 11. Very few small-
sample results exist for censored data problems (Chen, Hollander and Lang-
berg, 1980). W e therefore briefly review in Section 12 the weak convergence of
the survival curve and of the cumulative failure rate, or hazard function, to a
Gaussian process. This is used in later sections to examine asymptotic proper-
ties of several generalizations of tests without censoring. These include maxi-
mal deviation (Kolmogorov-Smirnov) tests which can be inverted to yield
simultaneous confidence bands (Section 13); tests based on spacings and the
total time on test (14); and others including average deviation, or Cram6r-von
Mises tests, and linear rank tests (15); Monte Carlo simulation results are
summarized in Section 16. The question of exponentiality is explored using
several tests on data from a prostate cancer study in Section 17.
where 0, A > 0. Their properties are discussed in Barlow and Proschan (1975)
and Kalbfleisch and Prentice (1980). When 0 = 1 both reduce to the exponen-
tial density so the exponential hypothesis can be written Ho: 0 = 1.
Properties of alternatives to exponentiality are most conveniently expressed
through the failure (or hazard) rate function defined by
h(t)=f(t)/[1-F(t)], t >0,
density. The failure times of equipment or components with moving parts are
modelled to have an increasing failure rate distribution since wear would
increase the rate of failure.
In the case of gamma and Weibull distributions, we find that the failure rate
is monotone increasing if 0 > 1 and monotone decreasing if 0 < 1. In fact, for
the Weibull density, we have
These densities reduce to the exponential density when 0 = 0, the failure rates
are increasing when 0 > 0.
is nondecreasing. This class has nice closure properties: It is the smallest class
of F ' s which includes the exponential distribution and is closed under the
formation of coherent systems (Birnbaum, Esary and Marshall, 1966) and it is
closed under convolution (Block and Savits, 1976).
(3) The class of NBU (New Better than Used) distributions F is the class
with
582 Kjell A. Doksum and Brian S. Yandell
IFR C I F R A C N B U .
Thus the gamma and Weibull distributions with 0 > 1 are examples of F ' s for
all three classes as are the FL and Fra distributions when 0 > 0.
For further results on these nonparametric classes, see Barlow and Proschan
(1975), and Hollander and Proschan (1984, this volume).
3. Parametric tests
supL(t;Oo
t
Tests for exponentiality 583
For smooth models, as in Section 2(a), the value of R (t) can be computed on a
computer. The test rule based on R(t) is to reject exponentiality when
R(t)>~ k~, where k~ is the ( 1 - a ) - t h quartile of a X2 distribution with one
degree of freedom (e.g., Bickel and Doksum, 1977, p. 229).
Another test suitable for a parametric alternative f(t; O,A) is Neyman's
(1959) asymptotically most powerful C(a) test. This test is asymptotically most
powerful in the class of all similar tests, that is, in the class of all tests that have
level a no matter what the value of the unknown parameter A is.
Let
then it can be easily shown that in our setup the C(a) test reduces to a test
which rejects exponentiality for large values of the test statistic
n i=l
and
~'Z(h)= foh2(t)e-' dt- [~oth(t)e-' dtJZ. (3.2)
The test rule is to reject H0 when T(h) >!c~, where c~ is the upper a critical
value from a standard normal distribution, i.e. c0.0s= 1.645. For the four
parametric models fG, fw, fL and fM of the previous section, we find, after some
simplification,
n
n n
THEOREM 3.1. The test that rejects Ho when TL>~C~ is consistent for any
alternative F in the class of I F R A distributions.
PROOF. R e w r i t e TL as
Table 3.1
Failure data for right rear brake on
D9G-66A caterpillar tractor
derived the locally most powerful test for Weibull alternatives and obtained the
C ( a ) test based on Tw.
where 7"(0)= 0 and T o ) < - . - < T(,) are the ordered T's. Using the Jacobian
result on transformations of random variables, (e.g., Bickel and Dol~sum, 1977,
p. 46), we find that under the exponential hypothesis, D 1 . . . . . V n are in-
dependent and each has the exponential distribution (4.1).
The D ' s are called the normalized sample spacings, or just spacings for short.
They are useful since for the important class of IFR alternatives, there will be a
stochastic downward trend in the spacings and tests that are good for trend will
be good for IFR alternatives. T o make this claim precise, we define a
distribution F to be more IFR than G, written F <c G, if G-1F is convex, where
G-1F is defined by P F ( T - 1 F ( T ) <- t) = G(t), t >t 0 (Van Zwet, 1964; Bickel and
Doksum, 1969). With this definition, ' F is IFR' is equivalent to ' F < o K ' , where
K ( x ) denotes the standard exponential distribution 1 - e -x. Moreover, for the
gamma and Weibull families FG,o and Fw, o of Section 2; FG,o2<cFG,ol and
Fw, o2<cFw, ol are both equivalent to 01 < 02.
W e say that there is a stronger downward trend in D1 . . . . . D , than
D [ , . . . , D " if D}/Di is nondecreasing in i.
Now we can make precise the notion that the more increasing the failure
rate, the stronger the stochastic downward trend in the spacings.
LEMMA 4.1. Suppose F is more I F R than G. Let T1 . . . . . T, be a sample from F
with corresponding spacings D1 . . . . , D.. Then there is a sample T'~. . . . . T" with
distribution G and spacings Di, , D " such that there is a stronger downward
trend in D~ . . . . . D , than in D~ . . . . . D ' .
PROOF. Let 7"(1)<... < T(,) be the ordered failure times and let T~0=
G-~F(T(o), i = 1 . . . . . n. Then Th) . . . . . T[,) are distributed as order statistics
from G. Next let D ] = ( n - i + 1)(T~i)-T~i-1)). Since the function G-aF is
convex, its slope is increasing and thus
W = nl .i__~l
" c (~+1)[ -log ( 1 - n-~l)] .
The choice of c depends on the alternative, and for the parametric alternatives
fG, fw, fL, fM of Section 2 the respective asymptotically optimal choices of c are
(Bickel and Doksum, 1969)
t4
X
12
10
8
:'< xx
6 x x X x
x
4 x x x
x~xx xx
x x x x x X ~ x x
2 x x x
X ~x X
"-~ :~< ~ . . . . ~ .~.~,,
:~, ~ ~:'<:. ~ %~~'~, ~,~ ~..x ~,'~'(^~ x ~
0
S=
n~=lC(n----~)u i i
THEOREM 4.1. I f --c(i/(n + 1)) and J(i/(n + 1)) are nondecreasing in i, then the
linear rank and spacings statistics W and S are trend monotonic.
Recall that a similar test is one where the probability of rejecting H0 when
H0 is true is the same for all values of the scale parameters A. This probability
is the significance level a. Tests that reject H0 when T >I k, where k is a critical
constant and T is trend monotonic, are similar. This is because the downward
trend in AD1. . . . . AD, is the same as that of D1 . . . . . D,, thus T(DI . . . . , Dn) =
T(AD1, . . . , AD,).
From Lemma 4.1 and Theorem 4.1 we get the following important result.
COROLLARY 4.1. Let/3(T, F ) denote the power of the test that rejects Ho when
T >i k, where T is trend monotonic. Then the test is unbiased and has isotonic
power with respect to the IFR ordering, i.e. if F is in the IFR class, then the power
fl(T, F ) is greater than the significance level ot = fl(T, K), and if F is more I F R
than G, then fl(T, F ) >I fl(T, G).
At this point, we have two classes of tests that are good for the non-
parametric IFR class in the sense of being unbiased and having isotonic power.
In each of the two classes of tests, we can obtain the asymptotically optimal test
for a parametric alternative f(t; 0, h), by choosing
where E = 0.5772.
In the case of CM(U)= --U, we have Cu = - and s 2 = l~(n - 1)/(n + 1).
5. T e s t b a s e d o n t h e t o t a l t i m e o n test t r a n s f o r m
o r d e r e d failure times. A t time T(o, the total time the n items have spent on test
is
w,
~.'I=, Dj
THEOREM 5.2. Let fl(T, F ) denote the power of the test that rejects Ho when
T >1 k, where T is monotonic. Then the test is monotonic and has isotonic power
with respect to the IFR ordering, i.e. if F is in the IFR class, then the power
fl(T, F ) is greater than the significance level a, and if F is more IFR than G, then
fl(T, F ) >i fl(T, G).
v=E
i=1
Since V is distributed as the sum of uniform variables under H0, its distribution
is very close to normal. The exact distribution is tabled in Barlow et al. (1972)
for n ~< 12. For n > 9 ,
where J is some nondecreasing function on (0, 1). They found that for a given
parametric alternative f(t; O, A), the test based on Vj will be asymptotically
most powerful if J(u) is chosen to equal - c ( u ) where c(u) is the function given
in (4.1) and (4.2). Thus for the linear failure rate alternative rE(t, 0, A),
-E71~log(1- W~) is asymptotically optimal, while for the Weibull alternative
fw(t; 0, A), E~'_:1log[-log(1 - W~)] is asymptotically optimal.
Other tests based on the spacings Di or total time transforms W~, have been
considered by St6rmer (1962), Seshadri, Cs6rg~ and Stephens (1969), Cs6rg~,
Seshadri and Yalovsky (1975), Koul (1978), Azzam (1978), Parzen (1979) and
CsiSrg3 and R6v6sz (1981b), among others. An excellent source for results on
spacings is the paper by Pyke (1965).
6. Nonparametric optimality
~?l(u)= ~
dO
F-l(u)
[1-F(v)]dv, 0<u<l,
Note that H ~ I ( 1 ) = EF(Ti)= mean of T~. The reason for the inverse notation is
that H ~ 1 can be regarded as the inverse of a distribution in (0, 1). W e let H or
HF denote this distribution. Note that W~ of the previous section can be
regarded as H ~ ( i / n ) where Fn is the empirical distribution function of 7"1. . . . . Tn.
It is easy to check that when F is exponential, H ( u ) = u, 0 <<-u <~1; while F is
I F R iff H ( t ) is convex and H ( t ) <- t on [0, 1]. Thus the problem of testing for
exponentiality can be formulated in terms of H as testing
The optimality criteria we are going to consider is the minimax criteria, i.e. we
want to find the test that maximizes (asymptotically) the minimum power over a
nonparametric class O. The term minimax is used since in decision theory
terminology, risk = 1 - power.
We cannot take O to be the whole I F R class since then the minimum power
would always be a. The total time on test transform n ~ 1 gives us a convenient way
of separating alternatives from H0. W e let O(A), 0 < A < 1, be the class of all
distributions F where H is convex, H ( t ) <<-t, and
sup[t- H(t)]/> a.
If A is fixed, the minimum power over O(A ) will tend to one, thus we must allow
A = A n to depend on n, in fact the interesting cases have
An = O(n-'/~).
L e t / 3 (~T, F ) denote the power of the level a Total Time on Test test which
rejects H0 when V I> k,, then
Now suppose that /3(~0j, F ) denotes the power of the test which rejects H0
when Vj = E7-1~J(W~) is greater than the appropriate critical constant. W e
want to choose J to maximize the limiting minimum power. This is achieved by
choosing J ( w ) = w; thus the Total Time on Test test CT is optimal in the sense of
being asymptotically minimax. The result follows from the fact that if
lim,_~(X/nA,) = c, c E [0, oo], then
The proof can be obtained (under appropriate conditions) from Barlow and
D o k s u m (1972) and Koul and Staudte (1976).
7. Distance statistics
k~ =0"2-t d~
n k / n + 0.26 + (0.5/X/n)
For other distance statistics and their properties, see Seshadri, Cs6rg~ and
Stephens (1969), Durbin (1973, 1975), Cs6rg6 , Seshadri and Yalovsky (1975),
Sarkadi and Tusnady (1977), and Cs6rgc; and R6v6sz (1981(a)).
QF(t) = K-I[F(t)]
and check if this plot falls close to a straight line through the origin. Since we
cannot use the log of zero, we use the modification
and plot O(t) for t = t(o, i = 1 . . . . . n, where {t(0} are the order statistic of the
sample. Since the t(~/are sample quantiles, the resulting plot of (t(i), K-l[i - /n])
is called a 0 - 0 plot. ^
The reliability of O(t) can be judged by giving the simultaneous level a
confidence band
I,:-'(Po(t)+ ko)l,
where k~ is the level a critical value for the D,~ test of Section 7. We reject
exponentiality if the line t/i does not fall entirely within the band. This
graphical test is equivalent to the D* test of the previous section.
Note that, using Section 4, a convex shape for Q(t) indicates an IFR
alternative.
1o0
//II
pb
08
i/I
ip pf
bpl fb
06
Hnl(x)
0.4
f~sbb dJlpjp
02
p
I 1 I I I ~ I I I
0o0
00 02 04 06 08 io0'
X- i
n+l
Fig. 8.1. Total time on test plot for the tractor data of Table 3.1.
where b~ is the critical value of the maximum of the Brownian Bridge on [0, 1].
Thus b~ is given in Owen (1962, p. 439).
Tests for exponentiality 595
9. NBU alternatives
y(F) =
I0I0 [S(t)S(v) - S(t + v)] dF(t) d F ( v ) .
and considers the corresponding test statistic ~(Fn). H e develops the asymptotic
distribution and gives the Pitman asymptotic relative efficiencies 0.931, 1.006 and
0.946 of sC(F,) to y ( F , ) for linear failure rate, Weibull and Makeham alternatives,
respectively.
For further results on measures of N B U alternatives, see Koul (1978) and
Hollander and Proschan (this volume, Chapter 27).
least r failure times, for reasons of power, but cannot afford to wait until all
individuals fail.
In many clinical trials, the beginning and end of the observation period is
fixed, but individuals may enter the study at any time. This is an example of
'fixed' or 'progressive type I' censoring, in which the Ci, i = 1. . . . . n, are fixed
but not necessarily equal.
'Random censorship' refers to experiments in which the censoring times
are randomly distributed. This may occur when censoring is due to com-
peting risks, such as loss to follow-up or accidental death. However, T~
and Ci may be dependent, as is the case when individuals are removed from
study based on mid-term diagnosis. The lack of independence brings problems
of identifiability and interpretation (Horvath, 1980; see Prentice et al. (1978)
for review).
Several other possible assumptions deserve mention. Hyde (1977) and
Mihalko and Moore (1980) considered left truncation with right censoring. Left
truncation may correspond to birth or to entering the risk stage of a disease
(Chiang, 1979). Mantel (1967), Aalen (1978), Gill (1980) and others generalize
this to arbitrary censoring.
Various authors (Koziol and Green, 1976; Hollander and Proschan, 1979;
Koziol 1980; Chen, Hollander and Langberg, 1982) assumed a 'proportional
hazards' model for censoring. That is, G = S ~ with /3 the 'censoring
parameter'.
All these types of censoring are special cases of the multiplicative intensity
model (Aalen, 1975, 1976, 1978; Gill, 1980). For our purposes, let N(t), t >>-O,
be the number of failures in [0, t] and R(t) be the number at risk of failure at
time t~>0. If we are only concerned with right censorship, then R(t)=
#(Y~ 1> t). More generally R(t) must be predictable, that is left-continuous with
right-hand limits and depending only on the history of the process
{N(u), R(u); 0 ~< u <~ t}. We assume that for each t > 0 , the jump dN(t) is a
zero-one random variable with expectation R(t)dH(t), in which H ( t ) is the
cumulative rate, or hazard function. Aalen (1975, 1978) and Gill (1980) and
later authors use the fact that
The tests presented in later sections embody estimates of the survival curve,
the censoring curve, and/or the hazard function. The survival curve is usually
Tests for exponentiality 597
/ 1 \
S~(t) = I ] ( 1 - o - - 7 ~ 7 ~ ) / ( T / ~ < C i ) i f 0 ~ < t < Y(n) ,
{ilYi<-t>
=0 if t > Y(.),
with the Efron (1967) convention that the last event is considered a failure. The
censoring curve may be estimated in a similar fashion, with the relation
G.(t)S.(t) = 1 - R(t+)/n.
S. and (3. are biased but consistent and self-consistent (Efron, 1967). If S is
continuous and G is left-continuous, then S. is asymptotically normal (Breslow
and Crowley, 1974). If S and G are both continuous then S. is strongly
uniformly consistent on any finite interval in the support of both S and G
(F61des and Rejt6, 1981).
The hazard function is estimated by the Nelson (1969) estimator
V(t) = f0l S - a G -1 d H .
Several asymptotic tests for censored survival data are based on the weak
convergence of the survival curve Sn or the hazard function to a Gaussian
598 Kjell A. Doksum and Brian S. Yandell
Z, ~ Bo V, Z , / ( l + V) ff B o ( V / ( I + V))
Z~/(l+ V) ~ B o ( V / 0 + V))
~ TsG ~ T~G
S2 d V = SG-l dH < ~ .
.10 dO
Nair (1980, 1981) and Gill (1983) introduced weight functions which allow
weak convergence to weighted versions of B and B .
on D[0, T].
Gill (1983) proved a similar result on the whole line for a restricted class of
weight functions.
Tes~for exponen~ali~ 599
f j q-2(t) dt < oo
These results will be used with various weight functions in later sections.
Cs6rg~ and Horvfith (1982a, 1982b) showed that Z, (for the survival curve or
hazard function) can be strongly approximated on [0, T] by a Brownian bridge
process. They required continuity of G, but mention in Remark 3.3 (Cs6rg~;
and Horvath, 1982) that continuity and independence of competing risks may
not be needed (see Horvath, 1980). Their results yield the same test statistics as
those available from the weak convergence results. In addition they provide the
rate of convergence, and Chung and Strassen type laws of the iterated logarithm
(Cs6rg6 and Horvath, 1983).
z.(t) ~ / v . ( t )
sup ~/V--~-~,) q~v~--~,)} ~ sup ]q(x)B(x)l,
an~t~Tn a~x<-b
sup
a.~x~T. 1 t) q ~ ~x~bSUpIq(x)B(x)l.
The cited authors restrict attention to the finite intervals [a,, T,] with T,-~p T <
Tsa and a,/> 0. The limiting distribution then depends upon S and G, with
600 K j e l l A . D o k s u m a n d B r i a n S. Y a n d e l l
a = V(a.)/V(T.) and b= 1
a =
V(a.) and b=
v(T.)
1 + V(a.) 1 + V(T.)
Hn(t)+ Kq,~n-1/2(l+ V . ( t ) ) q - l ( ~ )
tabled by Borovkov and Sycheva (1968). The choice q(u)= 1 yields bands
proportional to (1 + V,(t)) with asymptotic distribution equivalent to the Kol-
mogorov-Smirnov distribution tabled by Pearson and Hartley (1976, Table 54)
(see Hall and Wellner (1980) for the case T < Tsc).
Useful approximations to the distributions of supa~x,blq(x)B(x)l and
supa~x~blq(x)B(x)] can be found in the papers by Jennen and Lerche (1981) and
Jennen (1981).
The above tests are consistent but biased against continuous alternatives.
They are distribution-free asymptotically, up to the choice of interval end
points. The choice of q(.) is open, with the obvious remark that different
choices emphasize different intervals of the survival or hazard function. The
Borovkov-Sycheva (1968) type choice is appealing as the bands are then propor-
tionally wider than pointwise confidence intervals. These bands also have equal
variance at every point.
Testsfor exponentiality 601
Fleming and Harrington (Fleming, O'Fallon, O'Brien and Harrington, 1980;
Fleming and Harrington, 1981) introduced a class of Kolmogorov-Smirnov
type tests which differ in an important manner from those considered above.
They point out (Fleming and Harrington, 1981) that the asymptotic distribution
of tests based on (12.1) with q = 1 depend on the maximum of a Gaussian
process with variance function which depends on the censoring curve,
They claim that such a test 'has the undesirable property that its probability of
rejection of [the null hypothesis] i~ased upon information up to time t sys-
tematically tends to zero when censorship of data after time t is increased'. The
tests based on (12.1) are asymptotically distribution-free (Nair, 1980), but the
power against alternatives will certainly depend on the choice of T and the
degree of censoring.
Fleming and Harrington (1981) propose instead to examine
- d ( S ~) = - S a-~ dS = S d H .
See Fleming and Harrington (1981). Note that T may be replaced by Tn-~P T
and the weights and transformations discussed in Section 12 may be used here,
with the obvious modifications.
One-sided maximal deviation tests and simultaneous confidence bands arise
in an analogous manner. See the above references for details.
602 Kjell A. Doksum and Brian S. Yandell
Barlow and Proschan (1969) first derived the distribution of the total time on
test plots under the exponential hypothesis for censored data. Barlow and
Campo (1975) considered several types of censoring, showing the form of the
total time on test and indicating how censoring may affect the stochastic
ordering of scaled total time on test plots. Others (Lurie, Hartley and Stroud,
1974; Mehrotra, 1982) considered weighted spacings tests under type II censor-
ing. Aalen and H o e m (1978) considered the multiplicative intensity model of
Aalen (1978), generalizing earlier results to arbitrary censorship. The A a l e n -
H o e m approach will be considered here.
We construct a random time change on the counting process of failures to
derive a stationary Poisson process under the null hypothesis of exponentiality.
The total time on test transform, based on this random time change, has the
same distribution as that in the noncensored case. Define
O(t) = f0t R ( u ) du
in which R ( u ) is defined as in Section 10. If to = 0 and h < t2<" < tk are the k
distinct failure times, assuming no tied failures, then
D, = R ( u ) du = ~b(t~)- ~ ( t H )
ti- 1
is the i-th spacing. Aalen and H o e m (1978) show that if the survival curve is
S(t) = e x p ( - H ( t ) ) then
N * ( t ) = N(~b-~(t))
N * ( ~ ( t i ) ) = i, i = 1. . . . , k ,
and ( D 1 , . . . , Dk) has the same distribution as a random sample from S(-). For
exponential S ( x ) = e -ax,
= Pr{N*(x) = O} = e - ~ .
Thus many results for noncensored data apply to (D1, , Dk). T h e scaled total
time on test transform is
Tests for exponentiality 603
~,(t,)l~(t~), i= O, 1. . . . , k.
This is plotted for some censored data on prostate cancer in Section 17. The
tests based on spacings presented in the first part of this paper generalize in a
natural way. In particular the cumulative total time on test statistic of Barlow et
al. (1972) becomes, for fixed k,
k-1
vk = ~ ~,(t,)lg,(t,).
i=1
Several other goodness of fit tests have been proposed in the literature for
censored data. These include tests based on contingency tables (Mihalko and
Moore, 1980), average deviations (Koziol and Green, 1976; Cs6rg6 and
Horvath, 1981; Nair, 1980, 1981), generalized ranks (Breslow, 1975; Hyde,
1977; Hollander and Proschan, 1979; Gill, 1980; Anderson et al., 1981, Har-
rington and Fleming, 1981), and kernel density or failure rate estimators
(Yandell, 1983; see Bickel and Rosenblatt, 1973). We briefly present general
forms of the average deviation and generalized linear rank tests.
The average deviation, or Cram4r-von Mises, tests are based on weighted
average deviations, from the null distribution. Let K , ( x ) = V,(x)/(1 + V,(x))
and
for specified weight function q. Similar statistics obtain for the hazard function
and for the transform based on equation (12.1). Asymptotic distribution of W2,
for q = 1 corresponds to that of the classical Cram6r-von Mises test
in the case of no censoring, and is tabled in Pearson and Hartley (1976, Table
54). Koziol and Green (1976) show that ~0] converges to a distribution which
depends on the censoring parameter /3 of the proportional hazards model
(Koziol and Green, 1976). Clearly, the choice of weights q(-) will force
emphasis on different aspects of the distribution S.
604 Kjell A. Doksum and Brian S. Yandell
foT~K(s)(dH~(s)- dH*(s))
in which H*(t)= f;o I[R(s)> 0] dH(s) is the estimable portion of H(.). K(t) is
some function of the history of the survival process, {(N(u), R (u)), u ~ [0, t]}.
If K(t)= R(t), this becomes, with T, = Y(n),
(N(T~)-for~R dH)2/foT~ R d H
which converges to chi square with one degree of freedom. Hyde's (1977)
statistic is a modification of this to allow left truncation. The asymptotic theory
for general K(.) is presented in Anderson et al. (1981) and Gill (1980). Finally we
mention that Burke (1982) has constructed a test for the hypothesis that both T
and C have exponential distributions.
rather than marked change at any one point. The power o f / 9 . against Weibull
alternatives dropped from 0.904 to 0.576 as the censoring parameter increased
from 0.5 to 1. This suggests looking at the statistics of Fleming and Harrington
(1981).
Unfortunately (for our situation), the simulations of Fleming et al. (1980)
(Fleming and Harrington, 1981; Harrington and Fleming, 1981) were only done
for 2-sample situations. Further, these simulations concern statistics which
differ from those considered here. They examine variations on Kolmogorov-
Smirnov tests and several linear rank tests.
Hollander and Proschan (1979) compare the Cram6r-von Mises statistic g~
with two linear rank statistics.
Data were obtained from Hollander and Proschan (1979) on 211 patients
with stage IV prostate cancer who were treated with estrogen in a Veterans
Administration Cooperative Urological Research Group (1967) study. The
observations span the years 1967 through March, 1977. Ninety patients died of
prostate cancer, 105 died of other diseases and 16 were alive in March, 1977.
The live patients and deaths from other causes were counted as censored.
Koziol and Green (1976) failed to reject the hypothesis of exponentiality
with parameter A = 1/100. Using the ~O~ Cram6r-von Mises statistic with the
data truncated at an earlier date, Hollander and Proschan (1979) could not
reproduce the earlier value of ~O2, but their value and those of the Hyde (1977)
and their own test were not significant at a = 0.10. The significance prob-
abilities of the tests varied considerably (0.86, 0.49, 0.14, respectively). Cs6rg6
and Horvath (1981) state that Koziol has computed the Cram6r-von Mises W 2,
the Kolmogorov-Smirnov D,, and the Kuiper statistic with p-values of 0.15,
0.1, and 0.04, respectively. The ordering of p-values reflects the deviation of S,
from S in Figure 1 of Hollander and Proschan (1979). Cs~irg6 and Horvath's
(1981) version of the Cram6r-von Mises test is somewhat more significant
(p = 0.0405).
Our graphical tests indicate that the data may not be exponential, or may at
least be a borderline situation. Figure 17.1 is the total time on test plot,
showing the same criss-cross of the exponential case curve as seen in Figure 1
of Hollander and Proschan (1979). The hazard function plot of Figure 17.2
suggests that the data may be exponential over most of its range, but the rate
appears to taper off. Figures 17.3 and 17.4 are both transformations of the
survival curve (see Nair, 1981). Confidence bands are 80% based on the
Borokov-Sycheva (1968) weights.
T h e P - P plot in Figure 17.3 shows some discrepancy with the exponential.
This is a plot of
1o0
08
06
yl f ft
ix tw
/I j
11
HnI(x)/H~I(1)
i J J1
04
fll jlj
02
0o0 f I I t i I i I i
0o0 02 04 06 08 1o0
i
x = n+--Tl-
Fig. 17.1. Total time on test plot for the prostate data.
2o0
1o5
f
P
I
I---'
Hazard
function 1o0
05 /,,'f-
I ~ I t p I I
0o0
0 20 40 60 80 100 120 140 160
Age
Fig. 17.2. Hazard function plot for the prostate data.
Tes~[or exponentiali~ 607
1o0
0.8
06
Sn(S-1(u)) 04
0o2
0o0 ,I I I I I I i I i
0o0 02 04 06 08 1o0
U
Fig. 17.3. P-P plot with 80% simultaneous confidence band for the prostate data. The straight line
(diagonal) represents the exponential hypothesis.
100
80
60
40
20
s-1(Sn(X))- x 0
-20
-40
-60
-80
-100
(X, S - l ( S n ( x ) ) - x ) , x ~ O.
Since S(u), = e -~a is continuous, the shift function, or Q-Q, plot and the P - P
plot contain the same information. The rate parameter for these two plots was
estimated from the data as ,( = 0.00939.
References
Aalen, O. (1975). Statistical inference for a family of counting processes. Ph.D. dissertation,
Department of Statistics, Univ. California, Berkeley.
Aalen, O. (1976). Nonparametric inferences in connection with multiple decrement models. Scand.
J. Statistic. 3, 15-27.
Aalen, O. (1978). Nonparametric inference for a family of counting processes. Ann. Statist. 6,
701-726.
Aalen, O. and Hoem, J. (1978). Random time changes for multivariate counting processes. Scand.
Actuarial J. 1978, 81-101.
Andersen, P. K., Borgan, O., Gill, R. and Kieding, N. (1981). Linear nonparametric tests for
comparison of counting processes, with applications to censored survival data. Inter. Statist. Rev.,
to appear.
Azzam, M. M. (1978). Tests for increasing failure rate and convex ordering. Ph.D. dissertation,
Department of Statistics, Univ. California, Berkeley.
Barlow, R. E., Bartholomew, D. J., Bremner, J. M. and Brunk, H. D. (1972). Statistical Inference
under Order Restrictions. Wiley, New York.
Barlow, R. E. and Campo, R. (1975). Total time on test processes and applications to failure data
analysis. In: Reliability and Fault Tree Analysis, SIAM, pp. 451-481.
Barlow, R. E. and Doksum, K. A. (1972). Isotomic tests for convex orderings. Proc. 6th Berkeley
Syrup. Math. Statist. Probab. I, 293-323.
Barlow, R. E. and Proschan, F. (1966). Inequalities for linear combinations of order statistics from
restricted families. Ann. Math. Statist. 37, 1574-1592.
Barlow, R. E. and Proschan, F. (1969). A note on tests for monotone failure rate based on
incomplete data. Ann. Math. Statist. 411, 595-600.
Barlow, R. E. and Proschan, F. (1975). Statistical Theory of Reliability and Life Testing. Holt,
Rinehart and Winston, New York.
Bickel, P. J. (1969). Tests for monotone failure rate II. Ann. Math. Statist. 411, 1250-1260.
Bickel, P. J. and Doksum, K. A. (1969). Tests for monotone failure rate based on normalized
spacings. Ann. Math. Statist. 40, 1216-1235.
Bickel, P. J. and Doksum, K. A. (1977). Mathematical Statistics: Basic Ideas and Selected Topics.
Holden-Day, San Francisco.
Bickel, P. J. and Rosenblatt, M. (1973). On some global measures of the deviations of density
function estimates. Ann. Statist. 1, 1071-1095. Correction note (1975), Ann. Statist. 3, 1370.
Birnbaum, Z. W., Esary, J. D. and Marshall, A. W. (1966). Stochastic characterization of wearout
for components and systems. Ann. Math. Statist. 37, 816--825.
Block, H. W. (1975) and Savits, T. H. (1976). The IFRA closure problem. Ann. Probab. 4,
1030-1032.
Tests for exponentiality 609
Hollander, M. and Proschan, F. (1972). Testing whether new is better than used. Ann. Math. Stat.
43, 1136-1146.
Hollander, M. and Proschan, F. (1979). Testing to determine the underlying distribution using
randomly censored data. Biometrics 35, 393--401.
Hollander, M. and Proschan, F. (1984). Nonparametric concepts and methods in reliability. In: P. R.
Krishnaiah and P. K. Sen, eds., Handbook of staastics, Vol. 4, Nonparametric Methods, this volume.
HorvS.th, L. (1980). Dropping continuity and independence assumptions in random censorship models.
Studia Sci. Math. Hung. 15, 381-389.
Hyde, J. (1977). Testing survival under right censoring and left truncation. Biometrika 64, 225-230.
Jennen, C. (1981). Asymptotishe Bestimmung von KenngrSssen sequentieUer verfahren. Doctorial
Dissertation. University of Heidelberg.
Jennen, C. and Lerche, H. R. (1981). First exit densities of Brownian motion through one-sided
moving boundaries. Z. Wahrsch. Verw. Gebiete 55, 133-148.
Kalbfleisch, J. l). and Prentice, R. L. (1980). The Statistical Analysis of Failure Time Data. Wiley,
New York.
Koul, H. L. (1977). A test for new is better than used. Comm. Statist. A 6, 563-573.
Koul, H. L. (1978a). A class of tests for testing "new is better than used". Canad. J. Statist. 6,
249-271.
Koul, H. L. (1978b). Testing for new is better than used in expectation. Comm. Statist. A, 7,
685-701.
Koul, H. L. and Staudte, Jr., R. G. (1976). Power bounds for a Smirnov statistic in testing the
hypothesis of symmetry. Ann. Statist. 4, 924-935.
Koziol, J. A. (1980). Goodness-of-fit tests for randomly censored data. Biometrika 67, 693--696.
Koziol, J. A. and Green, S. B. (1976). A Cramer-von Mises statistic for randomly censored data.
Biometrika 63, 465-474.
Lilliefors, H. W. (1969). On the Kolmogorov-Smirnov test for the exponential distribution with
mean unknown. J. Amer. Statist. Assoc. 64, 387-389.
Lurie, D., Hartley, H. O. and Stroud, M. R. (1974). A goodness of fit test for censored data.
Commun. Statist. 3, 745-753.
Mantel, N. (1967). Ranking procedures for arbitrarily restricted observations. Biometrics 65,
311-317.
Mehrotra, K. G. (1982). On goodness of fit tests based on spacings for type II censored samples.
Commun. Statist. 11, 869-878.
Meier, P. (1975). Estimation of a distribution from incomplete observations. In: J. Gani, ed.,
Perspectives in Probab. and Statistic.: Papers in Honour of M. S. Bartlett. Academic Press, New
York, pp. 67-82.
Mihalko, D. P. and Moore, D. S. (1980). Chi square tests of fit for type II censored data. Ann.
Statist. 8, 625-644.
Nair, V. N. (1980). Goodness of fit test for multiply right censored data. Tech. Report, 1 Sept. 1980,
Bell Tele. Lab., Holmdel, N J, 22 pp.
Nair, V. N. (1981), Plots and tests for goodness of fit with randomly censored data. Biometrika 68,
99-103.
Nelson, W. (1969). Hazard plotting for incomplete failure data. J. Qual. Tech. 1, 27-52.
Neyman, J. (1959). Optimal asymptotic tests of composite statistical hypothesis. Probab. and Statist.
The Harold Cramer Volume, Almquist and WikseUs, Uppsala, Sweden, pp. 213-234.
Owen, D. B. (1962). Handbook of Statistical Tables. Addison-Wesley, Reading, MA.
Parzen, E. (1979). Nonparametric statistical data modeling. J. Amer. Statist. Assoc. 74, 105-131.
Pearson, E. S. and Hartley, H. O. (1975). Biometrika Tables for Statisticians, Vol. 2. Griffin,
London.
Prentice, R. L., Kalbfleisch, J. D., Peterson, A. V., Jr., Fournoy, N., Farewell, V. T. and Breslow, N.
E. (1978). The analysis of failure times in the presence of competing risks. Biometrics 34,
541-554.
Proschan, F. and Pyke, R. (1967). Tests for monotone failure rate. Proc. Fifth Berkeley Syrup. Math.
Statist. Probab. 3, 293-312.
Tests for exponentiality 611
Research supported by the Air Force Office of Scientific Research, under Grant A F O S R 82-K-0007
to Florida State University and by the National Institute of General Medical Sciences under Grant
R01 GM21215 to Stanford University. Part of this research was done while M. Hollander was on
sabbatical leave visiting Stanford University.
613
614 Myles Hollander and Frank Proschan
2.2. DEFINITION . Let F have density fi Then the failure rate function r(x) is
defined as
r(x) = f ( x ) / F ( x ) (2.1)
Physically, we may interpret r(x) dx as the probability that a unit alive at age
x will fail in (x, x + dx), where dx is small, r(x) is also called the conditional
failure rate function, the hazard rate function, and the intensity function.
W A R N I N G : In some of the literature, r(x) is called the hazard function; this
conflicts with our usage, as defined in Definition 2.3.
F r o m (2.1), we obtain by integrating and exponentiating, the well-known
identity:
2.3. DEFINITION. Assume the failure rate function exists. Then fd r(u)du is
called the hazard function.
for x ~> 0 satisfying F ( x ) > 0. Throughout, R(x) will denote the hazard func-
tion.
2.4. DEFINITION. Suppose r(x) exists. Then the distribution F has increasing
failure rate (IFR) if r(x) ~ for 0 ~< x < o0. M o r e generally, F is I F R if F(0) = 0 and
2.6. DEFINITION. Let F have failure rate r. Then F has increasing failure rate
average (IFRA) if F(0) = 0 and (i/x) fd r(u) du ~ for all 0 < x < ~. M o r e
generally, F is I F R A if F ( 0 ) = 0 and (1/x)R(x) ~ for 0 < x < ~ .
2.7. DEFINITION. F is new better than used (NBU) if for all 0 ~< x < ~,
0~<y < ~ :
2.8. DEFINITION. F is new better than used in expectation (NBUE) if the mean
/~ of F is finite and
616 Myles Hollander and Frank Proschan
2.9. DEFINITION. F has decreasing mean residual life (DMRL) if F(0)= 0 and
~o~ (x + t) dt
-~-x) ~ in x (2.7)
For the classes of life distributions defined above (and for their duals) we will
apply the acronym (say IFR) to the distribution function, survival function,
hazard function, random variable, etc., as needed; e.g., we will say 'an IFR
random variable'.
A dual class may be defined for each of the life distribution classes defined
above by reversing the inequality or direction of monotonicity, and by making
appropriate adjustments at end points of support. E.g., we define the DFR
distribution by:
2.10. DEFINITION. Suppose r(x) exists. Then the distribution F has decreasing
failure rate (DFR) if r(x) ~ for 0 ~<x < oo. More generally, F is DFR if
F ( x + y)/F(x) $ in x, (2.8)
2.11. IFR. If the failure rate exists, it is increasing with age. More generally,
the conditional probability of completing a mission of fixed duration success-
fully decreases with initial age of the device.
2.12. IFRA. It has been shown (Birnbaum, Esary and Marshall, 1966) that a
coherent system of independent IFR components has an IFRA system life. See
Nonparametric conceptsand methods in reliability 617
Barlow and Proschan (BP) (1981) for the definition and properties of a
coherent system.
It has also been shown that a device subject to shocks governed by a Poisson
process, which fails when accumulated damage exceeds a fixed threshold has an
I F R A distribution (Esary, Marshall and Proschan, 1973).
2.13. NBU. A used NBU device of any fixed age has stochastically smaller
residual lifelength than does a new device. This interpretation of (2.5) follows
after dividing both sides by F(x).
2.15. DMRL. The older the device is, the smaller is the mean residual life.
This represents a weaker version of the following IFR property: The older the
device, the smaller stochastically is the residual life.
If(x1- Yl)
f(xz-ya)
f(x1- Y2) >/0
f(x2 Y2) "
2.18. IFRA. From Definition 2.6 we see that the hazard function R(x) is
starshaped.
2.19. DEFINITION. A function f(x) >t 0 defined on [0, oo) is starshaped if f(0) = 0
and (1/x)f(x) "~ for x > 0.
618 Miles Hollander and Frank Proschan
2.23. Let life distribution F have hazard function R. Then F is NBU iff R is
superadditive.
2.24. DEFINITION. A function f(x) >10 defined on [0, oo) is superadditive iff
fo F(x)
F ( x + t ) dt $ in x,
which implies
Nonparametricconceptsand methodsin reliability 619
2.26. NBUE CHARACTERIZATION. Let F have mean/z. Then F is NBUE iff the
survival function Pl(x)=d,f (1/lz)fTP(y)dy has a failure rate function rl that
satisfies
DMRL S
F = I F~ d/z(a) (3.1)
F = i p,F~, (3.2)
i=l
3.4. THEOREM(Esary, Marshall and Proschan, 1973). Suppose that (3.3) holds.
Then
(a) /sk is discrete IFR ~ H(_t) is IFR.
(b) /sk is discrete 1FRA ~ H(t) is IFRA.
(c) /sk is discrete N B U ~ ffI(t) is NBU.
(d) /sk is discrete N B U E ~ H(t) is NBUE.
(e) /sk is discrete D M R L ~ ISI(t) is D M R L .
A similar theorem holds for the dual classes representing beneficial aging.
Note that Theorem 3.4 does not represent a closure theorem but rather a
preservation or inheritance theorem. That is, given a class of discrete life
distributions of a certain type, under Shock model 3.2, the corresponding class
of continuous life distributions is engendered. Thus the type of life distribution
describing discrete Pk is inherited by continuous/Q(t).
We may now summarize the closure and inheritance properties of the life
distribution classes corresponding to adverse and beneficial aging.
'Mixture of noncrossing distributions' in Table 3.1 refers to the subclass of
mixtures (3.1) in which for every oq ~ 0~2, either F~ is stochastically smaller
than F~z or F~ is stochastically larger than F~z.
An interesting feature is shown in Table 3.1. With the exception of the
622 Myles Hollander and Frank Proschan
~'~.~'~'~ ~ . ~ ' ~
t t t t t t t t t t
.~.~.~.~.~ .~.~ . . . . . .
~5U50 ooooo
ZZZZZ
ZZ
!~ ~~
ZZ ZZZZZ
.g
N
.~.~
~a ~
o
Nonparametric concepts and methods in reliability 623
DMRL class, each of the classes of life distributions representing adverse aging
is closed under convolution of distributions. On the other hand, none of these
classes is closed under mixture of distributions. For the classes representing
beneficial aging, the reverse is true: Each is closed under mixture of dis-
tribution (in the NWU and NWUE cases, the distributions being mixed are
noncrossing). On the other hand, none of these beneficial aging life distribution
classes is closed under convolution of distributions.
4. Applications
4.1. BOUNDS. Knowing that the life distribution is in a certain class permits us
to develop a bound for survival probability (or other parameters of interest)
given the mean, a specified percentile, or other limited information.
Example:
IFR Survival Function Bound. Let F be IFR with mean/z. Then
P IFR /[i.e.,
\
F ( x + y ) ~, in x~>0 for y > 0 | "
]
F(x)
~)~$+t $ in s ~ 0 for t > 0 ;
This bound on the renewal function holds uniformly on the whole positive
Nonparametric concepts and methods in reliability 625
The first systematic treatment of the IFR class appears in Barlow, Marshall
and Proschan (1963). The paper proves the closure of the IFR class under
convolution, closure of the D F R class under mixture (see Table 3.1), the bound
on IFR survival (see 4.1), and a number of other fundamental properties of the
IFR class and of the failure rate function. Actually, D F R mixture closure is a
consequence of an earlier result of Artin (1964), in which he shows that the
sum of log convex functions is log convex.
The I F R A class was introduced in Birnbaum, Esary and Marshall (1966) as
the class of life distributions of coherent structures of IFR components. I F R A
life distributions have been shown to arise much more generally as first passage
626 Myles Hollander and Frank Proschan
time distributions (Brown and Chaganty, 1983). IFRA distributions also de-
scribe lifelengths experiencing damage from random shocks under fairly general
assumptions (Esary, Marshall and Proschan, 1973). This paper also shows that
the renewal random variable N ( t ) (see 4.5) has a discrete IFRA distribution.
The IFRA convolution closure result listed in Table 3.1 is due to Block and
Savits (1976).
The NBU and N B U E classes are treated in a systematic way and their
fundamental roles in maintenance analysis are shown in Marshall and Pros-
chan (1972).
A convenient single reference for IFR, IFRA, NBU, NBUE is BP (1981). It
also contains references which supplement the results presented in the book.
See also Mehrotra (1981) for a discussion of mixtures of NWUE.
The D M R L class is receiving a good deal of current attention, especially in
biostatistics. (See Chen, Hollander and Langberg, 1983b; and Hollander and
Proschan, 1975.) In medical research, one measure of effectiveness of treatment
is the mean residual life (MRL) of the patient; MRL, in general, is also a
measure of great importance in demography, life insurance, communication
between doctor and patient, and comparison of diseases. IMRL mixture
closure is discussed in Haines and Singpurwalla (1974) and Brown (1981).
Bondesson (1983) showed that the DMRL class is not closed under con-
volution.
The optimum spares provisioning application of 4.3 is discussed in BP (1965,
Chapter 6). A more general treatment appears in BP (1981, Chapter 7). The
optimum checking application of 4.4 is described in BP (1965, Chapter 4).
Application 4.5 is developed in detail originally in BP (1964) and then under
weaker hypotheses in Marshall and Proschan (1972).
Research on multivariate versions of life distribution classes based on
notions of aging is currently being vigorously pursued. A survey is presented in
Block and Savits (1981).
Systematic treatment of multistate performance of components and systems
is in its infancy, but the infant is growing rapidly. A survey is presented in
El-Neweihi and Proschan (1980). Additional research is discussed and
references are given in Ross (1979), Butler (1979a,b), Gritiith (1980), Natvig
(1980), and Fardis and Cornell (1981).
Since the r.v. nF,(x) has a binomial distribution with parameters n and
p = F(x), it is easily seen that
E(f~(x)) = F(x), Var(ff'~(x)) = n-'F(x)F(x)
and
n'/2(F,(x) - F(x))~ N(O, 0-2)
Dvoretzky, Kiefer and Wolfowitz (1956) have shown that the sample dis-
tribution function is asymptotically minimax over a wide class of loss functions.
Phadia (1973) proves minimaxity of Pn under the loss function
where W is a given finite measure (a weight function) on (R, ~), the real line
with the 0--field ~ of Borel subsets of R. See Read (1971) for a setting in which
if', is asymptotically inadmissible.
Confidence bands for F(x) can be obtained by inverting the Kolmogorov
goodness-of-fit statistic D=sup-~<x<~lP.(x)-Fo(x)l;D is used to test the
hypothesis/40: F(x)= Fo(x), where Fo(x) is completely specified. From tables
of the distribution of D under H0 (cf. Birnbaum, 1952) we can find a critical
value c~ such that P{D <~c~ I H0} = 1 - a . Then the 'curves' {/Sn(x)-ca,/6n(x) +
c~} form a simultaneous confidence b a n d for F(x) with confidence coefficient
1-t~.
The e.d.f, plays a central role in estimating parameters (other than F(-)
itself). For example if O(F) is a parameter of interest such as the mean, median,
or standard deviation of F, 0(/~) is a natural estimator of O(F).
The e.d.f, also plays a useful role in Efron's (1979) method of bootstrapping.
Bootstrapping uses the e.d.f, to estimate the sampling distribution of some
prespecified r.v. R(X,F) (for example, one may be interested in R(X, F ) =
t(X)-O(F), where fiX) is an estimator of O(F)), where X = (X1 . . . . . X,).
628 Myles Hollander and Frank Proschan
where X ( 1 ) ~ ~ X(n) are the ordered X's. log L can be made arbitrarily large
by taking f(X(,)) arbitrarily large, so first consider the (constrained) class o~M of
IFR distributions with failure rate bounded by M. Grenander (1956) and
Marshall and Proschan (1965) (also see Barlow et al., 1972) first find the unique
f U (say) in ~M which maximizes (7.2). Letting M -~ ~, it is then shown t h a t / ~ u
converges in distribution to an estimator 0,. The latter is called the M L E for F
in the class of IFR distributions.
Grenander shows that log L is maximized over .~M by a d.f. with failure rate
constant between observations, and the estimator for r (corresponding to F in
W,M)is
rM,(Xo))= min{min max [v l-~u(r~l+ . . .+r~_l)]
v~i+l u<-i 1 -1
,M} (7.3)
where r, = M and
rj = {(n - j)(Xo+t)- X0))}-' for j = 1, 2 . . . . . n - 1. (7.4)
Letting M ~ ~, the estimator for r, corresponding to F in the IFR class, is
f, (X(0) = min max I v - u]-l[(n-, u)(X(.+l)-X(.))+...
v~i+l u~i
For the remaining values of x, f~(x) is 0 for 0 ~<x < 32(1), oo for x ~>X(,), constant
between observations, and right continuous.
The M L E t~, for the d.f. is obtained via
t~,(x) = 1 - e x p [ - f ~ f , ( u ) d u ] , (7.6)
where f, is given by (7.5). Marshall and Proschan Q965) show that if F is IFR
with continuous failure rate r, then for all x, G , ( x ) - ~ F ( x ) almost surely.
Barlow et al. (1972) point out that f, and On perform badly in the tails of the
distribution.
Rather than compute rn directly via (7.5), a more intuitive (and easier
computational) method proceeds as follows. Note that the values rj--
1/{(n-j)(X(j+l)-X(i))} may be considered naive estimates for the constant
failure rate on the interval [X(j), X(j+x)), for on that interval the observed 'time
on test' is ( n - j)(X(j+~)-X(j)) and there has been 1 failure. To obtain fn, first
calculate the naive estimators given by (7.4). If these estimators are in the
'right' order, that is if rt <~ 1"2<~ " " " <~ m-l, then take r , ( X ( o ) = ri, i = 1 . . . . . n - 1.
If there is a 'reversal' in that r~ > r;+l, then replace both r~ and ri+1 by their
harmonic mean
Keep averaging this manner just to the point necessary to eliminate all
reversals.
Similar calculations will yield the M L E of a failure rate known to be
decreasing over time. See Marshall and Proschan (1965) for details.
Dykstra and Laud (1981) obtain Bayesian estimates of the failure rate and
the distribution function for both complete and censored data models.
1111111111111111 8
II n II II II II II II 8
f21.
O II II LI II II II II II 8
II II II II II II II II 8
II
~ ~ 8
II II II It II II II II
I
+
0
Nonparametric concepts and methods in reliability 631
0, O~<X < X ( 1 ) ,
-In/~(x) = Aix, X(o <~ x < Xo+~), i = 1 . . . . . n - 1, (7.7)
X(n) <~x,
a.s. fF-l(u)
1 fF-1(u) ~ -1
(7.11)
632 Myles Hollanderand FrankProschan
From (7.10) it follows that the M L E of an I F R A F is a strongly consistent
estimator of
Marshall and Proschan give various examples where G~ differs from F. In the
-- Am a.s.
distributions such that F~ ~ F. As they point out, the estimator if", defined by
(7.13) below, only partially succeeds. Samaniego and Boyles study the function
/~.(x + v)
S~(x)=sup ~ " , (7.13)
' F,(y)
The d.f. Fo(x) can be viewed as the prior guess at the unknown F(x). Using the
result that the posterior distribution of P, given a sample X1. . . . . X,, is
634 Myles Hollander and Frank Proschan
Dirichlet with parameter a +El"--1 6xi, in conjunction with (7.16), gives the
Bayes estimator for a sample of size n as
f
R(a) = [a(R)/{(a(R) + 1)(a(R) + n)}] J Fo(x)(1 - Fo(x)) dW(x).
(7.19)
is a necessary and sufficient condition for the Bayes risk of/~'+1, with respect to
the Dirichlet process prior, to be larger than the overall expected loss using
H'+I. A sufficient condition for H'+I to be better than F'+I is
The sequence of estimators given by (7.20) can also be used for the problem
of simultaneously estimating n + 1 distribution functions. For note that by
interchanging the roles of samples 1 and n + 1, the H estimator defined by
(7.20) becomes an estimator of F1 based on X1 and the 'past' samples
X2. . . . , X'+I. More generally, an estimator of Fj based on all the samples is
where
p; = a (R )/{a (R) + ms}. (7.25)
Note that if all the sample sizes are equal condition (7.22) reduces to n > 1
(the latter condition was given in Theorem 3.1 of Korwar and Hollander
(1976)). Their result is reminiscent of, though much weaker than, the famous
James and Stein (1961) result (see also Stein, 1955, and Efron and Morris, 1975)
for simultaneous estimation of k normal means. The James-Stein estimator
636 Myles Hollander and Frank Proschan
does better, for each point in the parameter space, when k/> 3, in terms of
mean squared error, than the classical rule which estimates each population
mean by its sample mean. The Korwar-Hollander result says that in the equal
sample size case, if there are at least three distribution functions to be
estimated, one can do better (not pointwise for each point in the parameter
space but on the average where the average is with respect to the Dirichlet
prior) than using, for each distribution, the corresponding sample distribution
function.
EXAMPLE 7.2. The data of Table 7.2, adapted from Proschan (1963), give the
intervals between successive failures of the air conditioning systems of three
'720' jet airplanes. We use these data to illustrate the estimators defined by
(7.24).
Table 7.2
Intervals between failures of air conditioning systems
Plane
23 97 50
261 51 44
87 11 102
7 4 72
120 141 22
14 18 39
62 142 3
47 68 15
225 77 197
71 80 188
246 1 79
21 16 88
42 106 46
20 206 5
5 82 5
12 54 36
120 31 22
11 216 139
3 46 210
14 111 97
71 39 30
11 63 23
14 18 13
11 191 14
16 18
90 163
1 24
16
52
95
Nonparametric concepts and methods in reliability 637
For the data in Table 7.2, n + 1 -- 3, ml = 30, m2 -- 27 and m3 24, and note that
=
and 16, is the e.d.f. It can be shown that H;l(fln) = Sj/n, ] = 0 . . . . . n, where
J
Ss = ~'~ (n - i + 1)(X(o- X(,-a)),
i=1
and Xtl ) < . . - < X(,) are the order statistics of a random sample X 1. . . . , X n
from F, X(o)%f0 and So = 0. Ss is called the total time on test at Xq). If we set
638 Myles Hollander and Frank Proschan
U i = Si/S,, j = O, 1 . . . . . n, (8.2)
in favor of
H0 is rejected in favor of
A = ~ ajD;/Sn (8.3)'
j=l
where
aj = 6-I{(n + l)3j - 3(n + 1)2j2 + 2(n + l)j3}. (8.4)
The null distribution of A can be determined by using the result that under H0,
D1, D2 . . . . . Dn are iid according to F ( x ) = 1 - exp(-Ax). Klefsj6 provides null
distribution tables of A* (given by (8.5)) for the sample sizes n = 5(5)75, giving the
upper and lower 0.01, 0.05, 0.10 percentiles. Klefsj6 also shows that under H0 the
statistic
Other IFR tests. Many tests of H0 versus H1 have been based on the
normalized spacings, while other tests utilize only the ranks of the normalized
spacings. Bickel and Doksum (1969) study both types of tests based on
spacings. Their tests are partially motivated by a result of Proschan and Pyke
(1967) that shows that when F is IFR, the D ' s exhibit a decreasing trend in that
P(Di ~>D r ) < 1 whenever i > j . This led Bickel and Doksum to define a test
function ~b = 4~(D1. . . . . /9,) (the probability of rejecting H0 in favor of H1,
given the D's) to be monotone in the D ' s if
Bickei and Doksum show that all monotone tests are rank tests. Furthermore,
letting Ri denote the rank of Di in the joint ranking from least to greatest of
D1 . . . . . Vn, they showed that the rank test which rejects H0 in favor of HI for
large values of W1 = Ef=l i l o g ( l - Rg(n + 1)) is asymptotically most powerful
for IFR Makeham alternatives in the class of linear rank statistics. Bickel and
Doksum also found that the Pitman asymptotic relative efficiency e(W1, M), oi
W1 with respect to the Proschan-Pyke (1967) rank statistic M, is equal to ~ for
all sequences of alternatives {F0.} tending to H0. The Proschan-Pyke statistic is
M = ~ i < j ~ ( R i , Rj) where 4,(a, b ) = 1 if a > b, 0 otherwise, and H0 is to be
rejected in favor of Ht for large values of M.
Although W1 dominates M with respect to Pitman asymptotic relative
efficiency, that result does not hold for finite n and fixed IFR alternatives.
Furthermore, although null distribution tables for W1 are easily generated
(using the fact that under H0 all n! possible values of (R1 . . . . . Rn) are equally
likely) we are unaware of such tables, whereas the M-statistic can easily be
referred to published tables of Kendall's rank correlation coefficient.
(Specifically, refer 2 M - n(n - 1)/2 to the null distribution of the statistic K as
given in Table A.21 of Hollander and Wolfe (1973).) A normal approximation
treats
where g(t) = (1 - t) -a fYlogo-o x-t e-X dx. Large values lead to rejection of H0 in
favor of/-/1. Bickel and Doksum show that W0 is asymptotically equivalent to
Proschan and Pyke's M, and that, in the class of linear rank statistics, W2 is
asymptotically most powerful for linear failure rate alternatives, W3 is asymp-
totically most powerful for Weibull alternatives, and W4 is asymptotically most
powerful for gamma alternatives.
Bickel and Doksum also consider test statistics, such as the total-time-on-test
statistic, based on studentized spacings statistics. (We discuss the total-time-on-
test statistic in the context of a test for N B U E alternatives in Section 8.5.) In
particular, Bickel and Doksum show that the rank tests, despite being as good
in terms of Pitman asymptotic relative efficiency as their counterparts based on
studentized linear spacings, are less powerful than their counterparts based on
the studentized linear spacings.
H0 is rejected in favor of
B = ~ [3iD/S,, (8.6)'
j=l
where
/3j = 6-1{2j3 - 3j 2 + j(1 - 3n - 3n 2) + 2n + 3n 2 + n3}. (8.7)
Klefsj6 provides null distribution tables of B* (given by (8.8)) for the sample
sizes n = 5(5)75, giving the upper and lower 0.01, 0.05, 0.10 percentiles. He also
Nonparametric concepts and methods in reliability 641
8.3. Ar~ NBU TEST (Hollander and Proschan). Hollander and Proschan (1972)
developed a test of
= ~ - f o ~o F(x + y ) d F ( x ) d F ( y )
def 1
= a - k (F) (8.9)
Looking at the left-hand-side of (8.10), we see that a (F) is less than -~when the
conditional chance "that a used item (which has already survived past the
random time X2) will survive an additional random time X3" is less than the
chance "that a new item will survive a random time X3" (the latter chance of
course being ).
Replacing F by the e.d.f. F,, Hollander and Proschan suggest rejecting H0 in
favor o f / / 3 if J, defined by (8.11), is too small. (J is asymptotically equivalent
to a (F,) and is more convenient to work with.) H0 is rejected in favor of
where ~0(a, b) = 1 if a > b, 0 otherwise and the E' is over all n(n - 1)(n - 2)/2
triples (al, a2, a3) of three integers such that 1 <~0/i <~n, 0/1 # a2, al ~ a3 and
0/2 "~ 0/3-
Hollander and Proschan give upper and lower critical values of
in the a = 0.01, 0.025, 0.05, 0.075 and 0.10 regions for n = 4(1)20(5)50. The
normal approximation treats
j * = {n(43215)}*a(J - ~) (8.12)
Hollander and Proschan also show that the test which rejects for large values
of J is consistent against the class of continuous NBU distributions. (For more
on this NBU test, see Hollander and Wolfe (1973) and Cox and Hinkley
(1974).)
where ee(x) = {f7 F ( u ) du}/ff'(x) is the mean residual life at time x for x I> 0
(and e~(x) =- 0 whenever F(x) = 0). The parameter 1,(F) is an average value of
the deviation ff(X)ff'(y){eF(x)--eF(y)}, with the weights F(x) and F ( y )
representing the proportions of the population still alive at times x and y
respectively, and thus furnishing comparisons concerning the mean residual
lifelengths from x and y respectively. Hollander and Proschan replace F by Fn,
the e.d.f., and a statistic asymptotically equivalent to ~,(Pn) is
n
In order to make the test scale invariant, Hollander and Proschan utilize
V *= V/~2, significantly large values suggesting DMRL alternatives and
signficantly small values suggesting IMRL alternatives. They give critical values
(obtained by Monte Carlo sampling) of V ' = {(210)n}l/2V * for n = 2(1)20(5)50
and t~ in the upper and lower 0.01, 0.05, 0.10 regions. Exact tables of V' are
given by Langenberg and Srinivasan (1979) for a in the upper and lower 0.01,
0.05, 0.10 regions and n = 2(1)20(5)60. Hollander and Proschan show that,
under H0,
The mean residual life function. There is an inversion formula giving the
survival function _P(x) in terms of the mean residual life function eF(x). The
formula is often attributed to Cox (1962, Exercise 1, p. 128), but see Kotz and
Shanbhag (1980) for a general result and related references. Hall and Wellner
(1981) use the inversion formula and show how knowledge of mean residual life
functions can be used in modelling and model identification. Bryson and
Siddiqui (1969) point out that the mean residual life function lends itself readily
to graphical analysis; Figure 2 of Bryson and Siddiqui plots the empirical mean
residual life function ~F(X) = {f2 Fn(u) du/F~(x)}I{x < X(,)} for survival times of
patients suffering from chronic granulocytic leukemia, with x = 0 taken as the
date of diagnosis. Yang (1978) established strong consistency of gF on a finite
interval [0, T] and also showed that the associated process nm{gF(X)- eF(X)}
converges weakly to a Gaussian process. Hall and WeUner (1979) strengthened
Yang's results; in particular they extended her weak convergence result to the
positive real line, and they also derived nonparametric simultaneous confidence
bands for eF(X). Hall and Wellner (1981) show that the empirical mean residual
life function is a useful addition to the arsenal of techniques (histograms,
empirical survival functions, failure rate estimators, total-time-on-test plots,
etc.) for analysis of survival data.
Proschan (1975), and later Klefsj6 (1983), show that K arises in a natural way
as a test against NBUE (or NWUE) alternatives. Hollander and Proschan
consider the parameter ~7(F)= f f ( X ) { e F ( O ) - - e r ( x ) } d F ( x ) as a measure of
deviation for a given F from
K = n -2 ~ dig(i),
i=1
where
d~ = ~ - 2i + . (8.16)
8.6. SOME PITMAN EFFICIENCYVALUES. One commonly used measure for com-
paring two competing test sequences {T~,.}, {T2..} satisfying certain regularity
conditions is the Pitman asymptotic relative efficiency (A.R.E.) (cf. Lehmann,
1975). Let {F0.} be a sequence of alternatives with 0. = 00 + bn -v2, where b is an
arbitrary positive constant and Foo satisfies the null hypothesis. The Pitman
efficacy for a test statistic sequence {T.} (say) that is asymptotically normal with
mean E o ( T . ) and standard deviation o'o(T.)/n against the sequence of alter-
natives {F0.} is, subject to suitable regularity (cf. Lehmann, 1975),
For two such competing test sequences {TI,,}, {T2,,}, the Pitman A.R.E.
eF(TI, T2) of 7"1 with respect to T2 is eF(T1, 7"2)= {cr~(F)/cr2(F)} z. We may
roughly say that the Pitman A.R.E. of T1 with respect to T2 is the limiting
ratio of samples sizes n2/nl such that both tests achieve equal power against
equal alternatives that are 'close to' the null hypothesis.
Here Foo is exponential and we consider the linear failure rate, Makeham,
Pareto, Weibull, and gamma distributions (F1, F2, F3, F4, F5 say) with c o r -
responding densities
Table 8.1
Pitman A.R.E.
A* B* V* K* 2
C MAX
For the J* statistic given by (8.12), Hollander and Proschan (1972) show that
eF4(J*, K * ) = 0.937 and evl(J*,K * ) = 0.45. Other efficiency values for J* are
given by Koul (1978b) and Deshpande (1983). Other efficiency values for K*
are given by Bickel and Doksum (1969), and Borges, Proschan and Rodrigues
(1982).
EXAMPLE 8.1. We use the methylmercury poisoning data of Table 7.1 to
illustrate the IFR test based on A*, the IFRA test based on B*, the NBU test
based on J*, the D M R L test based on V*, and the NBUE test baaed on K*.
Table 8.2 expedites the calculation of these statistics by giving the ordered
sample, the normalized spacings, and the a's,/~'s, c's and d's defined by (8.4),
(8.7), (8.14) and (8.16), respectively.
Table 8.2
Calculation of A*, B*, V*, K*
i X(O Di ai ~i ci di
Although the tied values at 82 are not consistent with the assumption that F
is continuous, we use the null distribution tables based on that assumption.
For A we find using (8.3)', (8.4), (8.5) and Table 8.2, A* = 3.77, with P < 0.01
from Klefsj6's (1983) table for n = 10. For B we find using (8.6)', (8.7), (8.8)
and Table 8.2, B * = 4.98 with P < 0.01 from Klefsj6's (1983) table for n ----10.
For J we find, using (8.11), J = 0 (since Xo0)< X(~)+ X(z)) and from Hollander
and Proschan (1972), P = 1/(~)= 1/43758= 0.00002. For V we find using
(8.13), (8.14), (8.15) and Table 8.2, V ' = 2.10 and 0.01 < P < 0.05 from Lan-
genberg and Srinivasan's (1979) table (the 0.01 critical value is V ' = 2.14). For
K, we find using (8.16), (8.17), and Table' 8.2, (10)K* +4.5 = 7.74 with a
P < 0.01 from Barlow's (1968) table.
There has been vigorous research in the area of survival analysis for
censored data. Recent books covering portions of the research are Lee (1980),
Elandt-Johnson and Johnson (1980), Kalbfleisch and Prentice (1980), Miller
(1981), and Lawless (1982). These are many types of censoring including Type I
censoring, Type II censoring, and random censoring (cf. Miller, 1981). Many of
the inferential procedures discussed in Sections 7 and 8 have been generalized
to accommodate the various types of censoring. Space limitations prohibit a
comprehensive account here, and instead we will simply reference some of the
generalizations in the randomly censored model.
In the randomly censored model, instead of observing a complete sample
X1 . . . . , X,, one is able to observe only the pairs Zi = min(Xi, T/), 6i = 1 if
Z~ = X~ (i-th observation is uncensored) and 6~ = 0 if Z~ = T~ (i-th observation
is censored). We assume that X ~ , . . . , X, are iid according to the continuous
life distribution F, T~. . . . . T, are iid according to the continuous censoring
distribution H, and the T's and X ' s are mutually independent. The censoring
distribution H is typically, though not necessarily, unknown and is treated as a
nuisance parameter.
The Kaplan-Meier (1958) estimator (KME) can be viewed as a non-
parametric M L E (see Kaplan and Meier, 1958) and when there is no censoring
it reduces to the e.d.f, of Section 7.1. Under our continuity assumptions, the
K M E Fk,(t) can be written as
nKn(t)
F k , ( t ) = [-I c f 2 ' I l Z ( , ) ~ t}, t ~ [0, oo1, (9.1/
i=1
where cm = (n - i)(n - i + 1)-1, Z(1) < ' . - < Z(,) are the ordered Z's, 6(o is the 6
corresponding to Z(0, K , ( t ) = n - ~ E T = l I { Z i <<-t} is the empirical distribution
function of the Z's, and where a product over an empty set of indices is defined
to be 1.
Large sample properties of the K M E have been studied by many authors. In
Nonparametric concepts and methods in reliability 649
substituting the respective KME's for F, G,...), and showed how an ap-
propriate transformation of nlr2{Fk,(t)-F(t)} yielded the standard Weiner
process. There are many two-sample competitors; see also Mantel (1966), Peto
and Peto (1972), Latta (1977a), Aalen (1978), Fleming et al. (1980) and the
survey papers of Oakes (1981) and Andersen et al. (1982).
One of the earliest K-sample censored data tests of the hypothesis that K
population distributions FI, F2. . . . . Fr are equal is Breslow's (1970) general-
ization of the Kruskal-Wallis test. See also Tarone and Ware (1977), Prentice
(1978), Brookmeyer and Crowley (1980), and Andersen et al. (1982) for
competitors.
A paired-sample test for censored data is proposed by Wei (1980).
Sign tests and confidence intervals for the median survival time when the data
are right censored are given by Brookmeyer and Crowley (1982) and Emerson
(1982).
Tests, using randomly censored data, that the underlying distribution is a
specified distribution F0 (say), include those proposed by Breslow (1975),
Koziol and Green (1976), Hyde (1977), Turnbull and Weiss (1978), Hollander
and Proschan (1979), Gail and Ware (1979), Koziol (1980), Fleming et al.
(1980), Cs6rg6 and Horvfith (1981), Woolson (1981), and Andersen et al.
(1982). Turnbull and Weiss (1978) and Chen (1981) have devised procedures for
the goodness-of-fit problem where the null hypothesis is composite.
Tests of the independence of X, Y, where one (or both) variables are subject
to censoring include a generalization of Kendall's 7 due to Brown, Hollander
and Korwar (1973) and a generalization of Spearman's p due to Latta (1977b).
Two-sample tests, K-sample tests, goodness-of-fit tests and tests of in-
dependence can also be developed from the general regression methods for
censored data proposed by Cox (1972) in a landmark paper. Competing
regression methods are due to Miller (1976), Buckley and James (1979), and
Koul, Susarla and Van Ryzin (1981). Miller (1981) and Miller and Halpern
(1981) contrast the advantages and disadvantages of the different regression
approaches; approaches that can be used to determine the effects of covariates
on survival. See Miller (1981), Birnbaum (1979), Basu (1984), and Doksum and
Yandell (1984)for concise descriptions of additional techniques for censored data.
References
Aalen, O. O. (1978). Nonparametric inference for a family of counting processes. Ann. Statist. 6,
701-726.
Aggarwal, O. P. (1955). Some minimax invariant procedures for estimating a cumulative dis-
tribution function. Ann. Math. Statist. 26, 450-463.
Andersen, P. K., Borgan, O., Gill, R. and Keiding, N. (1982). Linear nonparametric tests for
comparison of counting processes, with applications to censored survival data. Inter. Statist. Inst.
Rev. 50, 219-258.
Artin, E. (1964). The Gamma Function. Holt, New York.
Barlow, R. E. (1968). Likelihood ratio tests for restricted families. Ann. Math. Statist. 39, 547-560.
Barlow, R. E., Bartholomew, D. J., Bremner, J. M. and Brunk, H. D. (1972). Statistical Inference
Under Order Restrictions. Wiley, New York.
Nonparametric concepts and methods in reliability 651
Barlow, R. E. and Campo, R. (1975). Total time on test processes and applications to failure data
analysis. In: R. E. Barlow, J. Fursell and N. Singpurwalla, eds., Reliability and Fault Tree
Analysis. SIAM, Philadelphia, pp. 451-481.
Barlow, R. E. and Doksum, K. (1972). Isotonic tests for convex orderings. Proc. Sixth Berk. Syrup.
Math. Statist. Prob. 1, 293-323.
Barlow, R. E., Marshall, A. W. and Proschan, F. (1963). Properties of probability distributions with
monotone hazard rate. Ann. Math. Statist. 34, 375-389.
Barlow, R. E. and Proschan, F. (1964). Comparison of replacement policies, and renewal theory
implications. Ann. Math. Statist. 35, 577-589.
Barlow, R. E. and Proschan, F. (1965). Mathematical Theory of Reliability. Wiley, New York.
Barlow, R. E. and Proschan, F. (1966). Inequalities for linear combinations of order statistics from
restricted families. Ann. Math. Statist. 37, 1574-1592.
Barlow, R. E. and Proscban, F. (1969). A note on tests for monotone failure rate based on
incomplete data. Ann. Math. Statist. 40, 595-600.
Barlow, R. E. and Prosehan, F. (1981). Statistical Theory of Reliability and Life Testing. Second
Printing Publisher: To Begin With, 1137 Hornell Drive, Silver Spring, Maryland 20904.
Basu, A. P. (1984). Censored data. Chapter 25 in this volume.
Bergman, B. (1977). Crossings in the total time on test plot. Scand. J. Statist. 4, 171-177.
Bickel, P. J. and Doksum, K. (1969). Tests for monotone failure rate based on normalized spacings.
Ann. Math. Statist. 40, 1216-1235.
Bickei, P. J. and Freedman, D. A. (1981). Some asymptotic theory for the bootstrap. Ann. Statist.
9, 1196-1217.
Bimbanm, Z. W. (1952). Numerical tabulation of the distribution of Kolmogorov's statistic for
finite sample size. J. Amer. Statist. Assoc. 47, 425-441.
Bimbaum, Z. W. (1979). On the Mathematics of Competing Risks. D H E W Publication No. PHS
79-1351.
Bimbaum, Z. W., Esary, J. D. and Marshall, A. W. (1966). Stochastic characterization of wearout
for components and systems. Ann. Math. Statist. 37, 816-825.
Birnbaum, Z. W. and Tingey, F. H. (1951). One-sided confidence contours for probability
distribution functions. Ann. Math. Statist. 7,2, 592-596.
Block, H. W. and Savits, T. H. (1976). The IFRA closure problem. Ann. Prob. 4, 1030-1032.
Block, H. W. and Savits, T. H. (1981). Multivariate classes in reliability theory. Mathematics of
Operations Research 6, 453-461.
Bondesson, L. (1983). On preservation of classes of life distributions under reliability operations:
Some complementary results. Naval Res. Logist. Quarterly 30, 443-447.
Borges, W., Proschan, F. and Rodrigues, J. (1982). A simple test for new better than used in
expectation. Florida State University Department of Statistics Report No. M-604.
Breslow, N. E. (1970). A generalized Kruskal-WaUis test for comparing K samples subject to
unequal patterns of censorship. Biometrika 57, 579-594.
Breslow, N. E. and Crowley, J. (1974). A large sample study of the life table and product limit
estimates under random censorship. Ann. Statist. 2, 437-453.
Brookmeyer, R. and Crowley, J. (1982a). A k-sample median test for censored data. J. Amer.
Statist. Assoc. 77, 433-440
Brookmeyer, R. and Crowley, J. (]982b). A confidence interval for the median survival time.
Biometrics 38, 29-41.
Brown, B. Win. Jr. and Hollander, M. (1977). Statistics: A Biomedical Introduction: Wiley, New
York.
Brown, B. W., Hollander, M. and Korwar, R. M. (1974). Nonparametric tests of independence for
censored data with applications to heart transplant studies. In: F. Proschan and R. J. Serfiing,
eds., Reliability and Biometry. SIAM, Philadelphia, pp. 327-354.
Brown, M. (1981). Further monotonieity properties for specialized renewal processes. Ann. Prob. 9,
891-895.
Brown, M. and Chaganty, N. R. (1983). On the first passage time distribution for a class of Markov
chains. Ann. Probab. 11, 1000-1008.
Bryson, M. C. (1974). Heavy-tailed distributions: Properties and tests. Technometrics 16, 61-68.
652 Miles Hollander and Frank Proschan
Bryson, M. C. and Siddiqui, M. M. (1969). Some criteria for aging. J. Amer. Statist. Assoc. 64,
1472-1483.
Buckley, J. and James, I. (1979). Linear regression with censored data. Biometrika 66, 429-436.
Butler, D. A. (1979a). A complete importance ranking for components of binary coherent systems,
with extensions to multistate systems. Naval Res. Log. Quart. 26, 565-578.
Butler, D. A. (1979b). Bounding the reliability of multistate systems. Stanford University Technical
Report No. 193.
Chert, C. (1981). Correlation-type goodness-of-fit tests for randomly censored data. Stanford
University Division of Biostatistics Technical Report No. 73.
Chen, Y. Y., Hollander, M. and Langberg, N. A. (1982). Small-sample results for the Kaplan-
Meier estimator. J. Amer. Statist. Assoc. 77, 141-144.
Chen, Y. Y., Hollander, M. and Langberg, N. A. (1983a). Testing whether new is better than used
with randomly censored data. Ann. Statist. 11, 267-274.
Chen, Y. Y., Hollander, M. and Langberg, N. A. (1983b). Tests for monotone mean residual life
using randomly censored data. Biometrics 39, 119-127.
Cox, D. R. (1962). Renewal Theory. Methuen, London.
Cox, D. R. (1972). Regression models and life tables (with discussion). J. R. Statist. Soc. B 34,
187-220.
Cox, D. R. and Hinkley, D. V. (1974). Theoretical Statistics, Chapman and Hall, London.
Cs6rg6, S. and Horvhth, L. (1981). On the Koziol-Green model for random censorship. Biometrika
68, 391-401.
Deshpande, J. V. (1983). A class of tests for exponentiality against increasing failure rate average
alternatives. Biometrika 711, 514-518.
Doksum, K. (1972). Decision theory for some nonparametric models. Proc. Sixth. Berk. Syrup.
Math. Statist. Prob. 1, 331-341.
Doksum, K. (1974). Tailfree and neutral random probabilities and their posterior distributions.
Ann. Prob. 2, 183-201.
Doksum, K. and Yandell, B. S. (1984). Tests for exponentiality. Chapter 26 in this volume.
Dvoretzky, A., Kiefer, J. and Wolfowitz, J. (1956). Asymptotic minimax character of the sample
distribution function and of the classical multinomial estimator. Ann. Math. Statist. 27, 642-669.
Dykstra, R. L. and Laud, P. (1981). A Bayesian nonparametric approach to reliability. Ann. Statist.
9, 356-367.
Efron, B. (1967). The two sample problem with censored data. Proc. Filth Berk. Syrup. Math.
Statist. Prob. 4, 831-854.
Efron, B. (1979). Bootstrap methods: Another look at the jackknife. Ann. Statist. 7, 1-26.
Efron, B. and Gong, G. (1983). A leisurely look at the bootstrap, the jackknife and cross-
validation. The American Statistician 37, 36-48.
Efron, B. and Morris, C. (1975). Data analysis using Stein's estimator and its generalizations. J.
Amer. Statist. Assoc. 70, 311-319.
Elandt-Johnson, R. C. and Johnson, N. L. (1980). Survival Models and Data Analysis. Wiley, New
York.
El-Neweihi, E. and Proschan, F. (1980). Multistate reliability models: A survey. Multivariate
Analysis V, 523-541.
Emerson, J. D. (1982). Nonparametric confidence intervals for the median in the presence of right
censoring. Biometrics 38, 17-27.
Esary, J. D., Marshall, A. W. and Proschan, F. (1973). Shock models and wear processes. Ann.
Prob. 1, 627-649.
Fardis, M. N. and Cornell, C. A. (1981). Analysis of coherent multistate systems. I E E E Trans.
Reliability 30, 117-122.
Ferguson, T. S. (1973). A Bayesian analysis of some nonparametric problems. Ann. Statist. 1,
209-230.
Ferguson, T. S. (1974). Prior distributions on spaces of probability measures. Ann. Statist. 2,
615-629.
Ferguson, T. S. and Phadia, E. G. (1979). Bayesian nonparametric estimation based on censored
data. Ann. Statist. 7, 163-186.
Nonparametric concepts and methods in reliability 653
Oakes, D. (1981). Survival times: Aspects of partial likelihood (with discussion). Inter. Statist. Inst.
Rev. 49, 235-264.
Peterson, A. V. (1977). Expressing the Kaplan-Meier estimator as a function of empirical
subsurvival functions. J. Amer. Statist. Assoc. 72, 854-858.
Peto, R. and Peto, J. (1972). Asymptotically efficient rank invariant test procedures (with dis-
cussion). J. R. Statist. Soc. A 135, 185-206.
Phadia, E. G. (1973). Minimax estimation of a cumulative distribution function. Ann. Statist. 1,
1149-1157.
Prentice, R. L. (1978). Linear rank tests with right censored data. Biometrika 65, 167-179.
Proschan, F. (1963). Theoretical explanation of observed decreasing failure rate. Technornetrics 5,
375-383.
Proschan, F. and Hollander, M. (1972). Testing whether new is better than used. Ann. Math.
Statist. 43, 1136-1146.
Proschan, F. and Hollander, M. (1975). Tests for the mean residual life. Biometrika 62, 585-593.
Proschan, F. and Pyke, R. (1967). Tests for nomotone failure rate. Proc. Fifth Berk. Syrup. Math.
Statist. Prob. 3, 293-312.
Read, R. R. (1971). The asymptotic inadmissibility of the sample distribution function. Ann. Math.
Statist. 42, 89-95.
Robbins, H. E. (1964). The empirical Bayes approach to statistical decision problems. Ann. Math.
Statist. 35, 1-20.
Ross, S. M. (1979). Multivalued state component systems. Ann. Prob. 7, 379-383.
Samaniego, F. J. and Boyles, R. A. (1980). Estimating a distribution function when new is better
than used. University of California at Davis Division of Statistics Technical Report No. 22.
Sethuraman, J. and Tiwari, R. C. (1982). Convergence of Dirichlet measures and the interpretation
of their parameters. In: S. S. Gupta and J. O. Berger, eds., Statistical Decision Theory and
Related Topics III, Vol. 2. Academic Press, New York, pp. 305-315.
Singh, K. (1981). On the asymptotic accuracy of Efron's bootstrap. Ann. Statist. 9, 1187-1195.
Stein, C. (1955). Inadmissibility of the usual estimator for the mean of a multivariate normal
distribution. Proc. Third Berk. Symp. Math. Statist. Prob. 1, 197-206.
Susarla, V. and Van Ryzin, J. (1976). Nonparametric Bayesian estimation of survival curves from
incomplete observations. J. Amer. Statist. Assoc. 71, 897-902.
Tarone, R. E. and Ware, J. (1977). On distribution-free tests for equality of survival distributions.
Biometrika 64, 156-160.
Turnbull, B. W. and Weiss, L. (1978). A likelihood ratio statistic for testing goodness of fit with
randomly censored data. Biometrics 34, 367-375.
van Belle, G. (1972). Personal communication.
Wei, L. J. (1980). A generalized Gehan and Gilbert test for paired observations that are subject to
arbitrary right censorship. J. Amer. Statist. Assoc. 75, 634-637.
Wilks, S. S. (1962). Mathematical Statistics. Wiley, New York.
Woolson, R. F. (1981). Rank tests and a one-sample logrank test for comparing observed survival
data to a standard population. Biometrics 37, 687-696.
Yang, G. L. (1978). Estimation of a biometric function. Ann. Statist. 6, 112-116.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 ")~
Elsevier Science Publishers (1984) 657--698
Ulrich M f i l l e r - F u n k
1.1. Generalities
In this article we are going to survey sequential probability ratio tests (SPRT)
and related procedures, tests with power one as well as truncated sequential
tests designed for situations that are somewhat vaguely classified as non-
parametric. There will be no mentioning of life testing problems, which are
dealt with in a separate chapter. As to the basic facts concerning sequential
tests, the reader is referred to B. K. Ghosh 'Sequential Tests of Statistical
Hypotheses' (1970). A complete and detailed treatment of the sequential
nonparametric procedures proposed up to now can be found in a recent
monograph by P. K. Sen 'Sequential Nonparametrics' (1981). The field partly
consists of ad hoc tests, a more or less coherent statistical justification only
existing for sequential rank tests. The latter, therefore, will somewhat pre-
dominate.
Methodically, nonparametric theory largely constitutes a chapter of asymp-
totic statistics and all the more this is true in the sequential case. Accordingly,
the tools employed include functional limit theorems and 'nonlinear' renewal
arguments but are rather different from the ones that shape the exact (finite)
sequential theory. In most cases the validity of these probabilistic limit
theorems rests on a decomposition of the statistics under consideration into an
average (or, more generally, a U-statistic) of independent variates and into a
remainder term approaching zero sufficiently fast. Once a representation of this
kind has been established, the distribution theory required has no longer
anything specifically nonparametric about it but solely hinges on analytic and
probabilistic standard techniques. A closer look at these things would lead us
too far afield. Instead, we shall mostly call for a limit theorem without
specifying exactly the conditions on which it is valid. Incidentally, most of these
tools are comparatively recent and that is why this part of statistics is still in a
state of flux despite its early outset, cf. Noether (1954), Tsao (1954) and
Romani (1956).
657
658 Ulrich Miiller-Funk
dFo
L,,(Fo: Fo)= L.(O)= fl lo(X~), lo(x) = ~ (x).
j=l
where /" and l~ of course, stand for the first and the second log-derivative of 19.
Sequential nonparametric tests 659
From the local point of view,/~, resp. L, are the test statistics to be used in the
one-sided testing problem O = 0 vs. O > 0 (i.e. ~9 =]O, O[, O > 0 ) resp. the
two-sided testing problem O = 0 vs. O ~ 0 (i.e. O =]O___,O[, O__<0 < O) within
this parametric submodel. N o t e that n-X/~, asymptotically behaves like n -1 L"~,
(up to a constant) which is why two-sided tests are usually simply based on L,.
In the remaining parts of this section we are going to list some statistical
hypotheses that have undergone a more intensive study within a sequential
context and to introduce the statistics related to them.
(xl . . . . , x . ) ~ ( r ( x l ) , . . . , y(x.)) (~ ~ r, xj ~ R ~)
In what follows, we shall be concerned with three standard problems that fit
into the foregoing scheme. As it turns out that ~ , is generated by a vector of
ranks in these cases, we shall prefer the more suggestive notations LR,, LR, to
L,, / ~ , etc. Here, the two-sided testing problems require a few extra con-
siderations as it has to be shown that n-IE,R, and n-1/~2, again behave in
essentially the same way and hence that the commonly proposed tests employ-
ing/~R, are asymptotically justifiable. In order not to overburden the exposition
we Shall mainly deal with one-sided alternatives only and refer the reader
to Miiller-Funk et al. (1983) for some comments concerning the two-sided
case.
662 UlrichMailer-Funk
The classes Fo(y, z) = {~o(G(y), H(z)), 0 <~ 0 < O}, G, H E ~1, will serve our
purposes. On the atoms (Rr~ . . . . . R ~ , R , Z , . . . , R,Z) = (r~. . . . . r,, S~. . . . . s,)
the p.r. and its slope take on the form
n
L,(o) = (1 + (1.1o)
~.(t) = 2~o.jllj-L,jl(nt),
j=l
~ 0 < t < 1, (4,~j)l~j.. cW,
j=l
(1.12)
Tn~ Y z
~9 lnR ni @ 2nR ni
j=l
In what follows, we shall restrict our attention to either of the following types
of scores:
For a smooth q~, it does not make any difference asymptotically, which of the
two is employed. If ~ is nondecreasing and if its Lebesgue integral over the
unit interval is zero, then 0~(.) reflects the stochastic ordering by means of
which we introduced the hypotheses, i.e.
Next, let us come to some fundamental results concerning the above quantities.
Although we shall state some of them a bit informally, we like to put them into
the shape of theorems.
where dK(F1, F2) = supxlFl(X)- F2(x)[ denotes the Kolmogorov distance between
d.f.
664 Ulrich Miiller-Funk
The above theorem generalizes SLLN due to Sen (1970), Hajek (1974), Sen
and Ghosh (cf. Sen, 1981, p. 120), Rieder (1981, 1982), etc. It will be useful in
connexion with a well-known inequality obtained by Dvoretzky et al. (1956):
For some d 1>0,
This result makes it easy to establish the invariance principle for linear rank
statistics under the null-hypothesis and alternatives close to it. Moreover, Sen
and Ghosh (1972), (1973b), (1974b) used it to prove strong embedding
theorems which (slightly modified) read as follows: If is normalized in the
sense L2, then (switching to a new basic probability space if necessary) there
are i.i.d, standard normal variables W1, WE. . . . SO that under ~)0 a.s.
T, - ~ Wj = o(c,) (1.16)
j=l
where the growth of the positive constants c, depends on the score function.
For a broad class of q~ (including the inverse ~-1 of the standard normal d.f.)
we may choose c, = (n log log n) ~/2. In that case, (1.16) implies the LIL under
the null-hypothesis.
Invariance principles under arbitrary, fixed d.f. F @ 91 (resp. E 92) were
obtained by various authors using different methods. We refer to Sen (1981),
Chapters 4, 5, 6 for references. Here, we are going to reproduce a somewhat
stronger assertion (requiring more restrictive assumptions on the score func-
tions q~), cf. Sen and Ghosh (1971), (1973a), Lai (1975a), Miiller-Funk (1979),
B6nner et al. (1980), Sen (1981, p. 135, 164) for proofs in each of the three
cases.
so that for all K > 1 there are 0 < y = T(K)<2 -1, n(K)E N for which
The formulation of this result requires further comments. Firstly, the func-
tions hF in question are not just any but quite specific ones and we dropped
their definition for the mere sake of space. Secondly, the regularity conditions
referred to above are fulfilled for any ~0 with a continuous second derivative so
that ~p resp. ', ~p" do not increase faster near zero and one than ~-a and its
first two derivatives. Especially, the normal scores statistics are included. The
foregoing Theorem yields the invariance principle as well as the LIL. Over and
above that it will be our main tool in obtaining approximations to average
sample numbers (ASN). For that purpose, the uniformity statement in the
above result turns out to be crucial.
In the one- and the two-sample location model it is more convenient to vary
the statistics by a translation parameter than the distributions, that is to make
use of the identity ~ o ( T ) = 9~o(T(O)), where T,(O) are the linear rank statistics
based on the shifted observations Xj + 0 resp. (Yj, Zj + O). When dealing with
local alternatives, 7",(0) can be expanded around 0 = 0. To make this more
precise, we associate the following quantities with every d.f. F which possesses
an absolutely continuous Lebesgue density f and a finite Fisher information
I(F):
I(F) = Ja f'2(x)/f(x) dx,
Asymptotic linearity results were first proved by Jure~kovfi (1969) and van
Eeden (1972), who showed that (D,),=I tends to zero in probability. Almost
sure versions are due to Sen and Ghosh (1971), Jure~kovfi (1973) and Sen
(1980). At the same time, van Eeden's paper supplements earlier work by
Lehmann (1966) concerning orderings of vectors of ranks. In case of a non-
decreasing score function, it can be drawn from these sources that
i.e. T, is stochastically larger under the alternative than under the null-
situation.
uniform distribution over a finite set of points is in the domain of v(-), i.e. that
V, = v(~',) is always defined. Finally, we require that this functional is con-
tinuous in the sense that V = (V,),~I is a strongly consistent estimator for u(F)
in case F is the true d.f. It has been common usage for a long time to turn
estimators into test statistics for the parameter involved. In order to frame a
pair of hypotheses we fix some v0 in the range of v(.) and put, for instance,
(In the latter case, of course, we have to assume that d = 1.) The pertaining
V-statistics comprise statistics of the Cram6r-von Mises-type in the first case and
certain linear combinations of order statistics (in short: L-statistics) in the second
one. AS for the general theory of all these sorts of statistics we refer the reader to
Puri and Sen (1971), Serfling (1980) and Sen (1981). As almost all statistics of
interest can be made arbitrarily close to a U-statistic we shall somewhat
concentrate on that type of statistics. Let us recall those structural results
concerning U-statistics and their V-statistics counterparts that will turn out to be
crucial in the course of the following discussion of sequential tests. To begin with,
(Un, ~ , ) ~ p is a reversed martingale and a.s. converges towards v(F) under F.
Next, we have
THEOREM D (Hoeffding). Suppose that v(.) = Vg(.) is regular and that F is in the
domain of u(.). Then
(a) There are U-statistics Uo,) corresponding to kernels gi (depending on F) of
length j, 1 <~j ~ p, so that
Sequential nonparametric tests 667
The martingale property and the Hoeffding decomposition (1.20) are the
main tools for proving limit theorems (CLT, invariance principle, LIL) and
inequalities for U-statistics. For instance,
EFI U. - p U ~ ) - . ( F ) [ 2k = O ( n - 2 0 . (1.21)
PF(SUp
k>~n
IU . - v(F)l > e) = O(n -2k+1) VE > 0 . (1.22)
As for an analogue to Theorem C for that type of statistics confer Sen (1977a)
or Govindarajulu and Mason (1980).
Apart from a trivial modification, linear rank statistics, too, can be regarded
as V-statistics. In fact
7". = ( 1 + + 1)).
where F0 is a fixed d.f. In this particular case and if d = 1 the testing problem
(1.19) simply boils down to
This problem as well as the two-sample problem (1.3) will be treated, accord-
ingly, by means of the Kolmogorov-Smirnov type statistics K m and K (2),
respectively, where
2. N o n p a r a m e t r i c Waid tests
(inf 0 = 0). Such a test, of course, is nothing else but a familiar SPRT, if
Q. = log L. is chosen to be the log-p.r, corresponding to a fixed pair of
Sequential nonparametric tests 669
2.1. R a n k S P R T
Let us look at any of the testing problems described in Section 1.2. We fix
alternatives F,. ~ (91, i = 0, 1, and denote the corresponding p.r. again by
L , (FI : Fo), LR, (F1 : Fo) = Eo(L, (171: Fo) [ ~,). A rank SPRT (N~, 6~), i.e. a Wald
test built upon Oh = log LR,(Fa:Fo), and is but a special case of an invariant
SPRT, for which various results are available in the literature. Savage and
Savage (1965), for instance, stated sufficient conditions that ensure the finite-
ness of the stopping times. Wijsman (1977a, 1977b) and Lai (1975b, c)
examined the properties of these random times more closely. We shall not
reproduce their findings but refer the reader to Wijsman's (1979) excellent
survey paper on the subject. Little can be said on behalf of the other questions
raised above without appealing to some kind of asymptotics. At least Wald's
approximations to the stopping bounds remain valid, i.e.
where a and /3 are the error probabilities under F0 resp. F1. The Wald-
Wolfowitz theorem is no longer in force but Eisenberg et al. (1976) established
some sort of weak admissibility. Perhaps the most convincing argument sup-
porting the use of a rank SPRT, however, is asymptotic in nature. First, we are
going to quote a somewhat stripped version of a general result due to Lai
(1981). Let (~ = (~,),~1 be any filtration contained in 2[ and denote by 5E(o~,/3)
the class of ~-measurable sequential tests (N, 6) such that
THEOREMF. For 0 < a, [3, ot + [3 < 1, let b = b(a, [3) < a = a(a, [3) be chosen so
that (2.3) holds for (N~(b, a),6~(b, a)), Qn log LeA, and that, moreover,
670 Ulrich Miiller-Funk
Suppose there are finite constants Io < 0 < 11 such that for all ~ > 0
under F1 (in the sense of 1-quick convergence). Here, suitably selected means
that F0 minimizes the Kullback-Leibler numbers K - L(F1 : .) over ~0. Besides,
Ia coincides with K - L(FI:Fo), i.e.
on ]0, l[(d).) Hence the optimality of the classical SPRT can be extended to
composite hypotheses, at least asymptotically, if L, is replaced by LR,
throughout. The entire approach, however, suffers from a serious drawback.
There are hardly any interesting classes of alternatives that lead to simple
expressions for logLn,,. The examples to come and two related problems
treated by Govindarajulu (1975, pp. 281,283) seem to comprise all rank-SPRT
which have been investigated in detail.
We are going to take up the two-sample testing problem specified in Section
1.2. For this problem, Wilcoxon et al. (1963) first proposed a SPRT based on
ranks. These authors treated grouped data and ranking within groups, a topic
we shall turn to only later on. Savage and Sethuraman (1966) suggested a
rank-SPRT within the meaning of the present paper, i.e. a SPRT which makes
more effective use of the data by a complete reranking of the observations at
each stage. Their procedure was further investigated by Sethuraman (1970),
Savage and Sethuraman (1972), Govindarajulu (1975) among others. Research
concerning generalizations of Stein's lemma partly originated from the Savage-
Sethuraman paper. In all of the afore-mentioned articles the p.r. are built upon
g)0 and Lehmann alternatives 9l(rl),
These alternatives lack intuitive meaning but are popular for their analytical
tractability. It has been pointed out that both ~)0 and ~R('q) turn into simple
hypotheses after the reduction by invariance. With those alternatives, (1.7)
becomes
(1+ ,q)"(2n)! ~ 1
LRn(,q ) = n 2n 11 W . ( Y j ) W . ( Z j ) ~ n-1 log LR,,('q)
j=l
n
= log(4(1 + 'q))- 2 - n - 1 Z {log W.(Yj)+ log W.(Zj)} + O(n -I log n),
j=l (2.7)
To be in agreement with our previous notation we write Io(rl ) resp. I~(71) instead of
1(7/I F) if F ~ 6o resp. F E ~(~1).
(i) (Savage and Sethuraman, 1966). For all F ~ ~21 t3 g)o for which I(rl [ F) = O,
there exists some 0 < ~ < 1 so that for On = log Lgn(r/) and all n sufficiently large:
PF(N~(b, a) > n) < ~-n ('N~(b, a) exponentially bounded').
(ii) (confer Berk and Savage, 1968). For all Fo E ~)o, F1E ~R(r/):
n -1 log Lnn(r/)--> Ii(r/) exponentially fast under F1.
A refined version of part (i) appears in Sethuraman (1970) and Wijsman (1979).
Part (ii) is but a special case of the main result in the Berk-Savage (1968) paper
which deals with a broad class of nonparametric alternatives.
The validity of the functional limit theorem and other probabilistic state-
ments concerning log LR,(r/) can be drawn from Lai (1975a). In a recent paper
Woodroofe (1982a) obtained a Chernoff-Savage theorem that allows for an
application of the nonlinear renewal theory developed by Lai and Siegmund
(1977, 1979). This approach yields refined approximations to both error prob-
abilities.
In many practical situations, observations only become available in groups or
the evaluation of an item requires such an effort of time that grouping seems
advisable. Two SPRT based on ranks were proposed for experiments wherein
groups of m observations are taken sequentially. The test presented by
Wilcoxon et al. (1983) is based on E n1 lOg LRn(rl)
(k) (k)
, where tRm(rl) is the p.r.
computed from the k-th group. The statistics (L~)m(~l))k>~lform an i.i.d, sequence
whence all interesting features of this test can be drawn from the standard
theory. As this device merely joins together independent experiments but
neglects the information that can be gained by comparing observations from
different groups, it is suspected to be somewhat inefficient. In order to meet
this objection, Bradley et al. (1966) suggested to maintain the above sampling
scheme but to rerank the whole data collected once a new group of obser-
vations is obtained, i.e. suggested the use of Wald tests based on (10g LRnm),>~l.
These authors, too, assume Lehmann alternatives and mention some elemen-
tary properties of their procedure (a.s. termination, adequacy of the Wald
approximations (2.2)). It should be added that these papers also treat samples
of Y- and Z-observations which are not necessarily of the same size.
The discussion of the one-sample problem for testing symmetry is somewhat
less complete but largely parallels the one of the two-sample case. For that
reason we shall not enter into details but refer the reader to Weed et al. (1974)
and Weed and Bradley (1971). Choi (1973) considered the corresponding
sequential tests for independence.
rank-p.r., which are awkward to handle for almost all classes of alternatives, we
can employ their slopes, i.e. linear rank statistics. It is easy to conjecture that
the resulting tests will asymptotically enjoy some kind of optimum property
within a local approach. To corroborate this and, at the same time, to obtain
approximations to OC and ASN, we shall rely on an invariance principle. This
tool as well as other results needed to answer the questions posed at the
beginning are valid, however, for many other statistics, too. Accordingly, it
seems economical first to discuss properties of Wald tests based on more
general statistics O = (0,),I>1 and subsequently to turn to those aspects that are
characteristic of linear rank statistics resp. U/V-statistics.
Termination properties of Wald tests. We shall mainly come across statistics Q
that not only obey the SLLN, i.e. for all F E g)0 tO ~1 there is some/z (F) E so
that for all e > 0
but for which, in addition, some information concerning the rate of this
convergence is available. In case I x ( F ) ~ 0, the a.s. finiteness (integrability,
exponentially boundedness) of N~(b, a) can be concluded from this by means
of some crude bounds. If, for instance, p1:,(e)= O(n-r), then
(2.8)
<~pF,(e) (IIX(F)I> e > 0 , n large)
implies the existence of all moments EF(N~(b, a)) s, s < r. In case IX(F) = 0 the
behaviour of O is reminiscent of that of a recurrent random walk. In fact, we
shall reduce this case to the discussion of the corresponding situation known
from the classical SPRT when the log-p.r, becomes driftless under the 'excep-
tional' parameter point. T o this end, we assume that for these F ~ ~90 tA ~1 we can
find statistics D F = (DFn)n~l and a random walk S F ~- ( S F n ) n ~ l , EF(SF1 ) = O, SO
that, for some 0 < y ~< t,
Fix k t> 1 (to be specified later on). With every n ~> 1 we associate the numbers
nj = [jk-~n], 1 <~j <~k ([-] denoting the interger part). Then,
674 Ulrich Miiller-Funk
for every n > 2k. These estimates make it easy to derive sufficient conditions
for the finiteness of N~) and its moments. Essentially following Lai (1975b) we
arrive at
then
EF((N~(b, a))') < ~ .
We dwelled a little bit on details because the argument used above is quite
typical of the way in which classical results are carried over to a nonparametric
set up. Mutatis mutandis, this remark applies as well to the asymptotic theory
to come.
Local approximations to OC and ASN. Let us briefly recapitulate the neces-
sary distribution theory. For every A 0 > A > 0 , let Fa, P~ be p.m. and let
Sa = (Sa,),~l be the partial sums of variables that are independent under both
F~ and Fa. With Sa we associate the Donsker processes 5P~(.) which result from
linearly interpolating the values ASa, at the epochs /tEn, n/> 0 (Sa0------0). (The
definition of these processes is, in essence, a matter of scaling and depends on
the nature in which Fa, Pa are parametrized. We are going to deal exclusively
with that case that has been labelled the 'regular' one.) Processes ~a(-) are
defined in a like manner.
Sequential nonparametric tests 675
Under contiguity, (2.11) is valid with Pa instead of /5a. The foregoing dis-
cussion, therefore, indicates that some ~ (. I ~', 1) will emerge as a limit process
of ~A('). If ff'a ---F0 and if Q is a (reversed) martingale sequence under F0, then
an even simpler derivation of that limiting behaviour is possible; confer Sen
(1975). In this case, the tightness of E0((~a(t))0~,~r) is implied by the con-
vergence of the finite dimensional marginals (Brown, 1971; Loynes, 1970) and,
accordingly, it suffices to verify that, for all 0 < tl < < tk and e > 0,
Such weak convergence results under local alternatives form the basis of
approximations to OC and of Sen and Ghosh's (1980) approach to the
sequential asymptotic relative Pitman efficiency (ARPE). In order to obtain the
approximate ASN under Fa as well, we need a stronger result (requiring more
restrictive assumptions). In varying forms, it has been known for a long time.
To be more precise, we select some F0@ g)0 together with alternatives
{FA: A0 > A > 0}. The formulation of the invariance principle below calls for
Borel functions ha(') so that for some constant c > 0:
Put SA, = ~' ha (Xj). Next, the Wald test (N~(b, a), 6~(b, a)) is turned into an
asymptotic sequential test by setting N ~ A ) = N ~ ( b A -1, aA-~), and 6 ~ ( d ) =
1As for the definitionand basic facts, confer H~ijekand Sid~ik(1967, p. 202).
676 Ulrich Miiller-Funk
THEOREM I. Suppose that 52a(Xx)~ 520(X1) in total variation, that (2.12) holds
true, and that there are constants d > O, > 3~> O, K > 2 so that
Then, as A ~ O,
where the initial sample size n0(A) tends to infinity, but where A2n0(A)
converges to zero. Sen (1981, p. 258, 267) essentially proved
THEOREM I'. Suppose that all assumptions of Theorem I are satisfied except that
in (2.14) we replace On by DnOn, where D (Dn)n~l are (92-measurable)
=
statistics for which some K' > 1 exists so that for all e > 0 there are constants
d' > O, nl >i 1 for which
Then the assertion of Theorem I holds true for No(A) instead of N~(A).
Sequential nonparametric tests 677
We shall not reproduce the familiar formulas for the statistically relevant
limiting expressions
Xb,a(~, 0"2) = P(~(7"* ] ~, 0-2) ~- b), Ab,a(~, 0"2) = E("/'*(~ (" I if, '2))),
siderations suggest to look for the following kind of expansion under F0:
n
~an(t)=EQ~=llan(Xi)[~")'
= t = A2n'
n
-- E0(1-I(l+
-j=l
)h
= (1+ naE0(i(N) [ .) + . . . )
j=l
-~ exp(r/A ~
j=l
Eo([(Xj)I (S,)-~12A2nEo(n-l~/'2(Xj)
j=l
I~,)/2)
the nonsequential theory in order to relate the limiting shift of the test statistics
to the ratios of sample sizes. We shall not touch on the more delicate question
of comparing sequential tests that have different types of stopping boundaries.
For the difficulties connected with that confer Berk (1975b, 1976) and the
references given there. Another topic that is left out is the problem how to
obtain correction terms for the foregoing approximations or how to ensure
second (higher) order properties of nonparametric Wald tests. So far, nobody
seems to have made efforts in that direction.
Asymptotics under a fixed distribution. In case O behaves like a random
walk with mean zero and variance o-2(F) per observation, Lai (1975b) deter-
mined the limit of N~(b, a) and its moments as rain{a, Ibl} tends to infinity and
Ibl(a + lbl) -~ approaches 0 < w < 1. Omitting regularity conditions, his result
reads as follows:
bounded for every F E 9)1 for which O~(F) is strictly positive. In fact, for all
e > 0 and n large enough,
We know from (1.18), that ~Sh(T, ) is stochastically larger than Ep0(T,)= ~o0(T,)
for every choice of F0 E 00 and F1 E 9)1 and all n. With the help of an i.i.d, sample
drawn from the rectangular d.f. we can construct random sequences T o =
(T~)),~I, i -- 0, 1, so that (i) E(T )) and 9.F~(T) are equal and (ii) T () ~ T ) (in each
component). Because of
moreover, so that (i) 4'a --> 4'o in mean square, (ii) zl sup{14'a(t)l: 1 > t > 0}<~ 1,
(iii) A 2 sup{14'a(t)l: 1 > t > 0} = o(1) as A $ 0.
Fix F0 E (90, r / > 0 and put ~an(dt) = (1 + r/A4'a(t)) dt, Fan = ~an(F0). Let han
be the corresponding Borel functions in the Chernoff-Savage representation.
We shall find it convenient to express the score function in the form ~o(t)=
~00((t + 1)/2), where ~0 is again skew. Somewhat tedious calculations show that
(2.12) is fulfilled with ~"= r/p and o-2= 1, where
1
p = p(~oo, 4'0) =
f0 o(t)4'o(t) dt E [-1, 1]. (2.18)
the use of the statistics Ix(Fo)T~,. The natural thing to do, of course, is to
replace/x (F0) by an estimate (/2.),~, which meets the requirements of Theorem
I'. Sen and Ghosh (1974a) proposed the Hodges-Lehmann type estimator
_ 2 Un,a/2
n n(Ou,.-Ot,.)' n> l,
0.,. = inf{O: T.(O) > -u.,./2}, 0l,. = sup{O: T.(O) > u.,~/e}
where n-1/2Un,~/2 tends to the upper a/2-point of the standard normal dis-
tribution. The demonstration that (~,)n~l satisfies all conditions required by
Theorem I' seems to be rather laborious, however; confer Sen (1981; p. 265)
for a sketch of proof. Under alternatives F0("- ~A), the limiting formulas for
OC and ASN again become Xb,a('t-Itz(Fo), 1) resp. Ab,a(rltz(Fo), 1) if (fi,T,,),_-l is
employed. Let ~00be defined as in (1.5) and, as before, denote its right-hand tail
by 4,. The above rank procedure is (weakly) asymptotically LMP with respect
to this model in case q~ = q,. Here, of course, the starting point F0 matters.
(2) Wald tests modelled on the SPRT for H0 against H~ (cf. (2.16)) can be
treated in a similar fashion. If such SPRT-type tests are constructed with the
help of Zi ( T , - nA/2), the changes that have to be made go without saying. In
the one-sample location model, (2.17) and the asymptotic linearity result would
suggest the use of the quantiy
It has been pointed out earlier that (A1), i.e. the Chernoff-Savage represen-
tation, is valid on regularity conditions (cf. the references in 1.2). Under
suitable assumptions, moreover, the natural candidate for 6-2, and that is
o-206,), is strongly consistent as required by (A2). In case of a regular func-
tional, both (A1) and (A2) can easily be verified by means of Theorems D and
E. That goes for (A2) as well because the limiting variance of U/V-statistics is
itself a regular functional as observed by Sen (1960) and Sproule (1969). Sen
proposed the following estimator: Let U,j be the 'U-statistic corresponding to
the kernel &(yl . . . . . Yj-1)= g(Xj, Yl . . . . ,Y i-l) and the random sample
X1. . . . . Xj_I, Xj+I, . . . , X,' and put
In this case we can define /], in analogy to 9,. With regard to Theorem D(b)
both, of course, are asymptotically equivalent.
684 Ulrich Miiller-Funk
and the decision rule ~ o ( a ) = (O,{]Qo(a)= n}),~l. In other words, the test
decides in favor of g)l whenever sampling stops and is subject to the conditions
where log2 t = log((log t)+). In this case we can choose the stopping curve in
such a way that a(t) ~ r(t log2 t) m, r > c, and such that
If the convergence in (3.4) is uniform with respect to F E 60, then a(-) together
with a____suitablyselected no induces a TWP 1. In all cases of interest, moreover,
the lira in (3.2) is actually equal to c [P~] for some particular F ~ g)0. If this
happens, a boundary leading to a TWP 1 cannot tend to infinity at a rate less
than (n log2 n) 1/2, vice versa. Accordingly, this rate is the best one can strive for
in order to render the ASN under g)l small. The foregoing remarks, however,
only enable us to carry out the test in practice if we can derive explicit bounds
686 Ulrich Miiller-Funk
for q (m). Such 'iterated logarithm inequalities' for the sample mean were derived
by Darling and Robbins (1967a, 1967b). If O is a (reversed) martin-
gale, then one can try to mimic the technique in the last mentioned paper, i.e.
to apply the C h o w - H a j e k - R e n y i inequality to appropriate blocks of statistics
(2),. The best we can hope for with that device, however, is a boundary which
increases to infinity at some rate tl/2(log t) K and which, accordingly, leads to a
test with a comparatively large stopping variable. Corresponding inequalities
for Brownian motion were obtained by Robbins (1970), Robbins and Siegmund
(1970, 1973). Hence it is near at hand to shift the problem to the limit by means
of a suitable invariance principle. If the (2), happen to be sums of i.i.d, variates,
the necessary limit theorems can be found in Robbins and Siegmund (1970) as
well as in Lai (1976b). Extensions to these results to 'disturbed' random walks
yield TWP 1 that approximately fulfill the first relation in (3.1). Technically, we
can handle the oncoming remainder terms and random coefficients by the same
method that has been employed in the discussion of the termination properties
of Wald tests in Subsection 2.2. Proceeding that way, we arrive at the following
result, which is but a corollary to Lai (1976b, Theorem 5).
THEOREM J. Fix F Eg)o and suppose that there are i.i.d, variables
W1, WE. . . . . EF(W2)= 1 and EF(W1) = O, and statistics D = (D,)n~l (both
depending on F ) so that for some 0 < y <
DnOn - ]~ Wj = o ( n r) [PF].
j=l
(ii) I f 0 < Dn ~ 1 [PF] and if there is some 0 < d < 1 so that (3.5) is fulfilled
with da(.) instead of a(.), then (3.6) remains true.
Suitable rate (t log2 01/2 boundaries were made fairly explicit by Robbins and
Siegmund (1970, 1973).
Truncated versions: Nonparametric repeated significance tests. Armitage
being concerned with medical trials introduced certain sequential three-
SequenKal nonparametric tes~ 687
decision procedures into the statistical literature; confer his monograph (1975)
for a more recent account. The related two-decision procedures have been
termed repeated significance tests (RST). In short, their procession can be
described as follows. A target sample size m is specified and the incoming
observations are constantly scrutinized. Sampling stops and the null-hypothesis
is rejected as soon as the accumulated data shows enough evidence to do so. If
this does not happen to be the case up to (and including) time m, then we stick
to S)0. Tests of this kind were proposed in an ad hoc fashion and motivated on
ethical grounds as well as on practical considerations.
Armitage only allowed for models involving the binomial or the normal
distribution. The first model, of course, corresponds to a crude 0-1
classification of the data by the experimenter while the second one actually
requires that the observations can be measured on a physical scale. In order to
permit an assessment of the data that is somewhere between these extremes, it
is near at hand to use ranks. Miller (1970) started research on such RST by
proposing a test based on the Wilcoxon statistics. A full account of further
developments in this area is contained in a survey paper by Sen (1978).
Formally, a RST is but a TWP 1 truncated at the time m. More precisely, it is
determined by a stopping time min IVo(a), m and a terminal decision rule
(A,,B,)=(O,{No(a)=n}) if n<m and (Am, Bm)=({filo(a)>m},
{No(a)= m}), where No(a ) is defined as before but where a(-) is now
subject to the requirement that
Finiteness of the ASN, integrability of the stopping rules 1Vo(a). Obviously, the
answer to the problem becomes trivial in the case of a RST and turns out to be
negative under the null-hypothesis for a TWP 1. It remains to derive conditions
which ensure the finiteness of the moments of lQo(a) under an alternative F
satisfying (3.3). No general result seems to exist that embraces all cases of
interest. Many examples, however, are covered by the following device. Sup-
pose that a(t) <~rt 1-q for all t sufficiently large, r > 0, 1 > q i> 0. For the time
being, moreover, we assume that O, = nU,, n ) p , where U = (U,),~p is again
a sequence of U-statistics corresponding to some kernel g. Fix 1 > to > 0, e > 0
and argue as follows:
688 Ulrich M i i l l e r - F u n k
const, n -2k(1-q) ,
provided g possesses moments of order 2k. Putting Sj = j(pU} 1)+ v(F)), c(t)=
(r + e)t l-q, the first summand in (3.9) becomes
According to Gut (1974), 1Qs(C) has moments up to order 2k. Multiplying across
(3.9) with n s-l, 0 < s < 2k(1 - q) - 1, and adding up, we realize that EF(No(a)) s
is finite. Because of Theorem D(b), the above reasoning extends to V-statistics.
After that we can move on to statistics that can be bounded or approximated
from below by U/V-statistics. This is true, for instance, for L-statistics with a
smooth weight function, confer Helmers (1981), or for signed linear rank
statistics with an increasing, convex score function. As for the latter case,
simply note that
where the first summand is essentially a V-statistic and where the second term
is bounded.
Asymptotic OC and ASN, the limiting distribution of fifo(a). First let us
consider horizontal barriers A-la the convenience of which has been men-
tioned in connexion with RST; confer (3.8). In this case, approximate expres-
sions for OC with respect to local alternatives {Fa : A0 > A > 0} starting from
some FoE 60 are provided by the well-known Bartlett formula if Q again
behaves like a linearly shifted Brownian motion, ~ (-] ~', 1) say. Expressed in
formulas,
Sequential nonparametric tests 689
~-
fo"
P(l~l(sl#,l)<a, VO~s~u)du.
THEOREM K. Fix a d.f. F ~ ~1. Assume that there are constants c > 0, 0 ~<p < 1,
# = / ~ ( F ) > 0 as well as a real-valued continuous function q(.) vanishing at
infinity so that, as n ~ 0% A ~ 0,
where M (.) = MF(') is a centered Gaussian process almost all sample paths of which
are supposed to be continuous. Let b(.) be an asymptotic inverse of t/a(t) and set
nr = b(r/tz ), r > 0 . Then, as r -- ~,
Asymptotic expressions for the ASN are easy to conjecture. To this end,
suppose again that a(t) = tPl(t), 0 ~<p < 1, w h e r e l(.) is slowly varying, and that
b(t) is an asymptotic inverse of t/a(t). When applied to the positively drifting
sequence O, the SLLN and the renewal argument at which we already alluded
690 Ulrich Miiller-Funk
in Section 2.2 imply that 1Vo(ra)-b(r/iz(F)) PF-a.s. This fact suggests the
heuristic formula
Ev(1Vb(ra))- b'(r/tx(F)) as r ~ o o .
The uniform integrability of the stopping times that is necessary for its validity
remains to be checked in every special case. No such condition is required, of
course, if we deal with the truncated random times min{IVo(ra), mr}, rnr =
mr(A)= b(r/A), A > 0 . In this case, as r T oo (and A remains fixed),
If we consider two different sequences of such test statistics, O and R say, then
Tests with power one. TWP 1 for ~)0 versus ~)l(q~) based on T~ were first
considered by Sen and Ghosh (1973b) in the univariate one-sample problem.
These authors used a strong embedding theorem instead of Theorem J in order
to specify suitable boundaries; confer (1.16). (In the present case, this device is
more convenient as only milder conditions have to be imposed.) The two-
sample as well as the independence case can be dealt with along the same lines,
Sequential nonparametric tests 691
cf. Sen (1981, p. 241), or with the help of Theorem J. So far, most efforts have
been concentrated on the determination of the quantities a(.), no, but little has
been done otherwise.
Repeated significance tests. Research on rank RST was initiated by Miller
(1970) and pursued by Lombard (1976, 1977), Sen (1977b) and others. The
unbiasedness of these procedures can be shown in the same manner as with
Wald tests in Section 2.2. As the invariance principle holds true under g)0 and
alternatives close to it, it is clear that (3.10) is valid with the familiar shift
parameters ~'. If, for example, we consider nonparametric alternatives dFan =
(1 + ~TAOa(F0)), F0 ~ ~0, as in the preceding sections, then ~ takes on the form
~"= r/p(q~0, ~O0), etc. By means of Theorems C and K and the previous dis-
cussion, we obtain the efficiency numbers (O~2(F)/O~a(F))I-Pfor any two RST
based on linear rank statistics using score functions ~Pl and q~z, respectively.
Alternatively, we can compare these tests with RST based on sequential rank
statistics. The lattei" were first proposed by Reynolds (1975), who Considered
the Wilcoxon type. Later on, statistics of this form were employed by M/iller-
Funk (1980) and Lombard (1981); confer also Mason (1981). In the univariate
one-sample case, which we are going to treat again by way of example, the
general form of these statistics is
No boundary increasing at a rate essentially slower than (t log 01/2 comes within
the range of such curves. No attempts seem to have been made so far to arrive
at better boundaries by other methods. On the other hand, Darling and
Robbins were able to derive explicit upper bounds for the ASN.
Burdick (1973) tackled the one-sample problem for testing symmetry by a
method that is similar to the one used by Darling and Robbins. His test
procedure is based on the quantity sup{IM+~(x)-M;(x)l: x e R}, where M+~(x)
resp. M ; ( x ) is the number of positive resp. negative observations among
X1 . . . . , Xn the absolute value of which does not exceed x.
Repeated significance tests of Kolmogorov-Smirnov type do not seem to
have been investigated in detail up to now.
to be the sole specimen of this kind in the literature. H e deals with the
two-sample case and assumes Lehmann alternatives, i.e. F ( x , y ) = J ( x ) J ~ ( y ) ,
0 < ~7 < ~. In contrast to Section 2.1, however, the parameter ~7 is no longer
considered to be known but taken into account by means of the rank ML-
estimator. Asymptotic properties of this test procedure are determined (under
a fixed alternative).
S o m e related procedures. We mentioned earlier that the model underlying a
TWP 1 is meant to describe a type of quality control problem. Sometimes,
however, the sequential detection (disruption) problem more realistically
reflects the situation. To formulate it in mathematical terms, let X1, X2 . . . . be a
sequence of independent r.v. with d.f. F1, F 2 . . . . , F / ~ ~1. Consider the pair of
statistical hypotheses
References
Ahmad, R. and AI-Mutair, M. A. (1979). The univariate and multivariate k-sample problem: A
nonparametric sequential contrast method approach. Trans. 8-th Prague Conference on In-
formation Theory, vol. C, Reidel, Dordrecht, pp. 17-29.
Armitage, P. (1975). Sequential Medical Trials. Blackwell, Oxford.
Bartlett, M. S. (1946). The large sample theory of sequential tests. Proc. Cambr. Phil. Soc. 42,
239-244.
Bahadur, R. R. and Raghavachari, M. (1972). Some asymptotic properties of likelihood ratios on
general sample spaces. Proc. VI-th Berkeley Symp. Math. Stat. Prob.
Bandyopadhyay, U. (1980). Semi-sequential nonparametric tests for two populations with one
sample fixed. Calcutta Stat. Ass. Bull. 29, 45--64. Correction: Calcutta Stat. Ass. Bull 30, 95.
Behnen, K. (1972). A characterization of certain rank-order tests with bounds for the asymptotic
relative efficiency. Ann. Math. Stat. 43, 1122-1135.
Berk, R. H. (1973). Some asymptotic aspects of sequential analysis. Ann. Stat. 1, 1126--1138.
Berk, R. H. (1975a). Locally most powerful sequential tests. Ann. Stat. 3, 373-381.
Berk, R. H. (1975b). Comparing sequential and non-sequential tests. Ann. Stat. 3, 991-996.
Berk, R. H. (1976). Asymptotic efficiencies of sequential tests. Ann. Stat. 4, 891-911.
Berk, R. H. and Savage, I. R. (1968). The information in a rank order and stopping the stopping
time of some associated SPRT's. Ann. Math. Stat. 39, 1661-1674.
Bhattacharya, P. K. and Mallik, A. (1973). Asymptotic normality of the stopping times of some
sequential procedures. Ann. Stat. 1, 1203-1211.
Bhattacharya, P. K., Frierson, D. (1981). A nonparametric control chart for detecting small
disorders. Ann. Stat. 9, 544-554.
BSnner, N., Miiller-Funk, U. and Witting, H. (1980). A Chernoff-Savage representation for
correlation rank statistics with applications to sequential tests. In: I. M. Chakarvarti ed., Asymp-
totic Theory of Statistical Tests and Estimation. Academic Press, New York.
Bradley, R. A. (1967). Topics in rank-order statistics. Proc. V-th Berkeley Syrup. Math. Stat. Prob.
Bradley, R. A., Martin, D. C. and Wilcoxon, F. (1965). Sequential rank tests. I. Monte Carlo
studies in the two-sample case. Technometrics 7, 463-484.
Bradley, R. A., Merchant, S. D. and Wilcoxon, F. (1966). Sequential rank tests. II. Modified two
sample procedures. Technometrics 8, 615-623.
Braun, H. (1976). Weak convergence of sequential rank statistics. Ann. Stat. 4, 554-575.
Sequential nonparametric tests 695
Brown, B. M. (1971). Martingale central limit theorems. Ann. Math. Statist. 42, 59--66.
Burdick, D. L. (1973). A best sequential test for symmetry when the probability of termination is
not one. Ann. Stat. 1, 1195--1199.
Choi, S. C. (1973). On nonparametric sequential tests for independence. Technometrics 15,
625--629.
Chow, Y. S., Hsiung, C. A. and Yu, K. F. (1980). Limit theorems for a positively drifting process
and its related first passage times. Bull. Inst. Math. Acad. Sin. 8, 141-172.
Chow, Y. S. and Teicher, H. (1978). Probability Theory. Springer, Heidelberg.
Cox, D. R. (1963). Large sample sequential tests of composite hypotheses. Sankhya Ser. A 25,
5-12.
Darling, D. A. and Robbins, H. (1967a). Iterated logarithm inequalities. Proc. Nat. Acad. Sci. 57,
1188-1192.
Darling, D. A. and Robbins, H. (1967b). Inequalities for the sequence of sample means. Proc. Nat.
Acad. Sei. 57, 1577-1580.
Darling, D. A. and Robbins, H. (1968a). Some further remarks on inequalities for sample sums.
Proc. Nat. Acad. Sci, 60, 1175-1182.
Darling, D. A. and Robbins, H. (1968b). Some nonparametric sequential tests with power 1. Proc.
Nat. Acad. Sci. 61, 804--809.
DeGroot, M. H. (1960). Minimax sequential tests of some composite hypotheses. Ann. Math.
Statist. 31, 1193-1200.
Dvoretzky, A., Kiefer, J. and Wolfowitz, J. (1953). Sequential decision problems for processes with
continuous time parameter. Testing hypotheses, Ann. Math. Statist. 24, 254-264.
Dvoretzky, A., Kiefer, J. and Wolowitz, J. (1956). Asymptotic minimax character of the sample
distribution function and the classical multinomial estimator. Ann. Math. Stat. 27, 642-669.
Eisenberg, B., Ghosh, B. K. and Simons, G. (1976). Properties of generalized sequential probability
ratio tests. Ann. Stat. 4, 237-251.
Fabian, V. (1956). A decision function. Czechoslovak Math. J. 6, 31-41.
Farrell, R. H. (1964). Asymptotic behavior of expected sample size in certain one sided tests. Ann.
Math. Statist. 35, 36-72.
Ghosh, B. K. (1970). Sequential Tests of Statistical Hypotheses. Addison-Wesley, Reading.
Ghosh, M. and Sen, P. K. (1976). Asymptotic theory of sequential tests based on linear functions of
order statistics. In: Ikeda et al., eds., Essays in Probability and Statistics. Tokyo.
Ghosh, M. and Sen, P. K. (1977). Sequential rank tests for regression. Sankhya Ser. A 39, 45-62.
Govindarajulu, Z. (1975). Sequential Statistical Procedures. Academic Press, New York.
Govindarajulu, Z. (1977). Stopping time of a c-sample rank-order sequential probability ratio test.
Trans. 7th Prague conference 1974, Vol. B. Reidel, Dordrecht, pp. 163-174.
Govindarajulu, Z, and Mason, D. M. (1980). A strong representation for linear combinations of
order statistics with applications to fixed-width confidence intervals for location and scale
parameters. Tech. Rep.
Gut, A. (1974). On the moments and limit distributions of some first passage times. Ann. Prob. 2,
277-308.
Hajek, J. (1974). Asymptotic sufficiency of the vector of ranks in the Bahadur sense. Ann. Stat. 2,
75-83.
Hajek, J. and Sidak, Z. (1967). Theory of Rank Tests. Academic Press, New York.
Hall, P. (1974). Two asymptotically efficient sequential t-tests. In: J. Hajek, ed., Proc. Prague Conf.
Asympt. Methods. Academia, Prague.
Hall, W. J. and Loynes, R. M. (1977). Weak convergence of processes related to likelihood ratios.
Ann. Stat. 5, 330-341.
Helmets, R. (1981). A Berry-Essen theorem for linear combinations of order statistics. Ann. Stat.
9, 342-347.
Hewett, J. E. and Spurrier, J. D. (1979). Some two-stage k-sample tests. J. Amer. Stat. Ass. 74,
398-404.
Holm, S. (1973a). A sequential rank test. In: Proc. Prague Syrup. Asymp. Star., pp. 157-172.
Holm, S. (1973b). The asymptotic minimax character of sequential binomial and sign tests. Ann.
Stat. 1, 1139-1148.
696 Ulrich Miiller-Funk
Miiller-Funk, U. (1983a). Sequential signed rank statistics. Comm. Stat. C-Sequential Analysis 2,
123-148.
Mfiller-Funk, U. (1983b). A quantitative SLLN for linear rank statistics. Star. and Dec. 1, 371-378.
Dec. 1.
Mfiller-Funk, U., Pukelsheim, F. and Witting, H. (1983). Locally most powerful tests for two-sided
hypotheses. To appear in Proc. I V Pann. Syrup. Math. Star.
Noether, G. E. (1954). A sequential test of randomness against linear trend. Ann. Math. Stat. 25,
176 (abstract).
Orban, J. and Wolfe, D. A. (1980). Distribution-free partially sequential placement procedures.
Comm. Star. A 9, 883-904.
Puri, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis. Wiley, New York.
Randles, R. H. and Wolfe, D. A. (1979). Introduction to the Theory of Nonparametric Statistics. Wiley,
New York.
Reynolds, M. (1975). A sequential signed-rank test for symmetry. Ann. Stat. 3, 382-400.
Rieder, H. (1981). Robustness of one- and two-sample rank tests against gross errors. To appear in
Ann. Star. 9.
Rieder, H. (1982). A general robustness property of rank correlations. Unpublished manuscript.
Robbins, H. (1970). Statistical methods related to the law of the iterated logarithm. Ann. Math.
Statist. 41, 1397-1409.
Robbins, H. and Siegmund, D. (1970). Boundary crossing probabilities for the Wiener process and
sample sums. Ann. Math. Statist. 41, 1410-1429.
Robbins, H. and Siegmund, D. (1973). Statistical tests of power one and the integral representation
of solutions of certain partial differential equations. Bull. Inst. Math. Acad. Sin. 1, 93-120.
Romani, J. (1956). Tests no parametricos en forma secuencial. Trabajos de Estadistica 7, 43-96.
Ruymgaart, F. and van Zuijlen, M. C. A. (1978). On convergence of the remainder term in linear
combinations of functions of order statistics in the non-i.i.d, case. Sankhya Ser. A 40, 369-387.
Savage, I. R. and Savage, L. J. (1965). Finite stopping time and finite expected stopping time. J.
Roy. Stat. Soc. 27, 284-289.
Savage, I. R. and Sethuraman, J. (1966). Stopping time of a rank-order sequential probability ratio
test based on Lehmann alternatives. Ann. Math. Stat. 37, 1154-1160; Correction Ann. Math.
Stat. 38, 1309 (1967).
Savage, I. R. and Sethuraman, J. (1972). Asymptotic distribution of the log likelihood ratio based
on ranks in the two sample problem. In: Proc. VIBerkeley Syrup. Math. Stat. Prob.
Schmitz, N. (1982). Minimax sequential tests for the drift of a Wiener process. To appear in:
Mathematische Systeme in der Okonometrie.
Sen, P. K. (1960). On some convergence properties of U-statistics. Calcutta Star. Ass. Bull. 10,
1-18.
Sen, P. K. (1970). On some convergence properties of one-sample rank-order statistics. Ann. Math.
Star. 41, 2140-2143.
Sen, P. K. (1973a). Asymptotic sequential tests for regular functionals of distribution functions.
Teoria Veroyatnostey iee Primenyia 18, 235-249.
Sen, P. K. (1973b). An asymptotically optimal test for the bundle strength of filaments. Ann. Star.
1, 526-537.
Sen, P. K. (1975). Rank statistics, martingales, and limit theorems. In: M. L. Puri, ed., Stat. Inf. and
Rel. Topics. Academic Press, New York.
Sen, P. K. (1977a). On Wiener process embedding for linear combinations of order statistics.
Sankhya Set. A 39, 138-143.
Sen, P. K. (1977b). Some invariance principles relating to jack-knifing and their role in sequential
analysis. Ann. Stat. 5, 316-329.
Sen, P. K. (1977c). Tied-down Wiener process approximations for aligned rank order statistics and
some applications. Ann. Stat. 5, 1107-1123.
Sen, P. K. (1978). Nonparametric repeated significance tests. In: P. P. Krishnaiah, ed., Develop-
ments in Statistics, vol. I. Academic Press, New York.
Sen, P. K. (1980a). On almost sure linearity theorems for signed rank order statistics. Ann. Star. 8,
313-321.
698 Ulrich Miiller-Funk
Sen, P. K. (1980b). Asymptotic theory of some tests for a possible change in the regression slope
occurring at an unknown time point. Z. Wahrsch. Verw. Geb. 52, 203-218.
Sen, P. K. (1981). Sequential Nonparametrics. Wiley, New York.
Sen, P. K. (1983a). Some recursive residual rank tests for change-points. In: M. H. Rizvi et al., eds.,
Recent Advances in Statistics: Papers in Honor of Herman Chermoff's Sixtieth Birthday.
Academic Press, New York.
Sen, P. K. (1983b). Tests for change-points based on recursive U-statistics. Comm. Stat.-Sequ.
Analysis 1, 263-284.
Sen, P. K. and Ghosh, M. (1971). On bounded length sequential confidence intervals based on
one-sample rank-order statistics. Ann. Math. Stat. 42, 189-203.
Sen, P. K. and Ghosh, M. (1972). On strong convergence of regression rank statistics. Sankhya Ser.
A 34, 335-348.
Sen, P. K. and Ghosh, M. (1973a). A Chernoff-Savage representation of rank order statistics for
~b-mixing processes. Sankhya Ser. A 35, 153-172.
Sen, P. K. and Ghosh, M. (1973b). A law of the iterated logarithm for one-sample rank order
statistics and an application. Ann. Stat. 1, 568--576.
Sen, P. K. and Ghosh, M. (1974a). On sequential rank tests for location. Ann. Stat. 2, 540-552.
Sen, P. K. and Ghosh, M. (1974b). Some invariance principles for rank statistics for testing
independence. Z. Wahrsch. Verw. Geb. 29, 93-108.
Sen, P. K. and Ghosh, M. (1980). On the Pitman efficiency of sequential tests. Calcutta Star. Ass.
Bull. 29, 65-72.
Serfling, R. (1980). Approximation Theorems of Mathematical Statistics. Wiley, New York.
Sethuraman, J. (1970). Stopping time of a rank-order sequential probability ratio test based on
Lehmann alternatives II. Ann. Math. Stat. 41, 1322-1333.
Siegmund, D. (1968). On the asymptotic normality of one sided stopping rules. Ann. Math. Stat. 39,
1493-1497.
Sproule, R. N. (1969). A sequential fixed-width confidence interval for the mean of a U-statistic.
Ph.D. dissertation, Chapel Hill, North Carolina.
Spurrier, J. D. (1978). Two stage sign tests that allow first stage acceptance and rejection. Comm.
Stat. Ser. A 7, 399-408.
Spurrier, J. D. and Hewett, J. E. (1979). Two-stage Wilcoxon tests of hypotheses. J. Amer. Stat.
Ass. 71, 982-987.
Strauch, J. (1982). A nonparametric interminable test for symmetry of power one. Comm.
Star. C-Sequential Analysis 2, 87-97.
Tsao, C. K. (1954). Sequential rank sums tests. Ann. Math. Stat. 25, 177 (abstract).
van Eeden, C. (1972). An analogue of signed-rank statistics of Jureckova's asymptotic linearity
theorem for rank statistics. Ann. Math. Star. 43, 791-802.
Weed, Jr., H. D. and Bradley, R. A. (1971). Sequential one-sample grouped signed rank tests for
symmetry. J. Amer. Stat. Assoc. 66, 321-326.
Weed, Jr., H. D., Bradley, R. A. and Govindarajulu, Z. (1974). Stopping times of two rank order
sequential probability ratio tests for symmetry based on Lehmann alternatives. Ann. Stat. 2,
1314-1322.
Wijsman, R. A. (1977a). A general theorem with applications on exponentially bounded stopping
time without moment conditions. Ann. Stat. 5, 292-315.
Wijsman, R. A. (1977b). Obstructive distributions in a sequential rank-order test based on
Lehmann alternatives. In: S. S. Gupta and D. S. Moore, eds., Statistical Decision Theory and
Related Topics II. Academic Press, New York.
Wijsman, R. A. (1979). Stopping time of invariant sequential probability ratio tests. In: P.
Krishnalah, ed., Developments in Statistics I1. Academic Press, New York.
Wilcoxon, F., Rhodes, L. J. and Bradley, R. A. (1963). Two sequential two-sample grouped rank
tests with applications to screening experiments. Biometrics 19, 58-84.
Witting, H. and N611e, G. (1970). Angewandte Mathematische Statisik. Teubner, Stuttgart.
Woodroofe, M. (1982a). On sequential rank tests. Univ. Michigan preprint.
Woodroofe, M. (1982b). Likelihood ratio tests with ranks. Univ. Michigan preprint.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 90
Elsevier Science Publishers (1984) 699-739
1. Introduction
699
700 Pranab Kumar Sen
H: F1 . . . . . Fq#Fq+l . . . . . F~ f o r s o m e q : l ~ q ~ n - 1 ,
(2.2)
that is, r E (tq, tq+l], for some unknown q: 1 ~< q <~ n - 1. A similar situation
arises in the sequential detection problem (viz., Shirayayev (1963, 1978)), where
one may conceive of an infinite sequence {Xi; i>~1} of independent r.v.'s,
gathered over a sequence {t~; i >I 1} of ordered time points, such that for some
integer q (>~1), {Xi;i<~q} are i.i.d.r.v, with a d.f. F1 and {X~; i>~ q + 1} are
i.i.d.r.v, with a d.f. Fq+l; here q may even be equal to ~. The problem is to raise
an alarm if q < o% while, for q = o% the process should not be unnecessarily
stopped. Thus, one would like to choose a stopping number N, such that if
q <0% then, E { m a x ( N - q , 0)} = E ( N - q ) + (or some other measure of the
excess of N over q) should be as small as possible, while, for q = 0% the
probability of a false alarm (i.e., N < ~) should be small, i.e., E N should be
large. Thus, in a sequential detection problem, we have a genuine sequential
scheme, while, in a change-point problem, n is specified in advance, though the
test for (2.1) against (2.2) may be done sequentially or not. Though, in the
context of statistical quality control, the sequential detection problem seems to
be more appropriate, it may be remarked that in view of the customary
adjustments and inspections at regular intervals, the change-point problem
remains equally appropriate. However, within each inspection period (i.e., for a
maximum n specified in advance), the test for (2.1) against (2.2) may be made
recursively, so that an early termination may be recommended whenever
desired. This may call for some quasi-sequential schemes in a change-point
problem. Some of these procedures are discussed here.
In the parametric case, the simplest change-point problem relates to the shift
alternative where one takes Fdx) = F ( x - 0~), i/> 1, F defined on E and the 0i
real, and then one wants to test for H0:01 . . . . . 0. = 0 (unknown) against
H:01 ..... Oq#Oq+l . . . . . 0., for some unknown q E [ 1 , n - 1 ] . In a
somewhat more general setup, one may consider a linear model:
where the i:~ are specified vectors of real elements, the fl~ are unknown
regression coefficients (vectors), and one wants to test for the constancy of
regression relationships over time i.e., for H0: fll . . . . . ft, = fl (unknown)
against H : fll . . . . . ~q ~ flq+l . . . . . ft, for some unknown q E [1, n - 1].
The shift alternative is a particular case of (2.3) when the c~ are all real
elements, equal to 1 and fli = 0i, i 1> 1. Tests for the change-point problem
relating to the model (2.3) have been studied by a host of workers. Most of
these parametric tests are non-sequential in nature, and they are based on the
residuals based on the terminal estimate of the hypothesized common ft. Some
quasi-sequential tests based on recursive estimates of fl are also available; the
monograph due to Hackl (1980) contains a good account of these parametric
procedures. Recursive residuals can be used with advantage in this problem.
For normal F, a detailed account of some cumulative sum (CUSUM) pro-
cedures based on recursive residuals is due to Brown, Durbin and Evans
(1975). The theory has been extended to the nonparametric case (of unknown F
belonging to some suitable family) by Sen (1982a). Chernoff and Zacks (1964)
have considered some Bayes procedures (for normal F ) having a good impact
on the nonparametric case too. In this article, we confine ourselves to the
developments in the nonparametric case only. Also, in view of the fact that the
case of the shift alternative has been treated in the article by G. K. Bhat-
tacharyya (viz., Chapter 5), we shall deal witla this model only very briefly, and
then proceed on to the case of the general model in (2.3).
Assuming that the observations are initially from a symmetric (unknown)
distribution with a specified median 00, Page (1955) proposed a test for a
change-point based on the cumulative sums of the s g n ( X / - 00), i t> 1. Bhat-
tacharyya and Johnson (1968) considered a general class of locally optimal rank
tests for the same problem, where a Bayesian setup of Chernoff and Zacks
(1964) has been incorporated. If we assume that for the change-point ~-,
P{'r=ti}=di, i = 1 , . . . , n , and set Di=Ej~idh i = l , . . . , n , and if R+i stands
for the rank of [X~ - 00[ among IX1- 001. . . . . IX, - 00[, for i = 1 . . . . . n, then, for
an assumed form f of the density function corresponding to the d.f. F (having a
finite Fisher information), the statistic
where gnl < " " < gnn are the ordered r.v. of a sample of size n from the
uniform (0, 1) d.f. and dp(u)=-f'(F-l(u))/f(F-l(u)), O < u < l is the Fisher
score function. In particular, if we take for F a double exponential, logistic or
normal d.f., then a+(k) = 1, k/(n + 1) or the expected value of the k-th order
702 P r a n a b K u m a r Sen
statistic of a sample of size n from the chi distribution with 1 degree of freedom
(1 ~< k <~ n), and these correspond to the weighted sign, signed rank and normal
scores statistics. Note that, often, in the absence of any knowledge on the d~,
they are all taken to be equal (i.e., Di = i/n, i = 1 . . . . . n), and this leads to a
simplified version of (2.4). When the initial level 00 is not specified, as is mostly
the case, the corresponding locally optimal invariant test statistic is
where R.i = rank of X~ among X1. . . . . X., for i = 1. . . . . n. For small values of
n, the exact null distribution of S. (or L.) can be obtained by direct enumera-
tion of the 2"(n!) (or n!) equally likely realizations of the vectors of ranks and
signs (or ranks), while the asymptotic normality results hold under very general
conditions on the Di and the score function th. In the context of equal d~ and
for the specific scores ~b(u)=sgn(u-) and ~b(u)= u-, 0 < u < l , more
simplified expressions for the S. and L. are available. If we define a two-
sample median (or Wilcoxon) statistic by Mk,.-k (or Wk,.-k) when X1 . . . . , X k
constitute the first sample and Xk+~, , X . the second sample, then L. may be
expressed as the sum (over k: 1 <~ k ~<n - 1) of these pseudo two-sample
statistics. The pseudo two sample approach has further been considered by A.
Sen and Srivastava (1975). They considered the statistics
where Eo and varo stand for the expectation and variance under Ho and Uk, n_ k
is the Mann-Whitney form of the Wilcoxon statistic. Two-sided versions of
(2.7) and (2.8) were also considered by them. Pettitt (1979) considered a variant
form of (2.8), viz.,
and its two-sided version, and (2.8) and (2.9) differ only with respect to the
weights ~ / k ( n - k)/12. Schechtman and Wolfe (1981) have also considered
these statistics and provided some simulation studies of the allied distribution
theory for specific values of n. For some asymptotic theory, we may refer to
Sen (1978, Section 6). P. K. Bhattacharya and Frierson (1981) have considered
some sequential procedures (for the control chart setup) for detecting a
possible shift in location; some allied work is also due to Lombard (1981, 1983).
These procedures are genuinely distribution-free. Some alternative asymp-
totically distribution-flee procedures for more general models have been
considered by Sen (1977, 1980, 1982b).
Nonparametric procedures for some miscellaneous problems 703
= [ k -
\ m - 1] ~ rk(X~l . . . . . X~,,_I, Xk), k >! m . (2.11)
l <.il<...<im_l<.k-1
Thus, at the k-th stage (k ~> rn + 1), the recursive residual based off U~ is
..,, ...,
where the summation Ek,i extends over all possible 1 ~< i 2 < " - < im ~< k with
704 Pranab Kumar Sen
P{1)+~ ~ x l H o } ~ 2 ~ ( x ) - 1, (2.18)
co
P{19. <-x[Ho}-+ ~ (--1)k[q~((2k + 1 ) x ) - q~((2k - 1)x)], (2.19)
k=-oo
where ~ ( x ) is the standard normal d.f. Thus, with the (asymptotic) critical
values determined from (2.18)-(2.19), a control chart type procedure may be
based on the normalized CUSUMS Wk/gk, rejecting the null hypothesis
whenever the graph crosses the critical level(s). The tests have been shown to
be consistent under very general regularity conditions. Also, for local alter-
natives, the limiting nonnull distribution theory of D"+ , and / ) , has been
obtained in terms of the boundary crossing probabilities of some drifted
Brownian motions, with nonlinear drift functions). It may be remarked that for
~b(x) = x (m = 1), we have
k
U k = X k = k-I ~ , X ~ Vk~>l, U ~ = Xk V k ~ > l ,
i=1
and s 2 is the sample variance ( ( k - 1)-1 k= 1 ( X / - Xk)2) for sample size k/> 2.
The corresponding /)+ ( o r / ) , ) is then suitable for the shift alternative.
Similarly, for m = 2, ~b(xx, x2) = (xa- x2)2, O(F). reduces to the variance of the
distribution F and the corresponding/)+ (or D , ) is suitable for any change in
dispersion.
Let us now consider some recursive residual rank tests for change-points
relating to the general linear model in (2.3), where the fli and ci are t-vectors
for some t ~> 1. Assuming that /3z . . . . . /~k = r , let /~k be some suitable
Nonparametric procedures for some miscellaneous problems 705
(Generally, for k < t, the residuals )~i+l,i may be taken to be equal to 0.)
Note that the flk may be quite arbitrary (i.e., least squares estimators,
M-estimators, R-estimators, etc.), and the only regularity condition assumed for
such a sequence of estimators is that under H0: fll . . . . . /~n = r , for every
e > 0, there exists an interger k0 (t>1) such that
(It has been shown by Sen (1983a) that (2.21) holds (under quite general
regularity conditions) for the least squares, M- and R-estimators.) Now, for
every k (~>t+ 1), let / ~ i be the rank of I ,il among I k, l, for
i = 1 , . . . , k. Also, define the scores {a~(i), i = 1 , . . . , k; k/> 1} as in (2.5).
Then, the residual s i g n e d - r a n k scores are defined by
conventionally, we let fik = 0 for k ~< t. Then, the CUSUM's for the residual
rank scores in (2.22) are defined by
We also define
A 2 = A~ =
f0'~bZ(u)du = {~b+(u)}2 d u , (2.24)
The recursive tests for the change-point problem are based on D ,+ and /9,.
Unlike the location (shift) model, here, the use of D ,+ may only be advocated
when under the alternative hypothesis, the fl'kCk are monotone. Since this may
not generally be the case, the two-sided test statistic in (2.26) is generally
adopted. It may be noted that the recursive residual scores t~k, k ~> t are,
generally, neither strictly independent nor marginally identically distributed.
Hence, the tests based on D ,+ or D , may not be genuinely distribution-free.
706 Pranab Kumar Sen
it follows from Sen (1983a) that, under H0, D + and D, have respectively the
limiting distributions given by the right hand sides of (2.18) and (2.19). In this
context, we may, of course, allow the score function 4) = {~b(u), 0 < u < 1} to be
quite arbitrary, such that on letting ~b(r)(u) = (d~/du~)qb(u), r = 0, 1, 2, there exist
a generic positive constant K (<~) and a 6 (<1) for which
All the commonly adapted score functions satisfy (2.29). Asymptotic nonnull
distribution theory of D ,+ and D,, for local alternatives, has also been studied in
Sen (1983a). For the simple shift alternative scheme, for which the
Bhattacharya-Frierson (1981) sequential ranking scheme works out well,
Lombard (1981, 1983) has considered some weighted quadratic sums of rank
statistics for the change point problem. His procedure, however, may not apply
to the linear models treated above.
group and a treatment group, and the response relates to the time of occur-
rence of an event (e.g., heart attack/death), then, instead of waiting till the
study period is over, one may like to monitor the study from the very beginning
so that if one of the two groups performs significantly better than the other
one, then the surviving patients may all be switched to that group. This may
also call for a curtailment of the study at an early stage. On the other hand, if
there is no real difference between the two groups then the continuation of the
follow-up study does not pose any extra risk to any particular group, and an
early termination is really not that needed. This basic motivation underlies the
formulation of progressive censoring schemes (PCS). In passing, we may remark
that a repeated analysis scheme on the accumulating data may lead to a higher
risk of making incorrect decisions, unless proper statistical analysis schemes are
validly formulated.
We may refer to Chapter 25 (by Basu) where the concepts of various types of
censoring have been introduced and the related nonparametric methods have
also been discussed. With these in mind, we conceive of a set { X 1 , . . . , X,} of
independent random variables with continuous distribution functions (d.f.)
{Fa . . . . . F,}. Suppose that we want to test for the i~u!l hypothesis H0:F1--
. . . . F, = F (unknown), against some alternative (where the F~ are not all the
same). Let Z,~ ~<... ~< Z,, be the order statistics associated with X~ . . . . . X,,
and let us define
Thus, R = (R1 . . . . . R , ) and S = (Sb. . , S,) stand for the vectors of the ranks
and anti-ranks of the observations. In a life-testing problem, the failures occur
spatially, so that, one observes the following sequence in order
If the experiment is conducted for a fixed period of time [0, T] and if r = r(T)
is defined by r(T) = max{k: Znk <~T and k <~n}, then, for the Type I censoring
scheme, one observes the data set {(Z~I, S1). . . . . (Z,,(r), St(r))} along with the
complementary set (Sr(r)~. . . . . S,), though the exact permutation of this later
vector is not known at time-point T. In a Type II censoring scheme, for some
prefixed r (1 ~< r -<-<n), one waits upto a (random) time-point Znr and obtains the
data set {(Z~I, S 1 ) , . . . , ( Z , , , S r ) } along with the complementary set
(S,+1. . . . . S~), again without the precise knowledge on the exact permutation of
this vector. Nonparametric tests based on such censored data have been
discussed in Chapters 25 (by Basu), 26 (by Doksum and Yandell) and 32 (by
Wieand). Generally, in a life testing problem, continuous monitoring is needed
to record the set in (3.2), and, as such, it seems unreasonable to wait until the
time-point T or Z,~ has been reached and then to test for H0. It is more natural
to test for the null hypothesis as more information becomes available at the
successive failures (Z~), so that if at any early stage the null hypothesis
708 Pranab Kumar Sen
becomes untenable, the study may be curtailed along with the rejection of H0.
Basically, this calls for a repeated significance testing procedure (at each failure
point), and, as the accumulating data may not have independent or stationary
increments, special care needs to be taken so that the Type I or II error rates
for the procedure are under control. This is done through the formulation of
some suitable time-sequential procedures, which are described below.
Consider a typical linear rank statistic
T, = ~ ( c , - g , ) a , ( R , ) , (3.3)
i=l
where cl, . . , c. are given constants, ~. = n -11gT=xci and a,(1) . . . . , an(n) are
the scores. Using (3.1), we may rewrite (3.3) as T, = Ei~l (Cs,- g~)a,(i). With
this form, we may consider the censored rank statistics {T,~} by letting, for every
r ~ n,
a*(r) = {
(n-r) -1 ~ a.(j), O < - r < - n - 1 ,
O,
j=r+l
r = n.
(3.5)
Note that at the k-th failure Z,a,, one can compute the censored rank statistic
T,k, for k >1 1; conventionally, we let 7".o= 0. Operationally, in a progressive
censoring scheme, at the k-th failure Z,k, one computes T~k (or IT, k[): If, for
the first time, for some k (>~1), T~k (or IT, ll) exceeds a critical value ~-+,(or ~-,),
experimentation is stopped at that point of time (along with the rejection of
H0), while, otherwise, one proceeds on to the next failure point. In this setup,
one may either prefix a maximum number r (~<n), such that at Z~, the study is
curtailed (if it has not been so earlier), or even, one may take r = n. Thus, the
probability of the Type I error for this procedure is given by
+ ---
O/nt P{T,k > +
Tn~ for some k: 1 ~< k ~< r [ H0} (3.6)
and a similar expression holds for the two-sided case. If we define, for every r,
n: l ~ r < ~ n , n>~2,
and let
then, if follows from the results of Chatterjee and Sen (1973) that under Ho and
some very mild regularity conditions on the scores, for r: r/n ~ p : 0 < p ~ 1,
where W = {W(t); 0 ~< t ~< 1} is a standard Brownian motion on [0, 1]. Note
that, for every A 1> 0,
where qo(.) is the standard normal d.f. If A~ + and A~ stand for the solutions for
the right hand sides of (3.11) and (3.12) being equated to a given a (0 < a < 1)
then, for large n, r*~ =A~,C,A+~ and a similar approximation holds for the
two-sided case. For small n, the next value of ~'n* may be obtained by direct
enumeration of the permutation distributions of ( $ 1 , . . . , S~) (over the set of
permutations of (1 . . . . , n)). Asymptotic power properties of this PCS testing
procedure have been studied by Chatterjee and Sen (1973), Sen (1976) and
others, and a detailed account of these is given in Chapter 11 of Sen (1981a). In
the particular case of two-sample statistics, we have, for some nl: 1 ~< nl < n,
n2 = n - nl,
For some numerical studies in this case, we may refer to Davis (1978), For the
case of log-rank scores, i.e., a~(k)= --l+~k~l (n -- i + 1)-1, k = 1. . . . . n, we
may refer to Majumdar and Sen (1978b) and Koziol and Petkau (1978).
Majumdar and Sen (1978a) considered a vector generalization of the above
PCS procedure. In (3.3), replace the ci (and 5~) by q-vectors c/ (and ~'~), for
some q >t 1. This typically arises in a multi-sample model. A similar change is
needed in (3.4), while, in (3.8), replace C2~ by a q q matrix Cn=
~7~1 (ci - ~)(ci - ~)'. Then, at the k-th failure, we consider the test statistic
say, and the percentile points of the distribution of B ] have been tabulated by
De Long (1981). Asymptotic power properties of this test, for local alternatives,
have been studied by Majumdar and Sen (1978a) and De Long (1980).
For the specific two-sample problem, Koziol and Byar (1975) proposed the
use of truncated versions of the classical Kolmogorov-Smirnov statistic and
tabulated their percentage points too; for some one-sided tests, we may refer to
Schey (1977). Sinha and Sen (1979a) considered a weighted empirical process
which includes the two-sample problem as a particular case. Let
n
S . ( x ) = n -~ ~ t ( x , ~ x)
i=1
and (3.15)
H . ( x ) = C ; ~ ~ , (c, - e . ) / ( X , < x), x >/0,
i=1
where C2, is defined by (3.8). Also, let o~ = {~o(t), 0 < t < 1} be a nonnegative,
smooth weight function. Consider then the statistics
Note that like the ~,k, K*r remains invariant under a reparameterization:
Xi - fl'ci ~ X~ - ~"di where d i = Dci and D is nonsingular. For the asymptotic
distribution theory of K*,, the standardized Bessel process may be called for,
and, percentile points for the relevant distributions can be obtained from De
Long (1981).
PCS tests for the analysis of covariance (ANOCOVA) models based on ap-
propriate rank statistics have also been considered by Sen (1979, 1981b) and
others. These will be discussed briefly in the next section. In the rest of this
section, we consider some PCS procedures relating to the Cox (1972) propor-
tional hazard model Since the proportional hazard models have already been
discussed in Chapters 32 (by Wieand) and 26 (Doksum and Yandell), we shall
treat the same only briefly and stress mainly on the relevant PCS procedures.
Note that these models are quasi-nonparametric in character: the hazard
function is of nonparametric nature, but, the dependence on the covariates is of
a specified structure.
For the i-th subject having survival time Y~ and a set of concomitant variates
di = ( d i l , . . . , d~)', for some q ~> 1, consider the model that the conditional
hazard rate, given d~, is of the form:
where Q1. . . . , Q,, stand for the anti-ranks corresponding to the failure points
tl tin, respectively. The partial likelihood scores are then defined by
. . . . .
Also, let
0.22)
where d~ = r711gi~ej d~, j = 1 , . . . , m. At the k-th failure point, one may then
consider the Cox (1972) form of the partial likelihood ratio test statistic
,~.2 *
=
t * - *
(3.23)
In a PCS setup, we look at the data at each failure point, i.e., at each k
(=1 . . . . . m), we compute ~*k. If, for the first time, for some k, ~*k exceeds a
critical value ~-*~, we stop at that point of time (tk) along with the rejection of
the null hypothesis H0: fl = 0. If no such k occurs during the tenure of the
study, the null hypothesis is accepted. It follows from the results of Sen (1981)
that for the sequence {~,k. k = 1 , . . . , m}, the standardized q-dimensional
Bessel process may be used to study the asymptotic null distribution of
max{~*k: k ~< m}, and hence, the procedure is similar in nature to the one
based on the rank statistics. For null hypotheses other than fl = 0, in the scores
in (3.21) and the information matrix in (3.22), unknown parameters (i.e., the
unspecified part of fi) may enter and substitution of their estimates may be
necessary. This will result in a nondistribution-free nature of the resulting test
statistic. However, the asymptotic distribution-free character still retains, and
parallel PCS procedures work out well. We may refer to Tsiatis (1981a,b) and
Anderson and Gill (1982), among others, where these procedures have been
worked out.
against the set of alternatives that they are not all equal. It may be convenient
to conceive of the model
where the ci are specified r (~l)-vectors, not all equal, ~, = /'/ -2 ~i=1 ci and 18
parameterizes the regression of the primary variates on the ci. For the parti-
cular case of one-way RANOCOVAmodel, the cl can only assume the realizations
(1, 0 . . . . ,0), (0, 1 , . . . , 0) . . . . . ( 0 , . . . , 0, 1). In this more general setup in (4.2),
we like to test for H0:18 = 0 against fl ~ 0.
Let E* = (XT . . . . , X*) be the (p + q) n matrix of the sample observations.
For each row, we adapt a separate ranking scheme. This leads us to the
following r a n k collection m a t r i x
R * = ( R ~ P" (4.3)
\ R n / qn
ap(1)""" a ( n ) (4.4)
An = an1(1)""" a n l ( n )
a~q(1)..- a , ~ ( n )
where the scores are defined as in (2.5) and (2.6) with possibly different score
generating functions for the different variates. As in (3.3), we define the rank
statistics
T* = (T , Tn)
'~+q) "P 'q (4.5)
The within row averages of the scores in (4.4) are denoted by fil . . . . , t~p and
a,1 . . . . . d~q, respectively. Let then
00 _
Vnjj'-- (n - 1)-1 ( a n0i ( R n0l ) _ anj)(a~r(Ri,i
-o o o ) _ (to,) , j,j'= 1..... p,
(4.6)
714 Pranab Kumar Sen
j , j ' = 1. . . . . q. (4.8)
We denote
V ~ = {V
\ VO, V~
V~ ] where V = ((v,jj,)),
00 V = ((vjj,)) and V~ = ((Gjj'))
(4.9)
Further, we define
Then, the first step is to use the fact that the permutational dispersion matrix of
T* is equal to C , V*, and hence, the fitted value of the primary variate rank
statistics on the covariate rank statistics yields the following residuals:
Finally, let ~ o . be the rolled out rp-vector from T ,0. , and let
= . (4.13)
Then, IP* is the r x p matrix of covariate-adjusted rank order statistics and 5~*
is the test statistic based on these adjusted statistics. For small values of n, the
exact permutational (conditional) distribution of 5~* can be obtained by direct
enumeration of the n! equally likely column permutation of the matrix R* in
(4.3), and, for large n, g* has closely the chi square distribution with p r
degrees of freedom, when the rank of (7, = r and the null hypothesis holds.
The multivariate approach developed in Sen and Puri (1970) also yields the
asymptotic distribution theory under alternative hypotheses. For suitable
sequences of local alternatives, the asymptotic distribution of ~ * turns out to
be a noncentral chi square with rp degrees of freedom and an appropriate
noncentrality parameter A.}. In passing, we remark that for the MaNOVA
problem of testing the equality of the p-variate (marginal) d.f.'s F ~ (of the X~0),
one actually ignores the concomitant variates (X~) and, based on the T O and
Nonparametric procedures for some miscellaneous problems 715
where /~,o is the rolledout rp-vector from T . For small values of n, the exact
permutation distribution of 5~ can be obtained by direct enumeration, while,
for large n, under H0, L0 has closely the central chi square distribution with pr
degrees of freedom. For local alternatives, similar to the MANOCOVAmodel, the
asymptotic distribution of g 0 is noncentral chi square with pr degrees of
freedom and noncentrality parameter A.e, where Az depends on the marginal
df's F~, i = 1 , . . . , n . It can be shown that for a sequence of common
alternatives,
Thus, 5* is the difference between the classical rank MANOVA statistic for the
entire set of p + q variates and the one for the set of q covariates alone. This
formula avoids the computation of the residuals in (4.11) and the adjusted
matrix in (4.12). Gerig (1975) has used this characterization of the-gAr~OCOVA
statistics for the two-way layout problem. H e used the multivariate generaliza-
tion of the Friedman (intra-block) rank statistics (viz., Gerig, 1969) and
computed the two test statistics for the entire set of p + q characters and the
subset of q covariates only, and their difference provides the desired test
statistic for the MONOCOVA problem. MANOCOVA models incorporating aligned
ranking are also considered in Puri and Sen (1971, Ch. 7). These are con-
ditionally distribution-free and the results run parallel to the ones in the
one-way layout case.
Let us now consider the case of censored data relating to ANOCOVA. For
simplicity, we consider the case of p = 1 and q i> 1. Corresponding to the rank
vector R in (4.3), we define the anti-rank vector S o for the primary variate by
letting
We also assume that the covariates (and hence the rank matrix Rn) are
716 Pranab Kumar Sen
1 2 -.. k ~
R o . . . .R 1 So2 . . . ' . ' . ' . , R
...1S1. ; , 1.$.o~
k ;
(4.18)
\ n qs~ R qso ... R qsk ]
we also know about the remaining ( n - k) columns of Rn, but, without the
specific order in which they would have appeared in (4.18) if the censoring
were not made. Let R = Roi, i = 1 . . . . , n, and, for each j (=0, 1 , . . . , q), define
n k
a*i(k)=(n-k)-l{ff'~anj(r)-~a,/(R#o)}, l<~k<~n-1,
"r=l i=1
=0, k = n, (4.19)
(k) (k) \
13n00 OnO ~
V *(k)= \V~)o, v~)j for k = 1. . . . . n. (4.21)
Then, proceeding as in (3.13) and using the adjustment for the covariates as in
(4.16), we may define a censored test statistic (at the k-th failure) by
for k = 1. . . . . n, where fi, stands for the rolledout vector form of a matrix A.
Note that ~f*k is the rank analysis of covariance test statistic when the
covariates are observable at the beginning while the first k failures among the
primary variables are observed and the rest censored (in a Type II censoring
scheme). Here also, under the null hypothesis in (4.1), for small values of n, the
exact permutation distribution of 5f*k can be obtained by direct enumeration,
while, for large n whenever k/n is away from 0, 5*k has closely the central chi
Nonparametric procedures for some miscellaneous problems 717
say. For various e (>0), the Percentile points of the distribution of B** (r/> 1)
have been studied by De Long (1981). Note that as t $ O, t-mBr(t) may not
behave smoothly (and, in fact, it may almost surely bounces to ___o~),and hence,
in (~-.24), the weak convergence result may not be appropriate if we let e -- 0.
Also, from the practical point of view, statistical monitoring may not be very useful
unless we have gathered at least some data to enable us to use the estimators
V*k etc. in a consistent manner. Thus, it seems quite appropriate to start the
repeated significance testing not from the very first failure, but, after a given
number of failures have occurred. Technically, we may also replace the upper
limit (n) for k in (4.24) by any number r: r/n ~ p : 0 < p ~< 1, and in that case, in
the right hand side, the upper limit (1) for t has also to be replaced by p. Then,
given a choice of p (0 < p ~ 1), one may choose a small e (0 < e < p) and start
the repeated significance testing scheme at the n0-th failure point, where
no = [ne] + 1. With this choice, one may use the PCS scheme mentioned before
and use the partial sequence {5*k; k i> no} for this purpose. Note that for any
p ~ (0, 11,
so that the DeLong (1981) tables remain applicable with e replaced by e/p.
718 Pranab Kumar Sen
Note that the columns of Z are still independent (vectors) and the e* are
i.i.d.r.v, with a d.f. F*, defined on E r, where F * depends on the underlying F
and the matrix G. In any case, whenever F is continuous, so will be F*. Now,
(5.4) represents a reduced dimensional MANOVA model, and, for this model,
rank tests may be applied in the same fashion as in the classical case.
Let us first consider the case where the design matrix X can be partitioned as
X ' = (1,,X*') where 1, = (1 . . . . ,1)' and X* is a ( q - l ) n matrix. This is
typically the case in the one-way layout model. We also write 0 = (00, 0"),
where 00 is of order r 1 and 0* of order r ( q - 1). Then, we may rewrite
(5.4) as
Z = O01"+ 02X
* *2 + e* where X*' ~- (X1*' , X2*'). (5.8)
Since the ranks are shift invariant, we may not have to be worried with the
nuisance parameter 00, but, we need to estimate 0*2 and align the observations
through the estimate before using ranks. This is what is called an aligned
ranking procedure. To estimate 0~, we use, as in Jure6kovfi (1971), suitable
rank statistics, and obtain the estimators by similar alignment schemes. Let B
be an r x s2 matrix of real elements (which we may allow to vary over the space
Ers2), and for every B, define Z ( B ) = Z - B X ~ . On the r x n residual matrix
Z(B), we adopt the coordinatewise ranking scheme (as in after (5.6)), denote
this rank collection matrix by R , ( B ) and the corresponding matrix of linear
rank statistics (based on the regressor X~ and Z(B)), of order r x s2, by T,2(B).
This is defined for all B ~ Er~2. Note that B = 0~, the coordinates of T,2(0~) all
have the mean 0, and hence, we may estimate 0~ by equating T,2(B) to 0. In
view of the fact that T,2(B) may not have the exact value 0 admissible (note
that it has elements having discrete distributions), we define
where/I.II stands for the maximum norm. It follows from Jure~kovfi (1971) that
under suitable regularity conditions, the set /3. is a closed convex set with a
maximum diameter converging to O, in probability, as n-->~. We take the
center of gravity of set /3. as our estimate of 0~, and denote this estimator
by 0~. Note that /~ is the so-called R-estimator of 05, (see Chapter 21 (by
Jure~kov~i)), and as in Chapter 12 (Aubuchon and Hettmansperger) and 11
(Adichie), we employ this R-estimator for testing H0 in (5.7). For this
purpose, we define
Let /~, be the r x n matrix of ranks of 2`, where for each row, a separate
ranking of the elements is made. Now, in (4.5), we replace the ci by the column
vectors of XT and the matrix Rn by R,; we denote the resulting matrix (of
order r x s 0 of aligned linear rank statistics by i",1. Further, in (4.10), we
replace the ci by the columns of X* (and similarly for gn), and denote the
resulting (q - 1) x (q - 1) matrix by C,. We partition C~ as
722 Pranab Kumar Sen
Cn = \On21(
cnll Cn12~,C/ n22 Cnij of order si x sj, for i, j = 1, 2. (5.11)
Finally, let
The aligned rank test statistic for testing H0 in (5.7) is ten given by
where T,1 is the rolledout form of 7",1 and V, is the rank covariance matrix in
(4.9) based on the r x n matrix Z. Under the null hypothesis H0 in (5.7), ~ , has
asymptotically central chi square distribution with rsl degrees of freedom.
Hence, an asymptotic test may be made by using the appropriate percentile
point of this latter distribution as the critical value of ~,. Under suitable
sequence of local alternatives, ~ has asymptotically noncentral chi square
distribution with rsl degrees of freedom and an appropriate noncentrality
parameter. Hence, the discussions following (5.6) all pertain to the case of this
aligned rank test too.
Finally, we consider the case where for (5.4), X' may not be expressible as
(1,, X*') or in (5.6) or (5.7), we want to test for 0 instead of 0". For this model,
we may note that the ordinary ranks are invariant under shift and hence will
not be useful in testing for 00. Thus, to overcome this problem, we may have to
use signed rank statistics instead of linear rank statistic, and, this, in turn,
demands the extra assumption that the d.f. F* of e* in (5.4) is diagonally
symmetric about 0 0 . e . , both e* and (-1)e* have the same d.f. F*). This
assumption, though less restrictive than the multinormality of F*, was not
needed for the rank tests for (5.6) and (5.7). We may define (a vector of) signed
rank statistics as in (2.4) wherein we replace the Di by the columns of X and
for the scores a+,(k) in (2.5) we may use the score generating function ~b~ for
the j-th coordinate of Z (1 ~<j ~< r). The definition of the R~ (1 ~< i ~< n) is the
same as in (2.4), but restricted to the j-th row of Z, for j = 1 . . . . . r. With these
adjustments, we are now in a position to use the general theory developed by
Adichie (1978) for rank tests for linear hypotheses (based on aligned signed
rank statistics), where at the beginning we replace the data set Y by Z. Since
this theory has been discussed in Chapter 11 (by Adichie), we refer to the
details there. One technical advantage of using the aligned signed rank statis-
tics is that instead of using specifically the R-estimators (viz., (5.10)) for the
alignment process, one may also use any other estimator having the 'root n'
consistency property. This may make the computations relatively simpler,
though at the cost of the more stringent assumption of diagonal symmetry of
F*.
works out well. However, there are certain drawbacks of the MANOVA approach
in Section 5.1 and some of these can be avoided in the alternative MAnOCOVA
approach of this section. It has been pointed out by Rao (1965) (see also Khatri
(1966)) that the dimension reducing transformation in (5.3) may throw away
some information contained in g through the complementary part of Z, which
is dropped out in the subsequent analysis. Though these criticisms were mainly
aimed at the normal theory analysis, nevertheless, they remain pertinent in the
nonparametric case too. By considering this complementary part of Z as a
matrix of (stochastic) covariates, information contained in this matrix can be at
least partially recovered by using the multivariate ANOCOVA procedures. Since
this MANOCOVAprocedure based on rank statistics has already been discussed in
Section 4, we may take advantage of that and consider parallel procedures for
the growth curve models.
In (5.3), side by side, consider another matrix W of order ( p - r ) x n
Potthoff and Roy (1964), one may also choose a symmetric and positive definite
matrix Q (of order p x p ) and define Z = (G'Q-1G)-IG'Q-1Y. Then, by (5.1)
and (5.2), we would have Z = OX + (G'Q-1G)-XG'Q-le. The solutions in the
normal theory case as well as in the nonparametric case all work out for any
positive definite Q, and hence, one may like to know whether there is any
optimal choice of Q in this context. In the normal theory case, from the point
of view of best linear unbiased estimation of a linear (estimable) function of 0,
Potthoff and Roy (1964) showed that for Q the optimal choice is the true
covariance matrix of the ei. Since this covariance matrix is unknown, they
suggested the use of the sample mean (residual) product matrix, and as it turns
out that this choice is isomorphic to the alternatives suggested by Rao (1965)
and Khatri (1966) from the ANOCOVA point of view. The situation is different in
the nonparametric case. First, in the normal theory case, the residual sum of
product matrix is stochastically independent of the estimates of linear com-
binations of 0. This result is not generally true for the nonnormal case.
Secondly, the normal theory estimates are linear ones, while the rank based
estimates are nonlinear functions of Y. Further, the invariance of the normal
theory estimates, under nonsingular transformations on the observation vec-
tors, may not apply to the rank statistics and the derived estimates, where a
coordinatewise ranking is made. For different choices of Q, we may not have
therefore the desired linear relations, and hence, locating an optimal Q may be
a harder problem in the theory of nonlinear programming. However, these are,
to some extent, only pathological points: In most of the practical applications,
the G matrix in (5.2) may be selected in a very natural way and the specific
choice of Q -- I, made in this section, would work out well. We may also note
that the form of G becomes quite simple in many longitudinal studies where
the time points at which the repeated measurements are made are the same for
each unit or individual. However, in practice, missing observations are not
uncommon. These missing values may occur systematically (which is easier to
handle) or may occur haphazardly. The patients may drop off at some point of
time, so that the observations at time points beyond that point would be
missing. Alternatively, the patients may not show up on some specific dates,
thus skipping some appointments, which results in missing observations at
random points of times. In the parametric case, to accommodate such missing
patterns, more general linear models (MGLM) have been introduced. The
analysis schemes for such M G L M ' s are generally complicated: Kleinbaum
(1973) suggested the use of B A N (best asymptotically normal) estimates for
such an analysis. However, the exactness of the solutions are affected and only
asymptotic solutions are available. Nonparametric procedures for such
M G L M ' s have not been developed to the full generality, and, more research on
this line is needed to bring it down to the practical users' level. Finally, the
solutions to the nonparametric growth curve analysis considered in this section
relate specifically to the multivariate one-way layout designs. Sen (1973) has
shown that similar solutions can be worked out in some higher order factorial
designs. For complete block designs, ANOCOVArank procedures are available in
Nonparametric proceduresfor some miscellaneousproblems 725
the literature a n d these would provide the desired solutions for the cor-
responding growth curve analysis based on rank statistics (after a suitable
dimensional reduction transformation is used). In principle, the methodological
approach is the same, and hence, these details will not be considered here.
Growth curve models may also be quite appropriate for many clinical trials:
Here, one may have some additional complications due to censoring of various
types. Since rank analysis of covariance procedures have also been developed
for such censored data (see Section 4), modifications in the scheme for growth
curve analysis due to censoring may be formulated in the same manner.
The positive constant p is termed the relative potency of the test preparation
with respect to the standard one. Also, the model in (6.1) is termed the
fundamental assumption of a direct (-dilution) assay. Our main interest lies in
estimating the relative potency p and verifying this fundamental assumption.
Parametric procedures for these inference problems are usually based on the
assumption that F r is normal or lognormal (or sometimes, logistic or log-
logistic). These procedures are discussed in detail in Finney (1964). The form of
the estimator of p depends explicitly on the assumed form of the d.f. Fr, and
these parametric estimates are generally not very robust against departures
from the assumed form of the tolerance d.f. For example, if we assume that F r
is normal (only justified if the standardized mean is sufficiently large so that
F r O ) is very small), then the estimate of p comes out as the ratio of the sample
means for the two preparations, while, if Fr is taken as the log-normal d.f.,
then the estimator is the ratio of the two geometric means, and these are
generally not the same. Rank based estimates of relative potency have been
considered by Sen (1963), Shorack (1966) and Rao and Littell (1976), among
others. These estimates are invariant under the choice of any monotone
transformation on the dose (called dosage), and, besides being robust, are
generally quite efficient for normal, log-nominal, logistic or other common
forms of the tolerance distributions.
For convenience of description, we choose the dosage as equal to log-dose.
Let X~ = log X~, i = 1. . . . . m, be the dosages for the standard preparation and
let F*s(X) be the d.f. of the X*. Then,
Similarly, let Y* = log Y~, i = 1. . . . . n, be the dosages for the test preparation
and F~-(x) = P{Y~}' ~ x} = FT(ex) be the corresponding d.f. Then, by (6.1), we
conclude that for the dilution-model,
for every x'E ( - % ~). Thus, the two d.f.'s F~ and F ~ differ only by a shift
d = log p, and for this shift model, efficient rank based estimators have already
been considered in Chapter 21 (by Jure~kovfi). As in Section 5, we consider the
two-sample linear rank statistic TN(b) = E'f=I aN(RNi(b)), b E ( - % ~), where
N = m + n, aN(1) . . . . , aN(N) are monotone (/~) scores and RNi(b) is the rank
of X i - b among X ~ - b , . . . , X * - b , YT . . . . , Y * , for i = l . . . . . m. Then,
TN(b) is ~ in b, and we define an R-estimator ~N(R) of A by
~N(R) = (sup{b: TN(b) > tiN}+ inf{b: TN(b) < tiN})/2, (6.3)
where tin = N-IE~=I aN(i), and, we may set, without any loss of generality,
tin = 0. In particular, if we choose the two-sample W i l c o x o n - M a n n - W h i t n e y
statistic, i.e., aN(k) = (k - (N + 1)/2)/(N + 1), k = 1 . . . . , N, then, (6.3) simplifies
to the median of the mn differences X* - Y~, 1 ~< i ~< m, 1 ~<j ~< n. This simple
estimator has some optimal properties when F ] is a logistic d.f. Even other-
wise, it is a very simple, robust and efficient estimator of A ; the estimator of p
can simply be obtained by taking the anti-log of this estimator. For scores other
than the ones relating to the median and Wilcoxon statistics (worked out by
Sen, 1963), an explicit solution for (6.3) may not be available. But, starting with
the Wilcoxon scores estimator, an iteration procedure may be employed and
a few iteration steps should yield the desired estimator upto a given level of
accuracy. Let us next consider the confidence intervals for the relative potency.
In this respect, the lack of robustness with the parametric procedure is more
noticeable. Attainment of the specified coverage probability by the prescribed
confidence interval, in the parametric case, may be seriously affected by an
incorrect assumption on the form of Fs (or F ] ) . The picture is quite different
for the nonparametric case. Distribution-free confidence interval for A can
again be obtained by using linear rank statistics, and this, in turn, provides the
confidence interval for the relative potency p. Note that when p obtains, TN(A)
has t h e same distribution as of TN(0) under H0: A = 0, and the latter is
independent of the underlying F ] . Hence, we can always find two constants t(n1)
and t~), depending on m, n, oz and the scores aN(l) . . . . . aN(N), such that
The desired confidence interval for A is given by IN in (6.6), and the confidence
interval for p is obtained from (6.6) by replacing ~N.L and /(N,u by their
728 PranabKumar Sen
Also, let/~/~i be the rank of X * among the N aligned observations in (6.7), for
i = 1. . . . , rn. Consider then a two-sample rank statistic T~ = m -1 ~iml a~q(l~Ni),
where the scores are symmetric in the sense that a~(k)= a N N - k + 1), for
every k (~<N), as is typically the case with rank tests for scale. Then, using the
linearity results of Jure~kov~ (1969), it follows that whatever be the unknown p,
under (6.1), (Nm[n)l/Z(7"~t-gt~)/A~ has asymptotically normal distribution with
0 mean and unit variance, where aN--*-N-1EN=la~(i ) and A~r2--
(N - 1)~1 E/u=l(a~v(i) - d~) 2. Hence, T~ is asymptotically distribution-free, under
(6.1), and an asymptotic test for the fundamental assumption in (6.1) can be
Nonparametric procedures for some miscellaneous problems 729
based o.n T~v, using its asymptotic normality. The assumption that F ] in (6.2) is
a symmetric d.f. may not always be very realistic in bio-assays, where the
tolerance d.f.'s may sometimes be so skewed that even the log-dose trans-
formation may not render symmetry to the transformed d.f. In such a case, use
of the aligned rank test (as in above) may not be very suitable on the ground of
robustness (against asymmetry of the d.f.'s). We may, however, use some
alternative nonparametric tests which do not require the symmetry of the d.f.'s;
these tests may not be otherwise full efficient for some specific models. Thus,
we may have to make a compromise between high efficiency for specific
alternatives and validity for a broad class of tolerance d.f.'s.
Let ~b(a, b; c, d) be equal to s i g n ( [ a - b l - l c - d l ) , and consider the two-
sample (generalized) U-statistic
Then, under (6.2), whatever be the value of p, when F~ is symmetric and has
a continuous density function almost everywhere, we have, for every d/> 0,
where the errors es and er both have the common (unknown) d . f . G . If the test
preparation behaves as a dilution (or concentration) of the standard one, we
have then
where p (>0) is the relative potency of the test preparation with respect to the
standard one, and, the equality of the regression coefficients constitutes the
fundamental assumption of this parallel line assay. We consider some non-
parametric tests for the validity of the fundamental assumption and some
nonparametric estimates of the relative potency. These were studied earlier by
Sen (1971). Consider a symmetrical 2k-point design (for some k i> 2) with k
doses of each preparation such that the successive doses bear a constant ratio
Nonparametric procedures for some miscellaneous problems 731
D (>0) to one-another, and no (i>1) subjects are used for each dose. For the
standard preparation, the k doses are denoted by Z,j = a D j-l, a > 0 , for
j = 1 , . . . , k, while, for the test preparation, these are a b D j-', b > 0 , for
j = 1 , . . . , k. Note that each preparation is administered to kno = n subjects.
The dosages for the standard and test preparations are
With this change in the scale and origin of the dosage, we thus obtain the
following two sets of responses:
y(l)
In 0
V(1)
~2n 0
... r(1)
kn 0
y!2)i,o V(2)
2n 0
... Y!2)
oan
First, to test the validity of the fundamental assumption i.e., the parallelism
of the two regression lines in (6.14), we proceed as in Sen (1971) and define the
set of divided differences as follows: Let
Then, as in Sen (1968), the median of these k ( k - 1)n 2 entries is the pooled
sample estimator of the hypothesized common value/3, and is denoted by/3*.
Let then
t
no n0
U~)(fl *) = ~ ~] ~'~ sign(Yl~)- r ~ ) - f l * ( x , - xj)), i = 1, 2 .
l~j<l~k r=l s=l
(6.17)
Also, let
Under the null hypothesis of the equality of/3s and/3r, S, has closely the chi
square distribution with 1 degree of freedom (viz., Sen, 1971) and the test is
quite robust in character. Instead of using this simple test statistic based on
the aligned Kendall tau statistics, one may consider alternative tests based on
general linear rank statistics aligned in a similar manner; these tests are worked
out in detail in Sen (1969) and are easily adaptable here.
To estimate the relative potency p, we note that when/3s =/3T = /3,
logo(bp*.) = 6 . */ 1 3 *. . (6.23)
Here also, instead of using the simple Wilcoxon scores estimator in (6.22) one
may use a general R-estimator based on the aligned observations in (6.21).
Asymptotic properties of the estimator p* have been studied by Sen (1971).
In the above development, for simplicity, we have considered a symmetric
2k-point design. The picture becomes little more computationally involved
when the number of subjects for each dosage is not the same or the design is
not a symmetric one. However, looking at (6.11), we may note that essentially
the problem is to test for parallelism of two regression lones and to estimate
the relative potency by a ratio formula as in (6.12). Hence, the R-estimators for
linear models, see for example Chapter 11 (by Adichie), are generally adapt-
able for this specific problem. In planned bio-assays, of course, one can adopt a
symmetric design, and, the solutions considered here remain useful.
So far, we have considered the case where the dosage is given by log-dose.
Often, the dosage is taken as (dose) x, for some A > 0. In such a case, referred to
the dosage-response regressions in (6.11), we have
Nonparametricproceduresfor some miscellaneousproblems 733
Since the relative potency p is expressible in terms of the ratio of the two
slopes/Jr and fls, the assay is termed a slope-ratio assay, and the equality of the
two intercepts as and a r constitutes the fundamental assumption of the
slope-ratio assay.
First, we consider a nonparametric test for the validity of the fundamental
assumption. For each i (= 1, 2), consider the (~)n 2 divided differences in (6.16),
and the median of these is taken as the point estimator of the respective slope.
We denote these estimates by /J~n and fl~.,, respectively. Consider then the
residuals, defined as in (6.21), where fl* is replaced by/3~, (for i = 1) and fl~,
(for i = 2). Then, for each i (= 1, 2), consider the (~"0+1) midranges
The median of these 2(~"0+1) midranges (for the combined sample) is denoted
by a* and is taken as the pooled sample estimator of the hypothesized
common value a. Subtracting a .* further from the 17~i~ jr, we denote the ultimate
residuals as YJ~), for r = 1. . . . , no, j = 1. . . . , k and i = 1,2. Let ff',s and ff',r
be respectively the Wilcoxon signed rank statistics based on the ultimate
residuals for the standard and the test preparation. Then, as in Sen (1972a), we
consider the test statistic
where N = kno. Under the null hypothesis is equal intercepts, ON has closely
chi square distribution with 1 degree of freedom. It is a robust statistic and the
test based on ON is asymptotically distribution-free. In the above discussion,
we have confined ourselves to a 2k-point design. An analogous statistic can be
worked out for the (2k + 1)-point design. Also, instead of the Wilcoxon scores
procedures, one may use a general rank procedure based on R-estimates of the
two regression slopes and a general R-estimator of the intercept. These are
worked out in Sen (1972b) and are comparatively more involved. However, in
principle, these run on parallel lines, and hence, we omit the details. We
proceed on to the estimation of the relative potency. Since in a slope-ratio
assay, the constant A (>0) is specified, we may define iS= p~, and, we may
proceed to estimate/~ as well. We define
k
E {(~_z~l)_ O/ -- ~ s X j ) 2 + (~1}2)_ Ol -- ~TX.i) 2} (6.27)
.i=1
with respect to the unknown a,/3s and fir, we obtain the following estimator
a and b are the scale factors appearing in the doses for the standard and test
preparation. A similar formula works out well for the (2k + 1)-point design
(viz., Sen, 1972a). Asymptotic properties of the estimator iS* (of iS) have been
studied by Sen (1972a). In this context also, one may use some general
R-estimators of within cell locations and use them in (6.27) for deriving the
corresponding estimator of tS. The procedure remains the same.
W e conclude this section with some remarks on the nonparametric pro-
cedures for the indirect quantal assays. In this indirect assay, the response is
quantal (i.e,, all or nothing) in nature, so that for each preparation and each
dose, among the subjects administered, some manifest a certain reaction and
the others not. For a given dose z, if P ( z ) stands for the probability of a
response, then, one is interested in knowing the dependence of P ( z ) on the
dose z. In particular, if we assume that P ( z ) is a monotone function of z, and
there exists a unique ~ such that
then, G is called the 100a % effective dose; for a = 1, it is called the median
effective dose. If the quantal response relates to the death, ~1/2 is termed the
median lethal dose. Estimation of the median effective dose is the main task in
a quantal assay. A detailed account of the parametric theory (based on specific
forms of P(.), such as the (log)-normal, logistic etc.) is available with Finney
(1964, Chapter 17). There are a few nonparametric estimates not fully
explored, and, we shall comment briefly on them.
Let xl . . . . . Xk be the k (/>2) doses which we assume to be equally spaced
(either on the linear or logarithmic scale). Suppose that each dose is ad-
ministered to n (/>1) subjects. Let ~ j be the response of the j-th subject at the
dose level Xg, for j = 1. . . . . n and i = 1 . . . . . k. The U~j, 1 ~<j ~< n are i.i.d.r.v.
having a binomial distribution P ( U 0 = O) = 1 - P ( U 0 = 1) = 1 - ~i, for i =
1. . . . . k. Note that ~ = n - 1 ~ = 1 U/j is an unbiased estimator of 7r~, for i =
1 , . . . , k. W e write
(ii) The Reed-Muench estimator. Note that x~ < < Xk and by assumption
the rri are therefore ordered too. In such a case, the sample counterparts U~ are
stochastically_ ordered. If there exists a positive interger m (<k), such that
( U , + . . . + Urn)= (k - m + 1 ) - (Uk + ' ' " + Uk), then, the R e e d - M u e n c h esti-
mator is
~-(2)
1/2 = Xk -- d ( k - m ) (6.35)
If for no m, the strict equality sign holds (for the ~ ) , then, one may get two
consecutive values of m for which opposite inequalities hold, and, one may
obtain the estimator (parallel to (6.35)) by linear interpolation. This situation is
more likely to occur as the ~ are r.v.'s.
(iii) The Dragstedt-Behrens estimator. Consider the partial sequence
~(31
1/2 = Xk -- d ( k - m ) + d ( - G ) / ( G + I - 0~ ) (6.38)
Rderences
[1] Adichie, J. N. (1978). Rank tests of subhypotheses in the general linear regression. Ann.
Statistist. 6, 1012-1026.
[2] Anderson, P. K. and Gill, R. D, (1982). Cox's regression model for counting processes: a large
sample study. Ann. Statist. 10, 1100-1120.
[3] Bhattacharya, P. K. and Frierson, D. (1981), A nonparametric control chart for detecting
small disorders. Ann. Statist. 9, 544-554.
[4] Bhattacharyya, G. K, and Johnson, R. A. (1968). Nonparametric tests for shift at unknown
time point. Ann. Math. Statist. 39, 1731-1743.
[5] Brown, R. L., Durbin, J. and Evans, J. M. (1975). Techniques for testing constancy of
regression relationship over time. (With discussion). J. Roy. Statist. Soc. Ser. B 37, 149-192.
[6] Chatterjee, S. K. and Sen, P. K. (1973). Nonparametric testing under progressive censoring.
Calcutta Statistical Assoc. Bull. 22, 13-50.
[7] Chernoff, H. and Zacks, S. (1964). Estimating the current mean of a normal distribution which
is subjected to change in time. Ann. Math. Statist. 35, 999-1018.
[8] Cox, D. R. (1972). Regression models and life tables. (With discussion). J, Roy. Statist. Soc.
Set. B. 34, 187-220.
[9] Davis, C. E. (1978). A two-sample Wilcoxon test for progressively censored data. Comm.
Statist. Set. A 7, 389-398.
[10] DeLong, D. M. (1980). Some asymptotic properties of a progressively censored nonparametric
test for multiple regression. J. Multivar. Anal. 10, 363-370.
[11] DeLong, D. M. (1981). Crossing probabilities for a square root boundary by a Bessel process.
Comm. Statist. Set. A 10, 2197-2213.
[12] Finney, D. J. (1964). Statistical Method in Biological Assay. Charles Griffin, London, 2nd ed.
[13] Gerig, T. M. (1969). A multivariate extension of Friedman's X2-test. J. Amer. Statist. Assoc.
64, 1595-1608.
[14] Gerig, T. M. (1975). A multivariate extension of Friedman's xZ-test with random covariates. J.
Amer. Statist. Assoc. 70, 443-447.
[15] Ghosh, M, Grizzle, J. E. and Sen, P. K. (1973). Nonparametric methods in longitudinal
studies. J. Amer. Statist. Assoc. 68, 29-36.
[16] Hackl, P. (1980). Testing the Constancy of Regression Relationships Over Time. Vandenhoeck
and Ruprecht, Gottingen,
Nonparametric procedures for some miscellaneous problems 737
[17] Jure~kov~i, J. (1969). Asymptotic iinearity of a rank statistic in regression parameter. Ann.
Math. Statist. 40, 1889-1900.
[18] Jurerkovfi, J. (1971). Nonparametric estimates of regression coefficients, Ann. Math. Statist.
42, 1328-1338.
[19] Khatri, C. G. (1966). A note on a MANOVAmodel applied to problems in growth curves. Ann.
Inst. Statist. Math. 18, 75-86.
[20] Kleinbaum, D. G. (1973). A generalization of the growth curve model which allows missing
data. J. Multivar. Anal. 3, 117-124.
[21] Koziol, J. A. and Byar, D. P. (1975). Percentage points of the asymptotic distributions of one
and two-sample K - S statistics for truncated or censored data. Technometrics 17, 507-510.
[22] Koziol, J. A., Maxwell, D. A., Fukushima, M., Colmerauer, M. E. M. and Pilch, Y. H. (1981).
A distribution-free test for tumor growth curve analysis with applications to an animal tumor
immunotherapy experiment. Biometrics 37, 383-390.
[23] Koziol, J. A. and Petkau, A. J. (1978). Sequential testing of equality of two survival
distributions using modified Savage statistics. Biometrika 65, 615-623.
[24] Lehmann, E. L. (1951). Consistency and unbiasedness of certain nonparametric tests. Ann.
Math. Statist. 22, 165-179.
[25] Lombard, F. (1981). An invariance principle for sequential nonparametric test statistics under
contiguous alternatives. South African Statist. J. 15, 129-152.
[26] Lombard, F. (1983). Asymptotic distributions of rank statistics in the change-point problem.
South African Statist. Jour. 17.
[27] Majumdar, H. and Sen, P. K. (1977). Rank order tests for grouped data under progressive
censoring. Comm. Statist. Set. A. 6, 507-524.
[28] Majumdar, H. and Sen, P. K. (1978a). Nonparametric tests for multiple regression under
progressive censoring. J. Multivar. AnaL 8, 73-95.
[29] Majumdar, H. and Sen, P. K. (1978b). Nonparametric testing for simple regression under
progressive censoring with staggering entry and random withdrawal. Comm. Statist. Set. A. 7,
349-371.
[30] Miller, R. G., Jr. (1973). Nonparametric estimators of the mean tolerance in bioassay.
Biometrika 60, 535-542.
[31] Page, E. S. (1955). A test for a change in a parameter occurring at an unknown point.
Biometrika 42, 523-526.
[32] Page, E. S. (1957). On problems in which a change of parameters occurs at an unknown time
point. Biometrika 44, 248-252.
[33] Pettitt, A. N. (1979). A nonparametric approach to the change point problem. Appl. Statist.
28, 126-135.
[34] Potthoff, R. F. and Roy, S. N. (1964). A generalized multivariate analysis of variance model
especially useful for growth curve problems. Biometrika 51, 313-326.
[35] Purl, M. L. and Sen, P. K. (1969). Analysis of covariance based on general rank scores. Ann.
Math. Statist. 40, 610-618.
[36] Puri, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis. Wiley,
New York.
[37] Quade, D. (1967). Rank analysis of covariance. J. Amer. Statist. Assoc. 62, 1187 ft.
[38] Rao, C. R. (1965). The theory of least squares when the parameters are stochastic and its
application to the analysis of growth curves. Biornetrika 52, 447-458.
[39] Rao, P. V. and Littell, R. C. (1976). An estimator of relative potency. Comm. Statist. Ser. A 5,
183-189.
[40] Schechtman, E. and Wolfe, D. A. (1981). Distribution-free tests for the change-point problem.
Tech. Report, Ohio State Univ.
[41] Schey, H. M. (1977). The asymptotic distribution of the one-sided Kolmogorov-Smirnov
statistic for truncated data. Comm. Statist. Ser. A 6, 1361-1366.
[42] Sen, A. and Srivastava, M. S. (1975). On tests for detecting changes in mean. Ann. Statist. 3,
98-108.
[43] Sen, P. K. (1960). On some convergence properties of U-statistics. Calcutta Statist, Assoc.
Bull. 10, 1-18.
738 Pranab Kumar Sen
[44] Sen, P. K. (1963). On the estimation of relative potency in dilution (-direct) assays by
distribution-free methods. Biometrics 19, 532-552.
[45] Sen, P. K. (1964). Tests for the validity of fundamental assumption in dilution (-direct) assays.
Biometrics 20, 770-784.
[46] Sen, P. K. (1965). Some further applications of nonparametric methods in dilution (-direct)
assays. Biometrics 21, 799-810.
[47] Sen, P. K. (1968). Estimates of the regression coefficient based on Kendall's tau. J. Amer.
Statist. Assoc. 63, 1379-1389.
[48] Sen, P. K. (1969). On a class of rank order tests for the parallelism of several regression lines.
Ann. Math. Statist. 40, 1668-1683.
[49] Sen, P. K. (1971). Robust statistical procedures in problems of linear regression with special
reference to quantitative bio-assays, I. Internat. Statist. Rev. 39, 21-38.
[50] Sen, P. K. (1972a). Robust statistical procedures in problems of linear regression with special
reference to quantitative bio-assays, II. Internat. Statist. Rev. 40, 161-172.
[51] Sen, P. K. (1972b). On a class of aligned rank order tests for the identity of the intercepts of
several regression lines. Ann. Math. Statist. 43, 2004-2012.
[52] Sen, P. K. (1973). Some aspects of nonparametric procedures in multivariate statistical
analysis. In Multivariate Statistical Inference (ed: D. G. Kabe and R. P. Gupta), North-
Holland, Amsterdam, pp. 230-240.
[53] Sen, P. K. (1976). Asymptotically optimal rank order tests for progressive censoring. Calcutta
Statist. Assoc. Bull. 25, 65-78.
[54] Sen, P. K. (1977). Tied-down Wiener process approximations for aligned rank order statistics
and some applications. Ann. Statist. 5, 1107-1123.
[55] Sen, P. K. (1978). Invariance principles for rank statistics revisited. Sankhya Ser. A 40,
215-236.
[56] Sen, P. KI (1979). Rank analysis of covariance under progressive censoring. Sankhya Ser. A
41, 147-169.
[57] Sen, P. K. (1980). Asymptotic theory of some tests for a possible change in the regression
slope occurring at an unknown time point. Z. Wahrsch. Verw. Geb. 52, 203-218.
[58] Sen, P. K. (1981a). Sequential Nonparametrics: Invariance Principles and Statistical Inference.
Wiley, New York.
[59] Sen, P. K. (1981b). Rank analysis of covariance under progressive censoring, II. In: M. Csorgo
et al., eds., Statistics and Related Topics. North-Holland, Amsterdam, pp. 285-295.
[60] Sen, P. K. (1982a). Invariance principles for recursive residuals. Ann. Statistist. 10, 307-312.
[61] Sen, P. K. (1982b). Asymptotic theory of some tests for constancy of regression relationship
over time. Math. Operat. Statist., Ser. Statist. 13, 21-32.
[62] Sen, P. K. (1982c). Tests for changepoints based on recursive U-statistics. Sequential Anal. 1,
263-284.
[63] Sen, P. K. (1983). Some recursive residual rank tests for change points. In: M. H. Rizvi et al.,
eds., Recent Advances in Statistics: Papers in Honor of Herman Chernoff's Sixtieth Birthday.
Academic Press, New York, pp. 371-391.
[64] Sen, P. K. (1984a). Subhypotheses testing against restricted alternatives for the Cox regression
model. J. Statist. Plann. Infer. 10, 31-42.
[65] Sen, P. K. (1984b). The Cox regression model, random censoring and locally optimal rank
tests. J. Statist. Plann. Infer. 9, 355-366.
[66] Sen, P. K. (1984c). On a Kolmogorov-Smirnov type aligned test. To appear in Statist. Probability
Letters 2.
[67] Sen, P. K. and Puff, "M. L. (1970). Asymptotic theory of likelihood ratio and rank order tests
in some multivariate linear models. Ann. Math. Statist. 41, 87-100.
[68] Sen, P. K. and Purl, M. L. (1977). Asymptotically distribution-free aligned rank order tests for
composite hypotheses for general multivariate linear models. Z. Wahrsch. Verw. Geb. 39,
175-186.
[69] Sinha, A. N. and Sen, P. K. (1979a). Progressively censored tests for clinical experiments and
life testing problems based on weighted empirical distributions. Comm. Statist. Ser. A 8,
871-898.
Nonparametric procedures for some miscellaneous problems 739
[70] Sinha, A. N. and Sen, P. K. (1979b). Progressively censored tests for multiple regression
based on weighted empirical distributions. Calcutta Statist. Assoc. Bull. 28, 57-82.
[71] Sinha, A. N. and Sen, P. K. (1982). Tests based on empirical processes for progressive
censoring schemes with staggering entry and random withdrawal. Sankhya Ser. B 44, 1-18.
[72] Shiryayev, A. N. (1963). On optimum methods in quickest detection problems. Theor. Probability
Appl. 8, 22-46.
[73] Shiryayev, A. N. (1978). Optimal Stopping Rules. Springer-Verlag, New York.
[74] Shorack, G. R. (1966). Graphical procedures for using distribution-free methods in the
estimation of relative potency in dilution (-direct) assays. Biometrics 22, 610-619.
[75] Tsiatis, A. (1981a). A large sample study of Cox's regression model. Ann. Statist. 9, 93-108.
[76] Tsiatis, A. (1981b). The asymptotic distribution of the efficient score test for the proportional
hazard model calculated over time. Biometrika 68, 311-315.
[77] Woolson, R. F. and Sen, P. K. (1974). Asymptotic comparison of a class of multivariate
multi-parameter tests. Comm. Statist. 3, 813-828.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 "~1"]
.../IkJ
Elsevier Science Publishers (1984)741-754
M i n i m u m Distance Procedures
Rudolf Beran
1. Introduction
This research was supported in part by National ScienceFoundation Grant MCS 80-02648.
741
742 Rudolf Beran
that the functional being estimated remain sensible and that the estimate be
precise. Minimum Cram6r-von Mises, or Kolmogorov-Smirnov, or Hellinger
distance estimates share this property.
The robustness of certain minimum distance estimates is reflected by the
sensitivity of the corresponding minimum distance tests to most departures
from the parametric model. For tests based on the Kolmogorov-Smirnov or
Cram6r-von Mises distances, this broad sensitivity was recognized relatively
early. Unfortunately, finding critical values for these tests is not easy; even
their asymptotic theory, under the null hypothesis that the parametric model
holds, is complex. Despite some analytical progress on the problem, this
circumstance has greatly hampered the use of these otherwise appealing tests.
Recent research suggests a new approach to finding critical values: estimate
the null distribution of the test statistic by parametric bootstrapping. One
version of the procedure runs as follows. Let Tn denote the minimum distance
estimate of 0, calculated from the observed sample of size n. Draw m
pseudo-random samples of size n from the distribution FT. For each sample,
evaluate the minimum distance between sample and parametric model. The
empirical distribution of the m values of the minimum distance so realized is an
estimate of the null distribution of the test statistic. It yields estimated critical
values for the minimum distance goodness-of-fit test. Current experience with
bootstrapping suggests taking m between 100 and 1000.
In what follows, we will examine more closely the minimum distance
estimates and tests based on the Cram6r-von Mises metric
which implies that min, h (t, Gn) ~ min, h (t, Fo) = 0 or, equivalently,
h(T(Gn), G . ) ~ O. Since (2.1) also implies [h(T(Go), On)- h(T(G,), Fo)l ~ O, we
conclude that
2.2. Differentiability of T
Let us regard the elements of O as k x 1 vectors. Suppose the parametric
model has three additional properties for every 0 in the interior of O:
(C) There exists a k x 1 vector function 60 in Lk(/~) such that
where
po(x) = f I(t>~ X)yo(t)d/z(t)- f f I(t >1X)yo(t)d/z(t)dFo(x). (2.12)
for every positive c. Hence, using the definition of B,(O, c) and the triangle
inequality,
~o = ~ pop'odFo. (2.19)
under G. E B,(O, c). This implies the asserted locally uniform asymptotic
normality.
be the risk associated with T*. Since T(G) is differentiable at Fo, in the sense
of equation (2.11), an argument based on Hgtjek's asymptotic minimax theorem
yields the following lower bound on maximum risk over B,(O, c) when 0 E
int(O):
748 Rudolf Beran
for every positive c. Thus, the minimum distance estimate T, attains the lower
bound (2.24) on maximum risk over the contamination neighborhood t3,(0, c).
We have replaced the classical problem of estimating 0 in the parametric
model 1=;oby the more realistic problem of estimating the minimum Cramrr-
von Mises distance functional T(G) for underlying distributions G near Fo. If c
and n are large, the minimum distance estimate T, = T(P,) is approximately
minimax for T(G) over all distribution functions G in the contamination
neighborhood B~(O, c) about Fo. This property may be interpreted as quan-
titative robustness of the estimate T,. Similar results are available for minimum
Hellinger distance estimates. Whether minimum Kolmogorov-Smirnov dis-
tance estimates are asymptotically minimax is not known, primarily because the
asymptotic distributions of these estimates are not normal.
Suppose the observations Xa, X2, ., 32, are i.i.d, random variables. To test the
null hypotheses that the common distribution function of the observations
belongs to the parametric model {Fo: 0 E O}, it is natural to consider the
statistic
members of the parametric model. We will pursue this idea by finding the
asymptotic distribution of S, under null hypotheses and local alternatives. The
asymptotics suggests two ways to estimate critical values of S,. One way is
analytic and complex; the other is the parametric bootstrap.
nt/2(T.- O.)= nl/2 1 Pood(F.- G.)+ nl/2 f poo d(G,- Foo) + op(1)
under K,. Since the random variables {nl/2(Zn-Oo)} a r e tight, assumption (C)
yields
where Y(x, 0o) is a gaussian process with mean zero and covariance function
More explicit representations are available for the random variables S and
S(b). Let the {Ak(00); k ~>1} denote the distinct, nonzero eigenvalues of
C(x, y, 00), ordered so that Al(00)>A2(00)>"" > 0 . Let rk(Oo) be the multi-
plicity of Ak(00) and let Ak(Oo)b~(Oo) be the squared length in L2(/x) of the
projection of boo onto the eigenspace of Ak(00). Then
c
S(b) = ~ A~'2(rk, b2), (3.10)
k=l
where the { ) ( 2 ( F k , bE)} are independent random variables with noncentral chi-
square distributions, degrees-of-freedom {rk}, and noncentrality parameters
{bk}. Similarly,
S = ~ AkX2(rk) (3.11)
k=l
=Aoxp[ 1 (3.14)
While exact calculation of J,(x, T,) is usually impractical, Monte Carlo ap-
proximations are fairly straightforward. For instance: Draw rn pseudorandom
samples of size n from the distribution FT. For each sample, evaluate S,. The
empirical distribution of the m values of S, so realized is an approximation to
Jn(x, 7",) which readily yields an approximation to cn(a). Current experience
with bootstrapping suggests taking m between 100 and 1000.
The corresponding goodness-of-fit test ~b, is to reject H, if Sn > c,(a) and to
752 Rudolf Beran
P
under both Hn and Kn. In particular, (3.18) entails c,(a)---~ c(a, 00) under both
Hn and Kn, because J(x, 00) is continuous and strictly monotone for all positive
X.
4. Sources
References
Beran, R. (1975). Tail probabilities of noncentral quadratic forms. Ann. Statist. 3, 969-974.
Beran, R. (1977). Minimum Hellinger distance estimates for parametric models. Ann. Statist. 5,
445-463.
Beran, R. (1981). Efficient robust estimates in parametric models. Z. Wahrsch. Verw. Gebiete 55,
91-108.
Bickel, P. J. and Freedman, D. A. (1981). Some asymptotic theory for the bootstrap. Ann. Statist.
9, 1196-1217.
Blackman, J. (1955). On the approximation of a distribution function by an empirical distribution.
Ann. Math. Statist. 26, 256--267.
Bolthausen, E. (1977). Convergence in distribution of minimum distance estimators. Metrika 24.
Durbin, J. and Knott, M. (1972). Components of Cramrr-von Mises statistics I. J. Roy. Statist. Soc.
Ser. B 34, 290-307.
Dvoretzky, A., Kiefer, J. and Wolfowitz, J. (1956). Asymptotic minimax character of the sample
distribution function and of the classical multinomial estimator. Ann. Math. Statist. 27, 642-669.
Efron, B. (1979). Bootstrap methods: another look at the jackknife. Ann. Statist. 7, 1-26.
Hoeffding, W. (1964). On a theorem of V. M. Zolotarev. Th. Probab. Appl. 9, 89-92.
Holm, S. (1976). Discussion to a paper by P. J. Bickel. Scand. J. Statist. 3, 158-161.
Kac, M., Kiefer, J. and Wolfowitz, J. (1955). On tests of normality and other tests of goodness-of-
fit ba~ed on distance methods. Ann. Math. Statist. 26, 189-211.
Koshevnik, Yu. A. and Levit, B. Ya. (1976). On a nonparametric analog of the information matrix.
Th. Probab. Appl. 21, 738-753.
Millar, P. W. (1981). Robust estimation via minimum distance methods. Z. Wahrsch. Verw. Gebiete
55, 73-84.
Neyman, J. (1949). Contributions to the theory of the X2 test. Proc. First Berkeley Syrup. Math.
Statist. Probab. 239-273, University of California Press.
Parr, W. C. and Schucany, W. R. (1980). Minimum distance and robust estimation. J. Amer. Statist.
Assoc. 75, 616-637.
Parr, W. C. (1980). Minimum distance estimation: a bibliography. Unpublished preprint.
Pollard, D. (1980). The minimum distance method of testing. Metrika 27, 43-70.
Rao, P. V., Schuster, E. F. and Littel, R. C. (1975). Estimation of shift and center of symmetry
based on Kolmogorov-Smirnov statistics. Ann. Statist. 3, 862-873.
Wolfowitz, J. (1957). The minimum distance method. Ann. Math. Statist. 28, 75-88.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 '~ 1
Elsevier Science Publishers (1984) 755-770 ,J 1
S. R a o J a m m a l a m a d a k a
1. Introduction
In many natural and physical sciences the observations are in the form of
d i r e c t i o n s - directions either in plane or in three-dimensional space. Such is the
case when a biologist investigates the flight directions of birds or a geologist
measures the paleomagnetic directions or an ecologist records the directions of
wind or water. A convenient sample frame for two-dimensional directions is
the circumference of a unit circle centered at the origin with each point on the
circumference representing a direction; or, equivalently, since magnitude has
no relevance, each direction may be represented by a unit vector. Such data on
two-dimensional directions will be called 'circular data'. Similarly the surface of
a unit sphere in three-dimensions may be used as the sample space for
directions in space, with each point on the surface representing a three-
dimensional direction; or alternatively, such a direction may be represented by
a unit vector in three-dimensions. Such data is referred to as the 'spherical
data'. Also, studies on any periodic p h e n o m e n a with a known period (such as
circadian rhythms in animals) can be represented as circular data, for instance
by identifying each cycle or period with points on the circumference, pooling
observations over several such periods, if necessary.
The analysis of directional data gives rise to a host of novel statistical
problems and does not fit into the usual methods of statistical analysis which
one employs for observations on the real line or Euclidean space. Since there is
no natural zero-direction, any method of numerically representing a direction
depends on the arbitrary choice of this zero direction. It is important that the
statistical analyses and conclusions remain independent of this arbitrary zero
direction. Unfortunately, however, usual statistics like the arithmetic mean and
standard deviation (and all the higher moments) which one employs in linear
statistical analyses fail to have this required rotational invariance so that one is
forced to seek alternate statistics for describing directional data. T o do this, we
treat each direction as a unit vector in plane or space. O n e computes the
resultant vector, whose direction provides a meaningful measure of the average
direction in unimodal populations. The length of this vector resultant measures
755
756 s. Rao Jammalamadaka
the concentration of the data since observations closer together lead to a longer
resultant.
O n e of the basic parametric models for unimodal directional data is called
the von Mises-Fisher distribution and is discussed briefly in Section 2. This
plays as prominent a role in a directional data analysis as does the normal
distribution in the linear case. Sections 3 and 4 review nonparametric methods
for circular (two-dimensional) and spherical (three-dimensional) data, respec-
tively. Section 3 is considerably larger since m o r e distribution-free methods
have been developed for circular data. The reader may consult the b o o k s b y
Mardia (1972), Batschelet (1981) and Watson (1983) for a m o r e complete
introduction to this novel area of statistics.
cp(K) = (2.2)
where L(K) is the modified Bessel function of the first kind and order r. When
p = 2, this density reduces to
where 0 ~< a < 2r and 0 ~</x < 2~" are the angles (in polar coordinates) cor-
responding to x and /, in (2.1). This density was introduced by von Mises
(1918) to test the hypothesis that the atomic weights are integers. When p -- 3,
Fisher (1953) studied the pdf with zero mean direction,
where 0 ~< a < ~ and 0 ~< fl < 2~ are the polar coordinates of x. Fisher's 1953
p a p e r is the first comprehensive treatment of the sampling distributions and
statistical inference for the spherical model (2.4).
If the concentration p a r a m e t e r K = 0 in (2.1), this reduces to the uniform
Nonparametric methods in directional data analysis 757
where R is the vector resultant of the sample, R is a sufficient statistic for this
family of distributions. The Maximum Likelihood Estimator of/~ is given by
(I/IR[)'R, where [RI is the length of the resultant. The concentration
parameter K is estimated by solving the equation
I,(~) _- IRI
I0(K) n
1 IRI
coth K - - =
K n
r Jo(rt)J~(t)t dt
when p = 2, and
n
when p = 3. Here (x) = x if x > 0 and 0 otherwise. This test of 'no preferred
direction', i.e., of H0: K = 0 which is based on IR[, is known as Rayleigh's test.
Though considerable statistical theory has been developed for the von
Mises-Fisher distribution and to a much lesser extent for some of the other
parametric models for directions, these models may not provide an adequate
description of the data or the distributional information may be imprecise. For
instance, information about the unimodality or axial symmetry that a particular
parametric model assumes may be lacking or might be inappropriate for a
758 S. Rao Jammalamadaka
given data set. The search for methods which are robust leads naturally, as in
linear statistical inference, to techniques which are nonparametric or model-
free. In linear inference, there are a number of considerations on which one
can justify for instance an assumption of normality as for example when one
deals with averages, or when the samples are large enough. Unfortunately,
there is no corresponding rationale for invoking the von Mises-Fisher dis-
tribution and thus the need for model-free methods might indeed be stronger
in directional data analysis.
This section will be subdivided into three subsections dealing with one-, two-
and multi-sample nonparametric techniques.
As in the linear case, the goodness-of-fit problem of testing whether the sample
came from a specified circular distribution can also be reduced to testing
uniformity on the circle.
We see~ rotationally invariant tests, i.e., tests invariant under changes in
zero direction as well as the sense of rotation (clockwise or anticlockwise).
There are three broad groups of tests for this problem, which are described
below.
(i) Tests based on sample arc lengths or spacings. If o~(1) ~ ~ O/(n) denote
" " "
Di = ( ~ ( i ) - - a(i-1)), i = 1. . . . . n, (3.3)
with at0)= ( a ( , ) - 1 ) , these are the lengths of the arcs into which the sample
partitions the unit circumference and are called the sample spacings. Clearly
a *i = Y.j=2D#
i Any symmetric function of the sample spacings will have the
rotational invariance property, and Rao (1969) suggested the use of such a class
of spacings tests for testing H0 in (3.1). See Rao (1976) and the references
contained there. In particular the statistic
Nonparametric methods in directional data analysis 759
number of al ~<x
F,(x) = (3.5)
n
for 0 ~<x < 1. The usual test statistics like the Kolmogorov-Smirnov statistic
1
W~ = n
f0 (F,(x) - x) 2 dx (3.7)
do not have the required invariance property. Kuiper (1960) suggested the
following variation of (3.6) which is rotationally invariant and hence usable
with circular data. Let
8x ~
3~--~51m2( 4m2x2- 3) e -2mex2 + O ( 1 ) (3.10)
for x ~>0. Stephens (1965) provides upper percentage points for small n.
Watson (1961) defined an invariant version of (3.7), namely
for use with circular data. Observe that U 2 is of the form of a variance while
W~ is like the second moment. The asymptotic null distribution is given by
(refer to Watson, 1961)
for x 1>0.
(iii) Scan statistics and chi-square-type tests. Ajne (1968) suggested two test
statistics based on the number of observations in a half-circle
N= sup N ( a ) , (3.13)
0~<1
the maximum number in any half-circle. As Rao (1969) and Bhattacharya and
Johnson (1969) pointed out, this is related to a bivariate sign test suggested
earlier by Hodges (1955). The exact null distribution of N is given by (cf. Ajne,
1968)
P(N>~k)= 2 n+l ~
j=O
\k+j(2k-n) / (3.14)
for k/> [n/2] + 1 which reduces for k > 2n to the simpler expression
"2n~-I
N* = ~ n
is given by
An = N (a )- doe. (3.16)
4 m-1
lim P(A, > x) =
. . . . ml ~(-1)
-rr(2m - 1)
e_~2=_i?x/2
for x t> 0. The statistic in (3.16) has been generalized in two different directions
by Beran (1969a) and Rao (1972b). Rao (1972b) considered dividing the unit
circumference into m (/>2) equal class intervals with the i-th interval being
1
XZ. =
f0 X2.(a) da . (3.17)
where f is any probability density function on the circle. It can be verified that
762 S. Rao Jammalamadaka
An and U~ defined in (3.16) and (3.11) are of this form. Beran (1969a) obtains
the asymptotic distribution of this statistic under the null hypothesis (3.1) as
well as under fixed alternatives and derives the approximate Bahadur slope.
Tests based on T~ are best invariant against local alternatives (i.e., for small k)
of the form
If we define
h(O) = 2 2 p2pcos pO
p=l
where
1 2
Let (r, . . . . . rm) denote the (linear) ranks of the first sample in the combined
sample of N = (m + n) observations in the usual fashion, and let
be the space of rank vectors for the combined sample. Define groups of
transformations {g} (corresponding to changes in zero direction) and {h}
(corresponding to changes in sense of rotation), of R onto itself by
g: ( r l , . . . , rN)-*(rl+ 1 , . . . , rN+ 1)
and
where the components of the transformed vector are defined modulo N. Let
be the group of transformations R ~ R generated by {g} and {h}. We may
define circular ranks (C, . . . . . Cm) of (aa . . . . . am) as an equivalence class of
( r , , . . . , rm) under the group q3. One can then define, corresponding to any
linear rank test T ( r ) based on linear ranks r, a circular rank test
which will then possess the required invariance. Batschelet (1965) suggested
such an invariant version of the Wilcoxon-Mann-Whitney statistic and pro-
vided a short table of critical values. Epplett (1982) pursues this further and
obtains its asymptotic null distribution.
764 S. Rao Jammalamadaka
Let Fm(x) and G , ( x ) denote the empirical distribution functions of the a ' s
and/3's, respectively. Define
+
Din,. = sup [Fro(x)- G.(x)] (3.21)
O~x<l
and
D~,. = sup [ O . ( x ) - Fm(x)]. (3.22)
O~x<l
Wm,.(r) = ~ ri (3.23)
i=l
the Wilcoxon test statistic. Then Epplett (1982) shows that the circular version
for 0 ~<s ~< t ~< 1. This test is shown to compare favorably with the two-sample
Kuiper test (see Equation (3.25)) in terms of Bahadur efficiency. Through
inclusion-exclusion, Epplett (1979) relates the exact probabilities for the cir-
cular statistic to those of the linear Wilcoxon statistic and provides a recurrence
relation.
(i) Tests based on empirical distribution functions. Since the two-sample
versions of the Kolmogorov-Smirnov and Cramer-von Mises statistics are not
rotationally invariant, they are inappropriate for testing the hypothesis (3.20).
Kuiper (1960) suggested the following two-sample variation of the Kolmogorov-
Smirnov statistic:
where D + . and DT.,. are as defined in (3.21) and (3.22). Its asymptotic null
distribution, properly normalized, is the same as that given in (3.10). Barr and
Shudde (1973) show that
mn 1 1 2
(3.27)
where F,,(x) and G , ( x ) are the empirical distribution functions of a ' s and/3's
respectively and H N ( x ) = [ m F m ( x ) + n G , ( x ) ] / N . The asymptotic null dis-
tribution of U~,, is again the same as that given in Equation (3.12).
(ii) Tests based on uniform scores. Beran (1969b) pointed out that two-
sample tests for the hypothesis (3.20) can be obtained from tests of uniformity
as follows: If (rl . . . . . rm) denote the (linear) ranks of the first sample in the
combined sample, then define
called the 'uniform scores'. Under the null hypothesis F = (3, these scores must
be uniformly distributed on the circle of unit circumference. Thus any test of
uniformity discussed in Section 3.1 can then be used on {ui} to test the
hypothesis (3.20). A test which rejects this hypothesis for large values of Inll,
the length of the resultant of {ui, i = 1 , . . . , m} was proposed by Wheeler and
Watson (1964). Mardia (1967) considered the statistic based on IRI[ in con-
nection with a bivariate location problem. Mardia (1969) and Schach (1969a)
discuss the asymptotic power and consistency properties of the Wheeler and
Watson statistic. Schach (1969a) considers a general class of statistics of the
form
clearly rotation invariant. Thus one may use test statistics of the form
for 'reasonable' functions hN('). The circular run test (cf. David and Barton,
1962) and a test suggested by Dixon (1940) based on E~' S~, are special cases of
this form. Holst and Rao (1980) show that under mild conditions on hN('), the
statistics Tin,, are asymptotically normal and that the Dixon test based on ET' S~
is asymptotically locally most powerful among this class. Some further power
comparisons and the special relevance of this class (3.31) to circular data
problems are discussed in Rao and Mardia (1980). More recently, tests based
on k-th order spacing-frequencies (for fixed finite k), i.e., on S! k) = the number
of observations in [O/(i-1) , Ol(i+k-1)), are considered in Rao and Schweitzer (1982)
where it is shown that among tests symmetric in {s!k)}, which can be used for
circular distributions, E?=I S! k)2 is asymptotically locally most powerful.
uq = rq/N (3.32)
where {r~j, u = 1. . . . . ni} are the ranks of the i-th sample observations among
the combined sample of N -- (nl + + nk) observations. Mardia (1972b) con-
siders a test based on the statistic
k
2 ~, (R~/ni) (3.33)
i=l
where
RE = COS 2~rU~j + sin 2~u 0
~j=l I 1
is the squared length of the resultant for the uniform scores of the i-th sample.
The statistic in (3.33) corresponds to the log likelihood ratio for testing
homogeneity of mean directions of k von Mises-Fisher distributions and may
Nonparametric methods in directional data analysis 767
T _ n 1 ~ Oij (4.1)
4 "rrn~<j
where Ou is the smaller angle between the i-th and j-th observations (in polar
coordinates) (o~i,/3i) and (% flj). Gates and Westcott (1980) discuss bounds on
the distribution of the minimum interpoint angular distance 6 = minij Oij under
the hypothesis of uniformity. Gin6 (1975) considers a class of invariant tests for
uniformity based on Sobolev norms, which contains as special cases tests for
uniformity on the circle, the sphere and the hemisphere (where the antipodes
are identified) introduced earlier by Rayleigh, Watson (1961), Ajne (1968), Rao
(1972b), Beran (1968) and Bingham (1964). Prentice (1978) follows along the
lines of Gin4 (1975) and Beran (1968) to obtain a class of invariant tests for
spheres and hemispheres in any dimension p ~>1. Stephens (1966) tabulates the
percentage points of three statistics which are useful in testing uniformity on
the sphere against specified alternatives listed.
References
Ajne, B. (1968). A simple test for uniformity of a circular distribution. Biometrika 55, 343-354.
Barr, D. and Shudde, R. H. (1973). A note on Kuiper's Vn statistics. Biometrika 60, 663-664.
Batschelet, E. (1965). Statistical Methods for the Analysis of Problems in Animal Orientation and
Certain Biological Rhythms, Am. Inst. of Bio. Sciences, Washington.
Batscbelet, E. (1981). Circular Statistics in Biology, Academic Press, London.
Beran, R. J. (1968). Testing for uniformity on a compact homogeneous space. J. Appl. Prob. 5,
177-195.
Beran, R. J. (1969a), Asymptotic theory of a class of tests for uniformity of a circular distribution.
Ann. Math. Statist. 40, 1196-1206.
Beran, R. J. (1969b). The derivation of nonparametric two sample tests from tests for uniformity of
a circular distribution. Biometrika 56, 561-570.
Bhattacharya, G. K. and Johnson, R. A. (1969). On Hodges' bivariate sign test and a test for
uniformity of a circular distribution. Biometrika 56, 446-449.
Bingham, C. (1964). Distributions on the sphere and on the projective plane, Ph.D. thesis, Yale
University.
Burrows, P. M. (1979). Selected percentage points of Greenwood's statistic, J. R. Statist. Soc. Set. A
142, 256-258.
Currie, F. D. (1981). Further percentage points of Greenwood's statistic. J. R. Statist. Soc. Ser. A
144, 360-363.
David, F. N. and Barton, D. E. (1962). Combinatorial Chance, Griffin and Co., London.
Dixon, W. J. (1940). A criterion for testing the hypothesis that two samples are from the same
population. Ann. Math. Statist. 11, 199-204.
Eplett, W. J. R. (1979). The small sample distribution of a Mann-Whitney type statistic for circular
data. Ann. Statist. 7, 446-453.
Eplett, W. J. R. (1982). Two Mann-Whitney type rank tests. J. R. Statist. Soc. Set. B 44, 270-286.
Fisher, N. I. and Lee, A. J. (1981). Nonparametric measures of angular-linear association.
Biometrika 68, 629-4536.
Fisher, R. A. (1953). Dispersion on a sphere. Proc, Roy. Soc. Lond. A 217, 295-305.
Gates, D. J. and Westcott, M. (1980). Further hounds for the distribution of minimum interpoint
distance on a sphere. Biometrika 67, 466--469.
Gint, E. (1975). Invariant tests for uniformity on compact Riemannian manifolds based on Sobolev
norms. Ann. Statist. 3, 1243-1266.
Hajek, J. and Sidak, Z. (1967). Theory of Rank Tests, Academic Press, New York.
Hodges, J. L., Jr. (1955). A bivariate sign test. Ann. Math. Statist. 26, 523-527.
Hoist, L. and Rao, J. S. (1980). Asymptotic theory for families of two-sample nonparametric
statistics. Sankhya Set. A 42, 19-52.
Jupp, P. E. and Mardia, K. V, (1980). A general correlation coefficient for directional data and
related regression problems. Biometrika 67, 163-173.
Kuiper, N. H. (1960). Tests concerning random points on a circle. Proc. Koninkl. Nederl. Akad.
Van Wetenschappen Ser. A, 63, 38--47.
Nonparametric methods in directional data analysis 769
Laubscher, N. F. and Rudolph, G. J. (1968). A distribution arising from random points on the
circumference of a circle. Nat. Res. Inst. Math. Sci. Rep. 268, Pretoria, South Africa, pp. 1-15.
Lenth, R. V. (1981). Robust measures of location for directional data. Technometrics 23, 77-81.
Maag, U. R. (1966). A k-sample analogue of Watson's U 2 statistic. Biometrika 53, 579-583.
Mardia, K. V. (1967). A nonparametric test for the bivariate two-sample location problem. J. R.
Statist. Soc. Set. B 29, 320-342.
Mardia, K. V. (1969). On Wheeler and Watson's two-sample test on a circle. Sankhya Ser. A. 31,
177-190.
Mardia, K. V. (1972a). Statistics of Directional Data, Academic Press, London.
Mardia, K. V. (1972b). A multisample uniform scores test on a circle and its parametric competitor.
J. R. Statist. Soc. Ser. B 34, 102-113.
Mardia, K. V. (1975). Statistics of directional data. J. R. Statist. Soc. Ser. B 37, 349-393.
Naus, J. I: (1982). Approximations for distributions of scan statistics. J. Amer. Statist. Assoc: 77,
177-183.
Prentice, M. J. (1978). On invariant tests of uniformity for directions and orientations. Ann. Statist.
6, 169-176.
Puff, M. L. and Rao, J. S. (1977). Problems of association for bivariate circular data and a new test
of independence. In: P. R. Krishnaiah, ed., Multivariate Analysis, Vol. 4. North-Holland
Publishing Co., Amsterdam.
Puff, M. L. and Sen, P. K. (1971). Nonparametric Methods in Multivariate Analysis, Wiley, New
York.
Rao, J. S. (1966). Large sample tests for homogeneity of angular data (Appendix to a paper by S.
Sengupta and J. S. Rao). Sankhya Ser. B 28, 172-174.
Rao, J. S. (1969). Some contributions to the analysis of circular data, Ph.D. thesis, Indian Statistical
Institute, Calcutta.
Rao, J. S. (1972a). Bahadur efficiencies of some tests for uniformity on the circle. Ann. Math.
Statist. 43, 468-479.
Rao, J. S. (1972b). Some variants of chi-square for testing uniformity on the circle. Z. Wahrsch.
Ver/e. Geb. 22, 33-44.
Rao, 3. S. (1976). Some tests based on arc lengths for the circle. Sankhya Set. B 38, 329-338.
Rao, J. S. and Kuo, M. (1984). Asymptotic results on the Greenwood statistic and some of its general-
izations. J. Roy. Statist. Soc. Set. B 46.
Rao, J. S. and Mardia, K. V. (1980). Pitman efficiencies of some two-sample nonparametric tests.
In: K. Matusita, ed., Recent Developments in Statistical Inference and Data Analysis. North-
Holland, Amsterdam.
Rao, J. S. and Schweitzer, R. L. (1982). On tests for the two sample problem based on higher order
spacing-frequencies. To appear.
Rothman, E. D. (1971). Tests of coordinate independence for a bivariate sample on a torus. Ann.
Math. Statist. 42, 1962-1969.
Rothman, E. D. (1972). Tests of uniformity for circular distributions. Sankhya Ser. A 34, 23-32.
Schach, S. (1969a). On a class of nonparametric two-sample tests for circular distributions. Ann.
Math. Statist. 40, 1791-1800.
Schach (1969b). Nonparametric symmetry tests for circular distributions. Biometrika 56, 571-577.
Stephens, M. A. (1965). The goodness of fit statistic V,: distribution and significance points.
Biometrika 52, 309-321.
Stephens, M. A. (1966). Statistics connected with the uniform distribution: percentage points and
application to testing for randomness of directions. Biometrika 53, 235--240.
Stephens, M. A. (1969). A goodness of fit statistic for the Circle with some comparisons. Biometrika
56, 161-168.
Stephens, M. A. (1981). Further percentage points of Greenwood's statistic. J. R. Statist. Soc. Set.
A 144, 364-366.
yon Mises, R. (1981). Uber die 'Ganzzahligkeit' der Atomgewicht und verwandte fragen. Physikal
Z. 19, 490-500.
Watson, G. S. (1961). Goodness of fit tests on a circle. Biometrika 48, 109-114.
770 S. Rao Jammalamadaka
Watson, G. S. (1962). Goodness of fit tests on a circle II. Biometrika 49, 57-63.
Watson, G. S. (1983). Statistics on the sphere. Wiley, New York.
Wellner, J. A. (1979). Permutational tests for directional data. Ann. Statist. 7, 929-943.
Wheeler, S. and Watson, G. S. (1964). A distribution free two sample test on a circle. Biometrika
51, 256-257.
Yoshimura, I. (1978). On a test of homogeneity hypothesis for directional data. Sankhya Ser. A 40,
310--312.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 "~,~
Elsevier Science Publishers (1984) 771-790
H. S. Wieand
1. Introduction
survival function
771
772 H.S. Wieand
d f(t)
A(t) : - -~. log S ( t ) = (1.4)
s(t)
and the cumulative hazard function by
The data set considered in this section is taken from an example appearing in
a paper by Fleming et al. (1980). It consists of observations of time to
progression of disease for patients with high-grade or undifferentiated Stage II
or Stage 1II ovarian carcinoma who were followed at Mayo Clinic. The
observation times were 34, 88, 137, 199, 280, 291, 299+, 300+, 309, 351, 358,
369, 369, 370, 375, 382, 392, 429+, 451 and 1119+ days.
We will begin our analysis by estimating the survival function, using the
K a p l a n - M e i e r (1958) product limit estimator. Let k be the number of distinct
failure times. We will let ti represent the i-th ordered failure time, n~ the
number of patients at risk at time tl, and di the number of deaths at time ti. The
K a p l a n - M e i e r estimate is then defined by
S(t)=
i:ti<.t
Table 2.1 gives the values of i, ti, ni, di and S(t) in the interval [ti, ti+l) for the
ovarian carcinoma data. Notice that the statistic does not change values at
299+ since there is no failure. Furthermore, if 299+ had been a 291+ or 308+,
none of the entries in the table would have changed. However, if the 299+ had
been a 309+, n7 would have been 13 and other changes would have resulted.
Although it is not mentioned in the table, we define S ( t ) = 1 in the interval
Application of nonparametric statistics to cancer data 773
Table 2.1
Mayo Clinic data. Values of the Kaplan-
Meier estimator for the high-grade ovarian
carcinoma patients
i ti ni di $(t)
1 34.0 20 1 0.9500
2 88.0 19 1 0.9000
3 137.0 18 1 0.8500
4 199.0 17 1 0.8000
5 280.0 16 1 0.7500
6 291.0 15 1 0.7000
7 309.0 12 1 0.6417
8 351.0 11 1 0.5833
9 358.0 10 1 0.5250
10 369.0 9 2 0.4083
11 370.0 7 1 0.3500
12 375.0 6 1 0.2917
13 382.0 5 1 0.2333
14 392.0 4 1 0.1750
15 451.0 2 1 0.0875
1.0
0.8 I
I
0.6 b-
l--
~(t)
{
0.4
I-
I-
0.2 I
0.0 t I I I {
1O0 200 300 400
t (in days)
Fig. 2.1. Estimated survival function for the 20 high grade ovarian carcinoma patients.
774 H.S. Wieand
can be approximated by
di
d'~(t) = S2(t) ~ n ~ ( ~ - d,)" (2.2)
i:ti<<.t
~J = S-~(0.5). (2.4)
where to is assumed to be zero and tk+l is the last observation time, regardless
of whether it is a failure time or censored time. If the last time is a failure,
tk+X = tk and the last term on the RHS of (2.5) is zero. For the Mayo Clinic data,
the value of the mean is 380 days (see Table 2.2). The variance of t2 can be
estimated by
Table 2.2
Mayo Clinic data. Values needed for computation of the mean survival time fox high grade
carcinoma patients
K (K )2 di
i S(t,) ti+l- t, S(t,)(t,+,- t~) ~ S(t,)(t,+l- t,) ~'~ S(t,l(t,+t- t,) hi(n, - di)
i=i i=i
0 1.0000 34 34.00
1 0.9500 54 51.30 345.86 314.78
2 0.9000 49 44.10 294.56 253.70
3 0.8500 62 52.70 250.46 205.00
4 0.8000 81 64.80 197.76 143.78
5 0.7500 11 8.25 132.96 73.66
6 0.7000 18 12.60 124.71 74.06
7 0.6417 42 26.95 112.11 95.22
8 0.5833 7 4.08 85.16 65.93
9 0.5250 11 5.78 81.08 73.04
10 0.4083 1 0.41 75.31 180.05
11 0.3500 5 1.75 74.90 133.57
12 0.2917 7 2.04 73.15 178.36
13 0.2333 10 2.33 71.11 252.83
14 0.1750 59 10.33 68.78 394.22
15 0.0875 668 58.45 58.45 1708.20
379.87 4146.40
In this case, if the last time is a failure the term corresponding to i = k is 0. For
the Mayo Clinic data, ~2 is 4146 and the standard error is 64.
Estimation of the density function or hazard function is quite involved and
will not be addressed here, however, estimation of the cumulative hazard
function is straightforward. If one has already found the Kaplan-Meier esti-
mator, an easily obtained estimator (see Peterson, 1977) is:
di-1
AN(t)= ~ ~ 1 . (2.8)
t ti-<t 1 = 0 rti -- l
The two estimators are generally very close. For example, for the Mayo
Clinic data, Ap(365) = -1og(0.525) = 0.644 while AN(365)- (1/20)+
(1/19) + (1/18) + (1/17) + (1/16) + (1/15) + (1/12) + (1/11) + (1/10) = 0.620. Note
than to get AN(369), we would have to add (1/9)+ (1/8) to AN(368) since there
were two failures at t = 369.
As shown above, once the K a p l a n - M e i e r estimate of the survival function is
obtained, one can very easily obtain estimates of quantiles (including the
776 H. S. Wieand
median), the mean, variance of the mean, and hazard function. For large
samples, or samples where specific failure times are hard to obtain, it is often
convenient to replace the K a p l a n - M e i e r estimator by the actuarial life table
estimator, which is the classical method. Details can be found in virtually any
survival analysis textbook, however a short discussion may be worthwhile.
We divide the time axis into intervals 11, 12. . . . , IM and let wj be the n u m b e r
of censored values in the i n t e r v a l / / and dj the n u m b e r of failures. We define
n) = n j - 0.5wj (essentially the effective risk set during the interval). Then, for t
in Ik,
^2 -- dj
d'2a(t) = S A ( i ) ~ n~( rl~'- 4 ) " (2.10)
i<k I 1
Table 2.3
Mayo Clinic data. Values of the actuarial estimator for the high-grade
carcinoma patients
i Ii ni di i n~ Sk(t), t ~ I
1 [0-6) 20 3 0 20 1.00
2 [6-12) 17 6 2 16 0.8500
3 [12-18) 9 7 1 8.5 0.5313
4 [18-24) 1 0 0 1 0.0937
5 [24-30) 1 0 0 1 0.0937
6 [30-36) 1 0 0 1 0.0937
7 [36-42) 1 0 1 0.5 0.0937
[42-~) 0.0937
3. T h e t w o - s a m p l e problem
Mayo Clinic, w h o differ from the first group in that they had low-grade or well
differentiated cancer. We want to test to see if progression of disease is
different for patients with high-grade cancer than it is for patients with
low-grade cancer.
The observation times for the low-grade cancer patients were 28, 89, 175,
195, 309, 377+, 393+, 421+, 447+, 462, 709+, 744+, 770+, 1106+ and 1206+
days. Kaplan-Meier plots (Figure 3.1) indicate that if there is a difference in time
to progression of disease in the two groups, it is in favor of the low-grade patients.
However, more analysis is required to determine whether such a difference is
likely to occur by random chance or if it truly represents an improved prognosis of
low-grade patients.
The notation given below will be used in the analyses which follow. Let k be
the number of distinct failure times in the combined sample, and tl < t2 < <
tk be the ordered failure times, nji will be the number of patients from group j
at risk at time t~. A patient is considered to be at risk at time ti if his time to
failure or censor is at least t~. dj.i will be the number of patients from group j
w h o fail at time t~. ni and d~ will be the corresponding values in the entire
sample. Define
Table 3.1
Mayo Clinic data. Values needed for computation of the T a r o n e - W a r e statistics
ni dl nli dli n2i d2i Eli dl i _ Eu V1 i n i ( d l i _ Eli) n 2i Vii F(ti)(dli - Eli) F2(ti) V i i
1.0
I
0.8
0.6 b-
S(t)
0.4 I
0.0 t I t I
100 200 300 400
t (in days)
Fig. 3.1. Estimated survival functions for the high and low grade ovarian carcinoma patients.
and
nun2idi(ni- di)
Vii = n2(ni- 1) (3.2)
Ej~ and Vj~ can be thought of as the expected value and variance, respectively,
of dji, j = 1, 2 if there is no difference in prognosis for the two groups. Table 3.1
gives these values for the Mayo Clinic data. For example, at time t = 34,
nt~ = 20, dli = 1, nxi = 14 and d2i = 0. Hence ni = 34, d i = 1, Eli = (20 1)/34 =
0.59 and Vu = (20 14 1 33)/(342 33) = 0.24.
The statistics used most frequently to form two-sample tests are special cases
of a class of statistics discussed by Tarone and Ware (1977). These statistics are
of the form
T = ~k=l w i ( d u - E u )
(3.3)
(Y~=I w~V.) 1/2
to the normal critical points. For example if the alternative is simply that the
two groups differ (with no preconceived idea of which group has a better
prognosis), one would form an a-level test by computing T and rejecting the
hypothesis if IT I > z~/2 where z~ is defined in (2.3). If the alternative is that
group 1 has a better prognosis than group 2, the rejection region would be
T < - z ~ . The decision of whether to reject for large or small values of T is
straightforward if one notes that a negative T implies that there are fewer
deaths in the first sample than expected, i.e., the 1st group has a better
prognosis. Which weight function is the appropriate one to use in (3.3) is
somewhat arbitrary. The most commonly used one is wi = 1 (Mantel-
Haenszel), however if one is particularly interested in detecting early
differences, wi = ni (Gehan's generalized Wilcoxon) would be preferable. If
there is a great deal of early censoring, wi = F ( t i ) (Prentice's generalized
Wilcoxon) is more appropriate than wi = ni (see Prentice and Marek, 1979).
Although in practice one should decide which form of T is to be used before
beginning the analysis, all three forms are computed below to illustrate the
procedure. Some of the intermediate values are given in Table 3.1. The
Mantel-Haenszel statistic has the value T = 5.33279/(5.10898) 1/2= 2.36, the
value of the Gehan generalized Wilcoxon is 1.50, and the Prentice generalized
Wilcoxon is 1.60. Hence if we had chosen a two-sided test with a = 0.05
(z~/2 = 1.96) we would have rejected the hypothesis using the Mantel-Haenszel
statistic (p = 0.018) and concluded that the low-grade patients (group 2) had an
improved prognosis (since T was positive). On the other hand, we would not
have rejected the hypothesis using the Gehan (p = 0.13) or Prentice (p = 0.11)
generalized Wilcoxon statistic. The Wilcoxon statistics were not as sensitive to
the difference between the groups because the differences occurred late (at
times when these statistics gave less weight to the differences).
The statistics discussed above are most useful for detecting differences in
groups for which the hazard for the 'good prognosis' group is neve~ more than
the hazard for the 'poor prognosis' group. However there are instances
where one suspects that one group will have a greater hazard initially but a
smaller hazard later such as when one group has surgery or a toxic treatment.
Differences of this type are more likely to be detected by generalizations of the
two-sample Kolmogorov-Smirnov (Smirnov, 1939) and Cram6r-von Mises
(Cram6r, 1928, and von Mises, 1931) statistics than by Statistics in the T a r o n e -
Ware family. A generalization of the Cram6r-von Mises statistic to be used for
randomly right censored data is given by Koziol (1978).
A generalization of the Kolmogorov-Smirnov statistic is given in Fleming et
al. (1980) who use the idea that the hypothesis of no difference between groups
can be rejected when the maximum value of the difference in their cumulative
hazard functions (properly weighted) is large. An application of their method
to the above data follows. (This example is given in their paper in more detail.)
The procedure uses the cumulative hazard function estimator A (t) defined by
(2.8) and another function k(t) which can be thought of as the cumulative
hazard function of censoring times. If we let uj, j = 1 . . . . . c be the ordered
780 H. S. Wieand
c~-, 1 (3.4)
&(t)= j:uj~t
~ k=0~, nj - d j k'
where q represents the number of patients censored at time uj. Actually, this is
analogous to (2.8) if one notes the number of patients at risk for censoring is
n~ - dr (since deaths precede censors).
As before, to extend the notation :o the two sample case, we let h , . . . , tk be
the times of failure for the combined sample. We let a#(Ajg) represent the value
of &(z() forthe j-th group at time ti. Then
is the number of patients censored in the interval [ti-1, ti), A]O = OLjO = O, and
nj0 = nj (the number of patients in sample ]).
Fleming et al. define a weight function
p
= 2 ( 1 - q~((R _A2)o.5) + =a \s({RA-(R2 2R)- 1)'~
03] exp{_2AZ}) '
Furthermore,
Table 3.2
Mayo Clinic data. Calculation of the statistics needed to evaluate the two-sided general-
ized Smirnov statistic
Table 3.3
Data for breast cancer patients from the NSABP study
Time Status Age Nodes Time Status Age Nodes Time Status Age Nodes
Time Status Age Nodes Time Status Age Nodes Time Status Age Nodes
adequate. (This is often the case in clinical trials.) The Mantel-Haenszel form
is easy to compute and can be used in most cases. However if one expects the
group to differ the most at an early stage, the Gehan-Wilcoxon or Prentice-
Wilcoxon would be more appropriate. Finally, if one is expecting a difference
that might include 'crossing hazards' or has no idea what type of differences
might occur, the generalized Kolmogorov-Smirnov statistic would be prefer-
able.
A set of data which will be analyzed in the next section is given in Table 3.3.
The three groups of patients considered were participants in a study conducted
by the National Surgical Adjuvant Project for Breast and Bowel Cancers
(NSABP). The patients all had a radical mastectomy for mammary carcinoma
and histologically positive axillary nodes. In the actual studies, patients were
randomized to various treatment groups, however for this analysis subsets have
been selected in a nonrandom fashion. Sixty-eight patients were selected from
a placebo group, 67 from a group which received a chemotherapy combination
(referred to as treatment A), and 68 from another group receiving a different
chemotherapy combination (referred to as treatment B).
We will begin analyzing this data as if the patients had been randomized into
the three groups by using a 3-sample version of a T a r o n e - W a r e statistic. The
method of computation is similar to that described earlier in the section for the
two-sample problem, however some additional notation is required. For sim-
plicity, let j = 1 refer to the placebo group and j = 2 and 3 correspond to
treatment groups A and B respectively, dj~, nj~, etc. will have the same
meanings as before. The definitions which parallel those given by (3.1) and (3.2)
are
Under the hypothesis that all three groups have the same survival prognosis, U
Application of nonparametric statistics to cancer data 785
k k
= ( 3 8 - 30.11, 25 - 31.70)
and
k
( 19.07 -9.33'~
where each Vjj, = ~', Vjj, i .
V= \-9.33 20.74/ i=1
This leads to
V _ I = (0.067 0.030~
\0.030 0.062]
and U = 3.76 (p = 0.15). H e n c e , o n e would not reject the hypothesis that the
groups differ in terms of survival at a = 0.05 or even a = 0.10.
Table 4.1
Summary tables of characteristics for 203 NSABP patients a
29 29 26 84
<50 (42.6) (43.3) (38.2) (41.4)
21 27 27 75
50-59 (30.9) (40.3) (39.7) (36.9)
18 11 15 44
~60 (26.5) (16.4) (22.1) (21.7)
Total 68 67 68 203
Number of
Positive Nodes Placebo Treatment A Treatment B Total
34 45 26 105
1-3 (50.0) (67.2) (38.2) (51.7)
22 15 26 63
4-9 (32.4) (22.4) (38.2) (31.0)
12 7 16 35
~>10 (17.6) (10.4) (23.5) (17.2)
Total 68 67 68 203
within each subset the Y~idli-F-,ti and Ei Vii are found, and the statistic
T = (Es Ei (dli - Eli))/Es Ei (Vii) 1/2) c o m p u t e d where Es refers to the sum over
all the different subsets. A g a i n T is asymptotically normal with m e a n 0 and
variance 1 u n d e r the hypothesis of no difference.
T o illustrate using the N S A B P data, we divide the patients into subsets with
1-3, 4 - 9 and 10 or m o r e positive nodes. C o m p a r i n g placebo to t r e a t m e n t A,
we first rank the failure times of patients with 1 - 3 n o d e s f r o m the c o m b i n e d
(placebo and t r e a t m e n t A ) groups and use the m e t h o d of c o m p u t a t i o n dis-
cussed in Section 3 to obtain Y'i ( O i i - Eli) = 2.15 and Ei Vii = 4.72.
W e repeat this process for the patients with 4 - 9 positive nodes to obtain
Ei (011 - Ell) = 1.63 and Ei Vii = 5.84 and for the patients with 10 or m o r e nodes
to obtain ~,i (01i - Eli) = - 1 . 4 5 and Y.i Vii = 3.17. W e then sum these results and
get T = (2.15+ 1 . 6 3 - 1 . 4 5 ) / ( 4 . 7 2 + 5 . 8 4 + 3 . 1 7 ) = 0.63 (p = 0.26). T h e same
technique applied to placebo versus treatment B yields T = 1.44 ( p - - 0 . 0 8 ) .
Using this adjusted m e t h o d , the results indicate no significant i m p r o v e m e n t
using t r e a t m e n t A, but borderline significance with t r e a t m e n t B, quite contrary
to the results o b t a i n e d in the u n a d j u s t e d case.
T h e r e is no set rule for when adjustments are appropriate, but if the patients
are r a n d o m i z e d to t r e a t m e n t groups, unadjusted statistics are usually prefer-
able. In this case adjusting m a y introduce as m a n y problems as it solves. If the
patients are not r a n d o m i z e d , but a characteristic is balanced across groups, it
Application of nonparametric statistics to cancer data 787
still need not be adjusted for but, if there is an imbalance in some characteristic
which might influence survival, use an adjusted statistic. In the above case,
adjustment.is clearly required.
Although the Mantel-Haenszel statistic was used in the above discussion,
the same technique can be used with any T a r o n e - W a r e type of statistic.
Another topic of interest when covariates (patient characteristics) are given
is trying to determine which covariates are important in predicting treatment
response. The standard nonparametric method for addressing this problem uses
the Cox (1972) proportional hazard model. This model assumes that the hazard
function for every patient is of the form
Az(t) = A0(t) e z~ (4.1)
where A0(t) is called the base-line hazard, z is a vector of covariates for an
individual patient, and/3 is a vector of regression parameters. A particularly
thorough discussion of the model is given in Kalbfleisch-Prentice (1980). The
discussion involves some reasonably complex mathematics which is beyond the
scope of this chapter, however an outline of how the model can be used (to
include preparation for and interpretation of computer packages) follows.
The starting point is usually to obtain estimators for the values of/3 which
will indicate whether individual covariates influence prognosis, whether the
effect is favorable or unfavorable, and the magnitude of the effect. It is possible
to standardize each component of/3 and determine whether the corresponding
covariate has a statistically significant effect. The first step in running a program
using the proportional hazards model is to arrange the data so that one column
has the time to event (failure or censor), another has status (fail or censor), and
each succeeding column has a value for a covariate. For the NSABP data, we
let zt represent age, z z represent number of positive nodes, and z3 represent
treatment (0 = placebo and 1 = treated). Hence the file has 5 columns and the
entry for the first patient is '846 0 58 1 0', where the first 0 indicates the patient
is censored and the second 0 indicates the patient is a placebo patient. This file
was used as input for the BMDP Cox regression program (using the placebo
and treatment A patients) and some of the results obtained are given in Table
4.2.
The values of the standardized coefficients are helpful for a preliminary
analysis since under the hypothesis that/3i = 0, i.e., covariate zl has no effect on
the hazard, these represent observations of a standardized (mean 0 and
variance 1) normal random variable. In this example the standardized
coefficient for /31 is -1.328 (two-sided p-value = 0.19), that of nodes is 5.75
(p ,~ 0.001) and treatment is 0.81 (so = 0.42). This would indicate that a highly
significant prognostic factor is the number of positive nodes, while age and
treatment may not be significant.
The values of/3i are used to estimate the effect of a particular covariate. For
example, /32 = 0.122 implies that if two patients are the same age and are
receiving the same treatment, but the second patient has one more positive
node than the first, the second patient's estimated hazard function will always
be exp{0.122} = 1.13 times that of the first patient. If the second patient had 5
788 H. S. Wieand
Table 4.2
S u m m a r y of output from a B M D P run using the Cox regression model applied to
the N S A B P data
Standardized
Variable Coefficient Z Coefficient EXP(Coeff)
Placebo and T r e a t m e n t A
Placebo and T r e a t m e n t B
more positive nodes than the first patient, his estimated hazard would be
exp{5 0.122} = 1.84 times that of the first patient. /33 = -0.221 implies that if
two patients have the same number of positive nodes and are the same age, a
treated patient's estimated hazard will be exp{-0.221} = 0.80 times that of a
placebo patient.
A similar run was made on the placebo and treatment B patients and these
results are also shown in Table 4.2. Again the number of positive nodes
appears to be highly significant while age is not at all significant and treatment
is of borderline significance (which is consistent with the findings using the
adjusted Mantel-Haenszel statistic). In this case the estimated hazard of a
greated patient is exp{-0.461} = 0.63 times that of a placebo patient.
The above example illustrates some uses of the proportional hazards model,
but ignores topics such as covariate interactions and time-dependent covariates.
These topics are discussed in some detail in Kalbfleisch-Prentice (1981),
Breslow (1975), and Byar and Green (1980).
5. Additional references
Acknowledgement
References
Fleming, T. R. and Harrington, D. P. (1982). A class of rank test proceduces for censored survival
data. Biometrika 69, 553-566.
Fleming, T., O'Fallon, J., O'Brien, P. and Harrington, D. (1980). Modified Kolmogorov-Smirnov
test procedures with application to arbitrarily right-censored data. Biometrics 36, 607-625.
Gehan, E. A. (1965). A generalized Wilcoxon test for comparing arbitrarily singly-censored
samples. Biometrika 52, 203-223.
Gill, R. D. (1980). Censoring and Stochastic Integrals. Mathematical Centre Tracts 124, Mathema-
tisch Centrum, Amsterdam.
Greenwood, M. (1926). The natural duration of cancer. In: Reports on Public Health and Medical
Subjects, No. 33.
Helland, I. S. (1982). Central limit theorems for martingales with discrete or continuous time.
Scand. J. Statist. 9, 79-94.
Jones, D. and Whitehead, J. (1979). Sequential forms of the log rank and modified Wilcoxon tests
for censored data. Biometrika 66, 105-113.
Kalbfleisch, J. and Prentice, R. L. (1980). The Standard Analysis of Failure Time Data. Wiley, New
York.
Kaplan, E. L. and Meier, P. (1958). Nonparametric estimation from incomplete observations. J.
Amen. Statist. Assoc. 53, 457-481.
Koziol, J. A. (1978). A two sample Cram6r-von Mises test for randomly censored data. Biota. J.
20, 603-608.
Koziol, J. A. and Petkau, A. J. (1978). Sequential testing of the equality of two survival
distributions using the modified Savage statistic. Biometrika 65, 615-623.
Mantel, N. and Haenszel, W. (1959). Statistical aspects of the analysis of data from retrospective
studies of disease. J. Nat. Cancer Inst. 22, 719-748.
Miller, R. (1981). Survival Analysis. Wiley, New York.
Nelson, W. (1969). Hazard plotting for incomplete failure data. J. Oual. Tech. 1, 27-52.
Oakes, D. (1981). Survival times: Aspects of partial likelihood. Int. Statist. Rev. 49, 235-264.
Peterson, A. V. (1977). Expressing the Kaplan-Meier estimator as a function of empirical
subsurvival functions. J. Amen. Statist. Assoc. 72, 854-858.
Peto, P. and Peto, J. (1972). Asymptotically efficient rank invariant test procedures. J. Roy. Statist.
Soc. Ser. A 135, 185-198.
Prentice, R. L. (1978). Linear rank tests with right censored data. Biometrika 65, 167-179.
Prentice, R. L. and Marek, P. (1979). A qualitative discrepancy between censored data rank tests.
Biometrics 35, 861-869.
Rai, K., Susarla, V. and Van Ryzin, J. (1979). Shrinkage estimation in nonparametric Bayesian
survival analysis. Comm. Statist. B 9, 271-298.
Rubinstein, L. V. and Gail, M. H. (1982). Monitoring rules for stopping accrual in comparative
survival studies. Controlled Clinical Trials 3, 325--343.
Schey, H. M. (1977). The asymptotic distribution of the one-sided Kolmogorov-Smirnov statistic
for truncated data. Comm. Statist. A 6, 1361-1365.
Sen, P. K. (1979). Weak convergence of some quantile processes arising in progressively censored tests.
Ann. Statist. 7, 414-431.
sen, P. K. (1981). The Cox regression model, invariance principles for some induced quantile
processes and some repeated significance tests. Ann. Statist. 9, 109-121.
Slud, E. and Wei, L. J. (1982). Two-sample repeated significance tests based on the modified Wilcoxon
statistic. J. Amer. StatisL Assoc. 77, 862-868.
Smirnov, N. V. (1939). Estimate of deviation between empirical distribution functions in two
independent samples (Russian). Bulletin Moscow Univ. 2(2), 3--16.
Tarone, R. E. and Ware, J. (1977). On distribution free tests for equality of survival distributions.
Biometrika 64, 156-160.
Tsiatis, A. A. (1981a). The asymptotic joint distribution of the efficient scores test for the proportional
hazards model calculated over time. Biometrika 68, 311-315.
Tsiatis, A. A. (1981b). A large sample study of Cox's regression model. Ann. Statist. 9, 93-108.
Von Mises, R. (1931). Wahrscheinlichkeitsrechnung. Leipzig-Wein.
P. R. Krishnaiahand P. K. Sen, eds., Handbook of Statistics, Vol. 4 "~'~
Elsevier SciencePublishers (1984) 791-811
Mitchell Gail
1. Introduction
Clinical trials to compare survival on two treatment groups are often ~costly
and time-consuming, and they may impose psychological and physical burdens
on participating patients. Yet these trials offer a uniquely sound methodology
for acquiring new medical knowledge. To obtain convincing scientific evidence
from such a trial requires that one follow a substantial number of pathents for a
sufficient time interval. Over the past fifteen years, a number of methods for
monitoring the results of clinical trials have been developed in an effort to
shorten the trials and reduce the numbers of patients required without greatly
diminishing the quality of the scientific information gained.
The question of how to monitor a clinical trial is inevitably determined by
one's view of the central purpose and meaning of such an experiment.
Armitage (1960, 1975) presents classical frequentist methods for a variety of
endpoints. These methods emphasize hypothesis tests and proper control of the
significance level despite repeated looks at the data, in hypothetical repetitions
of the entire clinical trial. The frequentist viewpoint has been vigorously
challenged by Cornfield (1966), who presents a Bayesian alternative. Anscombe
(1963), too, criticized the frequentist approach, and presented a decision
theoretic approach to monitoring. This formulation, which was developed
simultaneously by Colton (1963), is to regard the monitoring of a clinical trial
as an effort to minimize the total number of patients who receive the inferior
treatment; patients other than those participating in the trial must be included
in such calculations. Yet another viewpoint is that of the selection theorist (e.g.
Bechhoffer, 1954), who aims to select the superior treatment with prespecified
high probability in hypothetical repetitions of the trial. Gail (1982) has dis-
cussed these several approaches and gives further references, as do DeMets
and Lan (1984). I n this paper we concentrate on recent developments and
adaptations of the frequentist paradigm, and we confine attention to survival
tests based on ranks.
It is assumed that the main response of interest is survival time and that
decisions to stop the trial or to declare a treatment preference, if any, depend
791
792 Mitchell Gail
2. Null distribution theory for two-sample rank tests used for monitoring
The summation in (2.1) extends only over the d(t) distinct death times, Y(i),
and Z(t, Y(i)) indicates the group membership of the patient who died at Y(i).
The positive function Q(t, Y(i)) defines the particular rank test of interest. For
example Q(t, Y(i)) = i defines the logrank statistic, which was first proposed by
Mantel (1966), and Q(t, Y(i))= n(t, Y(i)) defines the modified Wilcoxon test
794 Mitchell Gail
given by Gehan (1965). The class of tests represented by (2.1) has been
discussed by Tarone and Ware (1977), Prentice and Marek (1979), Gill (1980),
and Harrington and Fleming (1982).
As mentioned by Lagakos (1982), it is revealing to think of (2.1) as the score
statistic associated with the partial likelihood of Cox (1972). Suppose the
hazard ratio hi(t, Y(i))/hz(t, Y(i))= exp{aQ(t, Y(i))}. The partial likelihood
represents the survival experiment as a product of independent Bernoulli trials
with log likelihood
The quantity S(t) in (2.1) is the score statistic/~(~ = 0). The observed Fisher
information is
d
-f~(a = O)=- V(d) = ~, QZ(t, Y(i))p~(1 -p~), (2.3)
1
where the shortened notation Zi =--Z(t, Y(i)) and Pi =-p(t, Y(i)) will be used.
If (2.2) had indeed arisen from independent Bernoulli trials with variates Zi
having mass functions pZi(1- pi) 1-zl under H0: a = 0, then S(t) would be the
sum of independent centered, weighted variates with expectations 0 and
variances Q2(t, Y(i))pi(1- pi). Hence
(2.1). These martingale and convergence results hold whether one calculates S
at fixed real times or at the random times of observed deaths. Majumdar and
Sen (1978a) and Sinha and Sen (1979) extend these results to several samples.
The distribution theory is also simple if entry is purely sequential; that is, if
the entry of the next patient occurs only after the death of the previous patient.
Sen and Ghosh (1972) demonstrate that a normed version of S(t), computed
after each observed death, again converges to a Wiener process. However,
example 3.1 of Slud (1983) shows that the martingale property may fail in the
presence of loss to follow-up represented by W < ~.
Results are more complicated for staggered entry. Suppose first that S(h) and
S(t2) are computed at two predetermined real times, h < t2. The previous
heuristic device can be extended in the following way. Let Y(i, h), i=
1, 2 . . . . . d(tl), denote the ordered experimental death times as of real time h,
and let Y(i, tz), i = 1, 2 . . . . . d(t2), denote such death times as of real time t2.
Note Y(i, h) need not equal Y(i, t2). Express S(t2) as
d(t2)
S(t2)-- Z Q(tz, Y(i, t2)){z(t2, Y(i, t2))- p(t2, Y(i, t2))}
1
where ~;* represents a sum over those patients who had already died in [0, tl],
{Y*} are their experimental death times, E** represents a sum over those
patients who die in (h, t2], and {Y**} are their experimental death times. At
real time t2, the first of the three sums in (2.5) might be regarded as a sum of
independent weighted, centered Bernoulli variates with centering P(h, Y*)
determined by the data at h and with weights Q(t2, Y*) depending on t2. The
second sum might be regarded as another independent set of independent
weighted, centered Bernoulli variates with centerings p(t2, Y**) dependent on
all the information available at t2. For randomized trials, the third sum can be
shown to be asymptotically negligible compared to the first two, because the
entry time distributions are the same on the two treatment arms. Indeed, from
Tsiatis (1982, equation 3.3), it follows that P(h, Y*)-p(t2, Y*) converges to
zero and that the third sum is asymptotically negligible. Because S(h) only
depends on the set of Bernoulli variates in E*, we4aave
a(q)
eov(S(tl), s(t2))= ~, O(tl, Y(i, t3)O(t2, Y(i, t3)p,(1- p,) (2.6)
1
where p~ = p(tl, Y(i, h)) and where the summation is over the times Y(i, tl).
796 Mitchell Gail
3. Continuous monitoring
Armitage (1975, p. 142) suggested that one can monitor the logrank test by
~otting S(Te), the value of S at the random time of death d, against
V(d) = Z1d p~(1- p~). He asserted under the null hypothesis that, S(Ta) would
behave approximately like the cumulative sum of V(d) independent normal
deviates, each with mean 0 and variance ~ if patients were assigned with equal
probability to each treatment. He therefore suggested using classical boun-
daries for normal variates, in conjunction with the logrank statistic, to mon-
itor survival studies. For proportional hazards alternatives ha(t, Y(i))=
h2(t, Y(i))exp(0), with small 0, a Taylor series expansion argument shows that
the mean of one of these normal deviates is / z - p , ( 1 - p i ) O - 0 / 4 and the
variance is C - p~(1- p~)- . Thus boundaries appropriate for monitoring the
cumulative sum of d independent N(/x, o-) variates may be used to monitor a
logrank plot of S(Td) against V(d), where the noncentrality parameter 6 =
#/o- = 0/2 is used to help select a boundary. Since V(d) is only slightly less than
d/4, one might choose to plot S(Td) against d/4 or 2S(Td) against d, instead.
Chapter 5 of Armitage (1975) contains two-sided boundaries for testing
0 = 0. A 'restricted boundary' for the logrank test is expressible in terms of the
design constants a, b and Vi. One continues observation so long as
and I3"(d)< Vs. Reject the null hypothesis as soon as the boundary (3.2) is
infringed. Otherwise, accept the hypothesis as soon as V(d)i> Vi. The con-
stants Vi and k(a, Vi) are again determined by 6 = 0/2 and the desired size and
power. The boundary (3.2) is a parabola, and it results from repeatedly
comparing the standardized deviate (2.4) with the constant k(a, Vi). The
calculations needed to construct repeated significance test boundaries for
independent normal variates are given by Armitage, McPherson and Rowe
(1969) and McPherson and Armitage (1971). The operating characteristics of
restricted boundaries (3.1) and repeated significance test boundaries (3.2) are
similar. Under the alternative, both result in substantial average reductions in
the numbers of deaths which must be observed to terminate a trial, compared
with a fixed sample design that is analyzed only when a prespecified number of
798 Mitchell Gail
S(o~), namely VI, is known, and c can be determined by noting that S(Ta)/V}/2
converges to a Wiener process on [0, 1] with transformed 'time' scale to =
9(d)/
More recently, Majumdar and Sen (1978b) adapted boundaries like (3.5) to
staggered entry. However, instead of restricting analyses to the times at which
deaths occur, they proposed to monitor the data at the random real times
either of a death or of a new entry into the study. At each real time of analysis,
they consider a sequence of type I censored survival problems, one cor-
responding to each member of a finite set of potential follow-up times. For
each member of the sequence, they find the maximum of S(Td), and then they
maximize over these maxima. The resulting distribution theory is complex, and
rejection occurs when one of the computed statistics exceeds a critical value
appropriate for a two-dimensional 'Brownian sheet'. Sinha and Sen (1982)
proposed closely related procedures. A disadvantage of these techniques is the
need to specify a 'target sample size', which is the putative number of patients
to be entered in the event monitoring doesn't lead to rejection. This quantity,
which is hypothetical, is needed to calculate the monitoring statistics. No
corresponding parameter is needed to calculate S(Td) and V(d) for the
methods discussed previously.
Sen (1979) developed an adjustment procedure for covariates that permits
continuous monitoring for simultaneous entry, and Whitehead (1983) presents
covariate adjustment procedures for the case of staggered entry.
It is often impractical to attempt continuous monitoring because of the
difficulty of maintaining accurate up-to-date data. This is a particular problem
for cooperative clinical trials involving several institutions. To alleviate this
difficulty, group sequential plans have been proposed for examining the data
only a few times (Pocock, 1977). Group sequential plans capture most of the
efficiencies of continuous monitoring, and they greatly reduce, but do not
eliminate, practical problems (see DeMets, Williams and Brown, 1982). In the
next two sections, we therefore describe group sequential plans for survival
data.
Rather than attempt to monitor the data continuously, one might choose to
look only a few times and to stop the trial if early evidence against the null
hypothesis is strong. Pocock (1977) proposed this 'group sequential' approach
for a variety of clinical trial response variables, and he concluded that "In
general, a group sequential design with even a quite small number of groups
provides a substantial reduction in average sample size when treatment
differences exist. In fact, such a reduction may often be close to or even better
than that achieved by standard sequential designs". Further work by Pocock
(1981) suggested that it was rarely worthwhile to examine accumulating in-
800 Mitchell Gail
formation more than five times if a repeated significance test boundary is used.
The following simple normal model has been used to construct group
sequential boundaries. After k groups of g observations each, the standardized
statistic
Gail, DeMets and Slud (1982) compared the simulated performance of the
logrank test, computed after each eighteen deaths, with the theoretical predic-
tions of the simple normal model. They chose K = 5 maximum looks, and
considered four two-sided boundaries. The Haybittle (1971) boundary (H) has
bk = 3.0 for k = 1, 2, 3, 4 and b5 = 1.96. This boundary is conservative and only
detects extreme early differences. Its size is 0.053, slightly in excess of the
nominal 0.05 level. The Pocock boundary (P) is a repeated significance test
boundary with bk = 2.413 for k = 1, 2, 3, 4, 5. Note that bk exceeds 1.96 in order
to assure proper size; had 1.96 been used instead, the size of this test would be
about 0.142. The O'Brien-Fleming boundary (O) is bk = (4.149x 5/k) la for
k = 1, 2, 3, 4, 5. A fixed sample size boundary (F) was also defined as bk = 100
for k = 1, 2, 3, 4, and b5 = 1.96.
If the increments of the logrank score statistic S, computed after each g
Nonparametric frequentist proposals for monitoring comparative survival studies 801
deaths, satisfied the assumptions (1)-(3) above, then the properties of these
boundaries could be calculated theoretically. For proportional hazards alter-
natives with hazard ratio exp(0), the expectation of an increment based on g
deaths is approximately g6 = gO~4 for small 0, and the variance is go-2- g/4.
Hence the required noncentrality parameter is
A = (gO/4)(g/4)-"2= 0 ( g / 4 ) 1/2 .
Table 1
Theoretical properties of four group sequential boundaries with K = 5 and g = 18 deaths for each
increment of the logrank score a
null case
size 0.050 0.050 0.053 0.050
/~ 4.876 4.964 4.977 5.000
relative hazard
exp(0) = 2
power 0.845 0.901 0.909 0.907
/~ 3.083 3.648 3.864 5.000
healthier patients enter later in the trial, and that the other boundaries are less
sensitive to trends in the life distribution of entering patients. Altogether, these
results suggest that the simple normal model may be used to design and study
group sequential boundaries for the logrank test with fixed numbers of deaths
in each group.
It is a strength of methods based on analyses at fixed increments of V(d) that the
properties of proposed boundaries can be evaluated prior to the experiment,
either by use of the simple normal model or through simulations. Moreover,
standardized designs are available for the practitioner, who needs no special
facilities to use them. Recent work on the selection of group sequential
boundaries includes that ot! DeMets and Ware (1980, 1982), who proposed
one-sided rejection regions, Whitehead and Stratton (1983), who propose an
asymmetric triangular continuation region, Gould and Pecore (1982), who
insert an inner wedge to permit earlier stopping under the null hypothesis, and
Pocock (1981) and McPherson (1982), who consider how many repeated looks
at the data are useful.
Standard confidence intervals and point estimates of the log relative risk, 0,
based either on the partial likelihood of Cox (1972) or on likelihoods from
parametric models, do not have their intended frequentist properties in hypo-
thetical repetitions of the group sequential trial. However, valid frequentist
confidence intervals have been constructed for group sequential plans with
predetermined boundaries. Jennison and Turnbull (1983a) obtained valid
confidence intervals for the binomial parameter following group sequential
tests by defining an ordering on the outcomes, and Tsiatis, Rosner and Mehta
(1983) have extended these ideas to the standard normal model. They obtained
confidence intervals, which can be applied to the log relative hazard, 0. For our
problem, the outcomes are ordered according to how much the data favor
survival c u r v e G2 over survival curve G1 = G~ ~p(). Thus G2 is most favored if
Z(1) rejects at a large positive value, somewhat less favored if Z(2) rejects at a
large positive value and so forth, with G1 most favored if Z(1) has a large
negative value. With this ordering of the group sequential outcome space, one
can compute the probability that the results would favor G2 as much or more
than did the observed outcome. These probabilities are inverted to produce
confidence intervals on 0. Similar ideas are found in the work of Fairbanks and
Madsen (1982) and Madsen and Fairbanks (1983), who define P values accord-
ing to an implicit ordering of the possible outcomes.
The confidence intervals of Tsiatis, Rosner and Mehta (1983) are designed
for the analysis after a group sequential boundary has been used to test the null
hypothesis. Jennison (1982) and Jennison and Turnbull (1983b) construct
repeated confidence intervals based on the logrank test that can be calculated
as the trial proceeds. These confidence intervals can be used for estimation of
the log relative hazard 0 and for hypothesis tests which reject whenever a
sequentially computed confidence interval excludes the null value 0 -- 0. To use
the method of Jennison and Turnbull (1983b), one must pre-specify a group
sequential boundary, bk. Based on the fact that the logrank statistic, S(Tkg), is
Nonparametric frequentist proposals for monitoring comparative survival studies 803
approximately normally distributed with mean kgO/4 and variance kg/4, com-
pute the confidence interval after k groups of g deaths from
One rejects the null hypothesis if Z(tl) i> b(~Ol)OJ]q/2. If no rejection occurs at/1,
one tests again at t2 and determines b(w2) from P[B(oJ1)< b(oJx) , B(o~;)~
b(w2)] = a*(w2)- a*(oJ1). Again, one rejects if Z ( t 2 ) > ~ b(o~2)o~ 1/2. The pro-
Nonparametric frequentist proposals for monitoring comparative survival studies 805
cedure continues, based on such recursive calculations, either until the null
hypothesis has been rejected or until D deaths have been observed (w = 1).
The function a*(w) determines how rapidly the size of the test is to be 'used
up'. Lan and DeMets define functions a*(w) that correspond to the O'Brien-
Fleming (O) and Pocock (P) boundaries described in Section 4 if tl, tz. . . . . tk
are chosen to correspond to equal increments wk--wk-1 = 1/K for k =
1, 2 . . . . . K and o~0= 0. Indeed, the procedures in Section 4 can be regarded as
special cases of the Lan-DeMets procedure. However, when analyses are
performed at the times of meetings of the monitoring committee, the cor-
responding boundaries will be unpredictable. Therefore, as for the procedure
of Slud and Wei, it is not possible to study the properties of these boundaries in
advance without making strong assumptions about accrual rates and survival.
Canner (1976, 1977) discusses group sequential monitoring of survival data at
designated calendar times, but, for staggered entry, he studies a parametric test
of the equality of two exponential survival distributions (Canner, 1977).
6. Curtailed experiments
Suppose a fixed sample clinical trial is planned, and that the trial has size
a = P [ R [H0] and type 2 error/3 = P[/~ [Ha] where R is the event that the test
statistic falls in the rejection region at completion of the planned experiment
and /~ is the complementary event. If one reaches a point in the trial from
which it is known with certainty whether the final result will be in R or in/~,
irregardless of any future observations, one could stop the trial and either
accept or reject the null hypothesis. Such a procedure is called curtailment.
If all patients enter simultaneously, curtailment is feasible for survival
statistics like (2.1), because a sequential path may be so extreme that all
possible arrangements of later ranks lead to R, for example. Halperin and
Ware (1974) studied such curtailment for Gehan's modification of the Wilcoxon
test, and Verter (1983) extended their results to other statistics like (2.1),
including the logrank test.
The procedures above are very conservative because they require that a
foregone conclusion be reached before curtailment. To permit somewhat
earlier stopping one might require only that the probability that the present
decision will be reversed is small. Such a procedure is called stochastic
curtailment. To illustrate this idea, suppose one plans a 'fixed sample' logrank
analysis after D deaths, with rejection if S ( T D ) E R. The quantity D and the
rejection region R are chosen to give size a and power 1 -/3 for a hazard ratio e .
Suppose, however, one examines the logrank score after each death and plans to
reject H0 at the smallest d ~<D such that
where 7 is near 1. Also, if the new treatment is not doing well, one might wish
806 Mitchell Gail
Lan, Simon and Halperin (1982) proved the very general result that the overall
size of this procedure is a/y and the power is 1 -/3/7'. For example if Y = 0.95,
stochastic curtailment only increases the size from a = 0.05 to a/y = 0.0526. To
attain adequate power, a value of y' as small as 0.5 may be used. For example,
if the original fixed sample power is 1 - / 3 = 0.9, the stochastically curtailed
power is 1 - 0.10/0.5 = 0.8, which is often acceptable.
There are practical problems with this procedure. Early stopping to reject H0
according to (6.1) is feasible, because the probability in (6.1) can be calculated
under H0. For the logrank statistic, the independent increments property makes
the calculation tractable. Early stopping to accept the null hypothesis according
to (6.2) is more problematic. The probability in (6.2) depends on the unknown
alternative. For proportional hazards, one must specify 0. If the true 0 is larger
than anticipated, one will overestimate the left hand side of (6.2), leading to
inappropriate early stopping and loss of power. There is the further technical
problem that, under Ha, the logrank increments are correlated, and no theory
is yet available for the computation of (6.2). The calculation of (6.2) is also
problematic in situations which require that one guess at the future form of the
two survival distributions. Halperin, Lan, Ware, Johnson and DeMets (1982)
discuss this difficulty in connection with stochastic curtailment of a comparison
of the proportions dead on the two treatments.
All the methods for monitoring that we have discussed so far are based on
the premise that a final treatment comparison will be made at the moment the
trial is stopped. In some circumstances, it may be desirable to separate the
decision to stop the accrual of new patients from the final analysis for the
comparison of treatments. Rubinstein and Gail (1982) have studied the follow-
ing procedure and have used it in the design and monitoring of lung cancer
trials.
It is agreed that the final two-sample treatment comparison, based on the
proportional hazards model, will be made after D = 90 deaths have occurred.
This assures a power 0.9 against relative hazard e = 2 for the two-sided
a = 0.05 level logrank test. The standard deviation of the estimate of 0 will be
approximately V ( D ) -1/2- (4/90) a/z= 0.21. In essence, this is a fixed sample
experiment, based on D deaths, as regards the final treatment comparison.
However, Rubinstein and Gail (1982) show that one can monitor the ac-
cumulating survival data with the logrank statistic and stop accrual early if an
important treatment difference begins to emerge, without perturbing the final
statistical comparison of treatments, so long as one waits until D deaths have
Nonparametric frequentist proposals for monitoring comparative survival studies 807
been observed before performing the final analysis of treatment effect. This
strategy takes advantage of the fact that, at the moment accrual is stopped,
there are a number of patients still on study who will, in time, provide the
required additional survival information. The rules for stopping accrual are
quite flexible and usually contain the proviso that the time to death D will not
be unduly prolonged by early termination of accrual. The technique should not
be used with statistics that have correlated increments.
A major advantage of this procedure is that one can prespecify the precision
of the final estimate of treatment effect and that the final fixed sample analysis
is simple and noncontroversial. Also, the frequentist analysis and the relative
likelihood analysis will agree, because for such fixed sample experiments, the
likelihood ratio is a monotone function of the observed p value. Furthermore,
the data may be monitored whenever it is convenient, and rigid rules for
stopping accrual are not necessary.
This technique is not universally applicable, because it is assumed that all
accrued patients will remain on their originally assigned treatment until D = 90
deaths are observed. This is often appropriate in a cancer trial in which the
treatment is given early in the course of follow-up, because there is usually no
compelling evidence that delayed treatment with the apparently better regime
could benefit those initially given the other treatment. However, in a trial of a
chronically administered treatment, such as a beta-blocker to prevent sudden
death from cardiac arrhythmia, there may be an urgent need to switch all
patients to the apparently preferable treatment as soon as substantial evidence
of a treatment difference arises. In such cases, one cannot usefully separate the
decision to stop accrual from the final analysis of treatment effect.
8. Discussion
Several themes emerge from this survey. First, good progress has been made
in developing a null distribution theory for sequential application of survival
statistics with staggered entry. Convergence to a Gaussian process has been
proved for a wide class of statistics for analysis in real time (see Slud, 1983,
Tsiatis, 1982 and Harrington, Fleming and Green, 1982), and Sellke and
Siegmund (1983) have proved convergence to a Brownian motion for the
logrank test in 'process time', V(d). It would be useful to have similar results
for other statistics in process time. No rigorous theory has been published for
fixed alternatives, even for the logrank test with proportional hazards. This
deficit impedes the evaluation of group sequential designs, and it prevents the
precise evaluation of probabilities like (6.2) needed for stochastic curtailment.
Nonetheless, at least for the logrank statistic with relative hazards in the range
0.5 to 2.0, a Brownian motion model with drift leads to results that are
sufficiently accurate for planning group sequential trials (Gail, DeMets and
Slud, 1982).
A second theme is flexibility. The procedure of Slud and Wei (1982) permits
808 Mitchell Gail
analyses at arbitrary real times, though the number of looks and allocation of
a level must be prespecified. If one is willing to specify a final intended amount
of process information V(D), and a function for allocating a level, then the
methods of Lan and DeMets (1983) may be used to look at the data whenever
desired. Stochastic curtailment (Lan, Halperin and Simon, 1982) may also be
employed at any time in the trial, provided the required probabilities can be
calculated. The method of accrual monitoring (Rubinstein and Gail, 1982)
offers investigators great freedom in the monitoring process, provided they
agree to defer a treatmen! decision until the prespecified final information,
V(D), has been obtained.
Implicit in the variety of proposed boundaries and techniques are underlying
judgments as to the relative importance of early stopping and the need for
compelling medical evidence. Accrual monitoring, stochastic curtailment, and
conservative group sequential boundaries such as those proposed by t-Iaybittle
(1971) and O'Brien and Fleming (1979) generally result in more clinical data
and less potential for early stopping. Additional clinical data and longer
follow-up times can be especially helpful in survival studies, because later
survival experience on the two treatments may not resemble the early relative
performance.
Those procedures which offer the greatest potential for early stopping pose
special problems at the time of analysis, because a frequentist analysis that is
based on hypothetical repetitions of the experiment with its stopping procedure
can differ from a likelihood based analysis that ignores the stopping rule
(Cornfield, 1966). A medical investigator, who was asked to write up the results
of a clinical trial that stopped early, was frustrated by the advice of the
following three statistical consultants. The first statistician, who had planned
the original group sequential study with the Pocock boundary, presented a
confidence interval for the hazard ratio. The second statistician, who had
replaced the first during the course of the study, presented a different
confidence interval, because he had decided to switch to an O'Brien-Fleming
boundary. The third statistician, who had not had access to the information on
experimental design and who said he didn't want it anyhow, based his analysis
on the observed likelihood and produced yet a third confidence interval.
Relieved, the medical investigator chose the last confidence interval, since it
agreed most closely with his own fixed sample analysis. This story, which I
hope never happened, is a reminder that there is disagreement among statisti-
cians as to whether the strict frequentist analysis is meaningful or appropriate
for a trial which may never be repeated and for which it is unlikely that any
pre-specified stopping procedure will be strictly observed.
Acknowledgements
comments, the reviewer for pointing out important references, and Julie
Paolella for typing the manuscript. This paper was used as the basis of a
presentation to the 1983 Conference on Biostatistics in Philadelphia, sponsored by
Temple University and the Merck Company in Philadelphia.
References
Anderson, T. W. (1960). A modification of the sequential probability ratio test to reduce sample
size. Anals of Mathematical Statistics 31, 165-197.
Anscombe, F. J. (1963). Sequential medical trials. Journal of the American Statistical Association
58, 365-383.
Armitage, P. (1960). Sequential Medical Trials, Blackwell, Oxford.
Armitage, P. (1975). Sequential Medical Trials, Wiley, New York.
Armitage, P., McPherson, C. K. and Rowe, B. C. (1969). Repeated significance tests on accumulat-
ing data. Journal of the Royal Statistical Society A 132, 235-244.
Bechhofer, R. E. (1954). A single-sample multiple decision procedure for ranking means of normal
populations with known variances. Annals of Mathematical Statistics 25, 16-39.
Canner, P. L. (1976). Repeated analysis of clinical trial data. In: Proceedings of the Ninth
International Biometric Conference, Volume 1, Biometric Society, Boston.
Canner, P. L. (1977). Monitoring treatment differences in long-term clinical trials. Biometrics 33,
603-615.
Chatterjee, S. K. and Sen, P. K. (1973). Nonparametric testing under progressive censorship.
Calcutta Statistical Association Bulletin 22, 13-50.
Colton, T. (1963). A model for selecting one of two medical treatments. Journal of the American
Statistical Association 58, 388-400.
Cornfield, J. (1966). Sequential trials, sequential analysis, and the likelihood principle. American
Statistician 20, 18-23.
Cox, D. R. (1972). Regression models and life tables (with discussion). Journal of the Royal
Statistical Sgciety B 34, 187-202.
Davis, C. E. (1978). A two-sample Wilcoxon test for progressively censored survival data.
Communication Statistical Theory and Methods A 7, 389-398.
DeMets, D. L. and Lan, K. K. G. (1984). An overview of sequential methods and their applications
in clinical trials. Communications in Statistics Series A (to appear).
DeMets, D. L. and Ware, J. H. (1980). Group sequential methods in clinical trials with a one-sided
hypothesis. Biometrika 67, 651-660.
DeMets, D. L. and Ware, J. H. (1982). Asymmetric group sequential boundaries for monitoring
clinical trials. Biometrika 69, 661--663.
DeMets, D. L., Williams, G. W. and Brown, B. W. Jr. and the NOTF Research Group (1982). A
case report of data monitoring experience: the nocturnal oxygen therapy trial. Controlled Clinical
Trials 3, 113-124.
Fairbanks, K. and Madsen, R. (1982). P values for tests using a repeated significance test design.
Biometrika 69, 69-74.
Gail, M. H. (1982). Monitoring and stopping clinical trials. In: V. Mik6 and K. E. Stanley, eds.,
Statistics in Medical Research : Methods and Issues with Applications in Cancer Research. Wiley,
New York, pp. 455-484.
Gail, M. H., DeMets, D. L. and Slud, E. V. (1982). Simulation studies on increments of the
two-sample logrank score test for survival time data, with application to group sequential
boundaries. In: J. Crowley and R. A. Johnson, eds., Survival Analysis, Institute of Mathematical
Statistics, Hayward, California, pp. 287-301.
Gehan, E. A. (1965). A generalized Wilcoxon test for comparing arbitrarily singly-censored
samples. Biometrika 52, 203-223.
810 Mitchell Gail
Gill, R. D. (1980). Censoring and Stochastic Integrals. Mathematical Centre Tracts 124, Mathema-
tische Centre, Amsterdam.
Gould, A. L. and Pecore, V. J. (1982). Group sequential methods for clinical trials allowing early
acceptance of H0 and incorporating costs. Biometrika 69, 75-80.
Halperin, M. and Ware, J. (1974). Early decision in a censored Wilcoxon two-sample test for
accumulating survival data. Journal of the American Statistical Association 69, 414--422.
Halperin, M., Lan, K. K. G., Ware, J. H., Johnson, N. J. and DeMets, D. L. (1982). An aid to data
monitoring in long-term clinical trials. Controlled Clinical Trials 3, 311-323.
Harringron, D. P. and Fleming, T. R. (1982). A class of rank test procedures for censored survival
data. Biometrika 69, 553-566.
Harring~on, D. P., Fleming, T. R. and Green, S. J. (1982). Procedures for serial testing in censored
survival data. In: J. Crowley and R. A. Johnson, eds., Survival Analysis, Institute of Mathematical
Statistics Monograph Series, Volume 2, Hayward, California.
Haybittle, J. L. (1971). Repeated assessment of results in clinical trials of cancer treatment. British
Journal of Radiology 44, 793-797.
Jennison, C. (1982). Sequential methods for medical experiments. Ph.D. Dissertation, Cornell
University.
Jennison, C. and Turnbull, B. W. (1983a). Confidence intervals for a binomial parameter following
a multistage test with application to MIL-STD 105D and medical trials. Technometrics 25, 4%58.
Jennison, C. and Turnbull, B. W. (1983b). Repeated confidence intervals for sequential clinical
trials. Controlled Clinical Trials 5, 33-45.
Jones, D. and Whitehead, J. (1979). Sequential forms of the logrank and modified Wilcoxon tests
for censored data. Biometrika 66, 105-113.
Jones, D. R., Newman, C. E. and Whitehead, J. (1982). The design of a sequential clinical trial for
the comparison of two lung cancer treatments. Statistics in Medicine 1, 73-82.
Kalbfleisch, J. D. and Prentice, R. L. (1980). The Statistical Analysis of Failure Time Data, Wiley,
New York.
Koziol, J. and Petkau, A. J. (1978). Sequential testing of the equality of two survival distributions
using the modified Savage statistic. Biometrika 65, 615-623.
Lagakos, S. W. (1982). Inference in Survival Analysis: nonparametric tests to compare survival
distributions. In: V. Mik6 and K. E. Stanley, eds., Statistics in Medical Research: Methods and
Issues with Applications in Cancer Research. Wiley, New York, pp. 340-364.
Lan, K. K. and DeMets, D. L. (1983). Discrete sequential boundaries for clinical trials. Biometrika
70, 659-663.
Lan, K. K. G., Simon, R. and Halperin, M. (1982). Stochastically curtailed tests in long-term
clinical trials. Communications in Statistics: Sequential Analysis 1, 207-219.
Lininger, L., Gail, M. H., Green, S. B. and Byar, D. P. (1979). Comparison of four tests for
equality of survival curves in the presence of stratification and censoring. Biometrika 66,
419--428.
Madsen, R. W. and Fairbanks, K. B. (1983). P values for multistage and sequential tests.
Technometrics 25, 285-293.
Majumdar, H. and Sen, P. K. (1978a). Nonparametric tests for multiple regression under progres-
sive censoring. J. Multivariate Analysis 8, 73-95.
Majumdar, H. and Sen, P. K. (1978b). Nonparametric testing for simple regression under
progressive censoring with staggering entry and random withdrawal. Communications in Statistics
Series A 7, 34%371.
Mantel, N. (1966). Evaluation of survival data and two new rank tests arising in its consideration.
Cancer Chemotherapy Reports 50, 163-170.
McPherson, K. (1982). On choosing the number of interim analyses in clinical trials. Statistics in
Medicine 1, 25-36.
McPherson, C. K. and Armitage, P. (1971). Repeated significance tests on accumulating data when
the null hypothesis is not true. Journal of the Royal Statistical Association A 134, 15-65.
Mehrotra, K. G., Michalek, J. E. and Mihalko, D. (1982). A relationship between two forms of
linear rank procedures for censored data. Biometrika 69, 674--676.
Nonparametric frequentist proposals for monitoring comparative survival studies 811
Meier, P. (1979). Terminating a trial - the ethical problem. Clinical Pharmacology and Therapeutics
25, 633--640.
Morton, R. (1978). Regression analysis of life tables and related nonparametric tests. Biometrika
65, 32%333.
O'Brien, P. C. and Fleming, T. R. (1979). A multiple testing procedure for clinical trials. Biometrics
35, 54%556.
Pocock, S. J. (1977). Group sequential methods in the design and analysis of clinical trials.
Biometrika 64, 191-199.
Pocock, S. J. (1981). Interim analyses and stopping rules for clinical trials. In: J. F. Bithell and R.
Coppi, eds., Perspectives in Medical Statistics. Grune and Stratton, New York, pp. 191-223.
Prentice, R. (1978). Linear rank tests and right censored data. Biometrika 65, 167-179.
Prentice, R. and Marek, P. (1979). A qualitative discrepancy between censored data rank tests.
Biometrics 35, 861-867.
Rubinstein, L. V. and Gail, M. H. (1982). Monitoring rules for stopping accrual in comparative
survival studies. Controlled Clinical Trials 3, 325--343.
Schoenfeld, D. (1981). The asymptotic properties of nonparametric tests for comparing survival
distributions. Biometrika 68, 316-319.
Sellke, T. and Siegmund, D. (1983). Sequential analysis of the proportional hazards model.
Biometrika 70, 315-326.
Sen, P. K. (1979). Rank analysis of covariance under progressive censoring. Sankhya Series A 41,
147-169.
Sen, P. K. (1981). The Cox regression model, invariance principles for some induced quantile
processes and some repeated significance tests. The Annals of Statistics 9, 10%121.
Sen, P. K. and Ghosh, M. (1972). On strong convergence of regression rank statistics. Sankhya A
34, 335-348.
Sinha, A. N. and Sen, P. K. (1979). Progressively censored tests for clinical experiments and life
testing based on weighted empirical distributions. Communications in Statistics Series A 8,
871-897.
Sinha, A. N. and Sen, P. K. (1982). Tests based on empirical processes for progressive censoring
schemes with staggering entry and random withdrawal. Sankhya Series B 44, 1-18.
Slud, E. V. (1983). Sequential linear rank tests for two-sample censored survival data. Submitted
for publication.
Slud, E. V. and Wei, L. J. (1982). Two-sample repeated significance tests based on the modified
Wilcoxon statistic. Journal of the American Statistical Association 77, 862-868.
Tarone, R. and Ware, J. (1977). On distribution-free tests for equality of survival distributions.
Biome~ka 64, 156-160.
Tsiatis, A. A. (1982). Repeated significance testing for a general class of statistics used in censored
survival analysis. Journal of the American Statistical Association 77, 855-861.
Tsiatis, A. A., Rosner, G. L. and Mehta, C. R. (1983). Exact confidence intervals following a group
sequential test. Dana-Farber Cancer Institute Technical Report 310Z.
Tsiatis, A. A. and Tritcbler, D. L. (1982). Application of grouped sequential tests to survival
analysis in clinical trials. Technical Report 275 ERCZ, Sidney Farber Cancer Institute Depart-
ment of Biostatisties.
Verter, J. (1983). Early decision using simple rank statistics for accumulating survival data. Ph.D.
Dissertation, University of North Carolina at Chapel Hill.
Wald, A. (1947). Sequential Analysis. Wiley, New York.
Whitehead, J. (1983). Design and Analysis of Sequential Clinical Trials. Wiley, New York.
Whitehead, J. and Jones, D. (1979). The analysis of sequential clinical trials. Biometrika 66,
443--452.
Whitehead, J. and Stratton, I. (1983). Group sequential clinical trials with triangular continuation
regions. Biometrics 39, 227-236.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 "71A
Elsevier Science Publishers (1984)813--830 ..)'1"
P a u l W. M i e l k e , Jr.
1. Introduction
This work has been supported by the Division of Atmospheric Resources Research, Bureau of
Reclamation, U.S. Department of the Interior, under Contract 8-07-83-V0009 and Cooperative
Agreement 2-07-83-V0273,by the National ScienceFoundation, under Grant ATM-81-07056, and
by National Environmental Satellite Service, National Oceanic and Atmospheric Administration,
U.S. Department of Commerce, under Contract NAS1RA-H-00001.
813
814 Paul W. Mielke, Jr.
The techniques and examples presented in this section are based on multi-
response permutation procedures (MRPP). Since these techniques are rela-
tively new, a description of M R P P along with their relation to well known
techniques will precede some recent meteorologica! applications involving wea-
ther modification and tropical storms.
i=1
where
=
l<J
is the average distance function value for all distinct pairs of objects in
subgroup Si (i = 1. . . . , g), ni/> 2 is the number of a priori classified objects in
subgroup Si (i = 1. . . . . g), K = E~=1 hi, ng+l = N - K is the number of remain-
ing (unclassified) objects in the excess subgroup Sg+l (this is an empty subgroup
in many applications), Et<j is the sum over all I and J such that 1 ~< 1 < J ~< N,
qt/(o~,) is 1 if o~, ~ Si and 0 otherwise, C / > 0 (i = 1. . . . . g), and E~=1 C,. = 1. The
underlying permutation distribution of 6 (the null hypothesis) assigns equal
probabilities to the
by/xs, o-g and 3'8. Under the null hypothesis, preliminary findings (Mielke et al.,
1976; Mielke, 1978, 1979b) indicated some situations when the asymptotic
distribution of N(8 - IXs) is nondegenerate with 3'8 being substantially negative.
Based on results due to Sen (1970, 1972), O'Reilly and Mielke (1980) presented
general theorems for the multivariate case of M R P P which characterize situa-
tions in which the distribution of N~/2(6- ixs) is asymptotically normal under
the null hypothesis. More recently Brockwell et al. (1982) presented theorems
for the univariate case of M R P P which delineate distributions for situations
(probably the most important situations) in which the nondegenerate dis-
tribution of N ( 8 - Ixs) is not asymptotically normal under the null hypothesis
(with rare exceptions, invariance principles fail for these situations). The
multivariate generalizations of the results byjBrockwell et al. (1982) and special
situations analogous to those considered by Mielke and Sen (1981) for linear
rank statistics are open questions which require further attention.
The symmetric distance function (AI.j) is extremely important since it defines
the structure of the underlying analysis space of MRPP. The form of the
symmetric distance functions considered in this paper will be confined to
AI, J = X k i -- X k j I p
Xk= 1
for either small, moderate or extremely large sample sizes (Mielke, 1978,
1979b; O'Reilly and Mielke, 1980; Brockwell et al., 1982), approximate P-
values are based on the Pearson type III distribution which compensates for
the fact that the underlying permutation distribution is often substantially
skewed (Harter, 1969; Mielke et al., 1981a). In particular, the standardized test
statistic given by
T = (6 -/xs)/o'~
is presumed to follow the Pearson type III distribution with the density
function given by
where -oo < y < -2/% If To = (60- tx~)/o-~ and y = y~ ~<-0.001, then
f r0
P(6 ~< 6o)- J - | f(y) dy
/z8 = D(1),
~r2 = 2 _ C2[n(2)]
,~ , ~-1 - [N (2)]-1 O ( 2 ) - 20(2') + D(2")]
~'8 = { E [ ~ ~] - 3 ~ 8 o - g - ~ } / o - ~ ,
and
g g
D(1) 1
1
D (2) = ~ Z a }l,J2, D(2') = 1 Z as,,,2aJ , 3,
1 ~ AjvhA6d4
D (2") = ~7~
1 1
O (3) = ~-~ Z z~31,J2, D (3t) = Z A 21,J2AJl,J3,
1 1
0(3") = ~ Z A 21,hA'3,J,, 0(3"') ~---~ ' ~ Z Ajl,J2Ajl,J3AJ4,J 5 '
D(3") 1 1 ~ aj,.hAjij3Ah,j3,
D(3**) = 1
and
where J1, Jz, J3, J4, J5 and J6 denote distinct integers from 1 through N and the
sums are over all permutations of distinct indices. Efficient computations to
obtain these parameters are given in the appendix of Mielke et al. (1976). The
Pearson type III distribution has been investigated and appears to be an
excellent approximation (Mielke and Berry, 1982; Mielke et al., 1982).
A number of different choices for Ci have been mentioned and/or used. An
lnefficient choice of Ci when the ni's are not equal is (7/= rt~2)/[]~= 1 n}Z)]. This
choice was used by Mantel and Valand (1970) and Mielke et al. (1976) and
discussed by Mielke (1978) and O'Reilly and Mielke (1980). Another inefficient
choice of Ci when the ni's are not equal is G = 1/g (O'Reilly and Mielke, 1980).
An efficient choice of Ci when the ni's are not equal is Ci = ni/K. This choice was
suggested in Mielke (1979b) and used in Mielke et al. (1981a, 1981b, 1982).
Because the choice of Ci = ndK causes the second of the two expressions
comprising cr~ (the potentially dominant expression when large subgroup sizes are
involved) to vanish when N = K, this is the recommended choice. An asymp-
totically equivalent choice of C~which is also efficient when the ni's are not equal is
Ci = (ni - 1)/(K - g). As will be shown, this last choice of C/underlies the one-way
analysis of variance and many classical linear rank statistics since the permutation
tests associated with these statistics are also special eases of MRPP. Note that all of
the choices for C~ mentioned above are equivalent if the ni's are equal
(i = 1 , . . . , g ) .
818 Paul W. M~elke, Jr.
2.2. R e l a t i o n to w e l l - k n o w n methods
This example is based on two recent papers (Mielke et al., 1981c, 1982)
which were prompted by concerns (Mielke, 1979a) involving earlier evaluations
(Mielke et al., 1971) of the Climax I and I1 wintertime orographic cloud seeding
experiments. The concerns resulted from suggestions that the positive results of
Mielke et al. (1971) might have been the consequences of natural regional
increases during treated experimental units (24 hour periods) rather than being
primarily attributed to a cloud seeding treatment (i.e., a type I statistical error).
Since data from specified National Weather Service control stations were not
used by Mielke et al. (1971), Mielke et al. (1981c) obtained even stronger
statistical results when the control data were used to adjust for the concern
regarding natural regional increases during the treated experimental units.
Meteorological applications of permutation techniques 819
Although the results of Mielke et al. (1982) are in general agreement with those
of Mielke et al. (1981c), the methodology of Mielke et al. (1981c) is attacked in
Mielke et al. (1982). The following discussion describes the reason for this
attack and also compares selected results of Mielke et al. (1981c) with results of
the alternative methodology suggested in Mielke et al. (1982).
Before describing the attack on the methodology of Mielke et al. (1981c), the
methodology of that paper is presented. Let (xl, Yl). . . . . (xN, YN) denote N
pairs of control and target precipitation amount observations, respectively,
associated with N experimental units. A treatment (cloud seeding with silver
iodide) which should only affect target observations is applied to a randomly
obtained subset of n experimental units and no treatment is applied to the
remaining subset of N - n experimental units. The statistical analyses are
based on residual data (er = Y l - b x t for I = 1, . . . , N) based on a least squares
regression line, i.e., the estimate of b minimizes
N
~'~ ( y l - bxx ) Z .
I=1
f, = 0 if I = (N + 1)/2,
II - ( N + 1)/21 w if I > (N + 1)/2,
rank tests. The alternative methodology used by Mielke et al. (1982) involved
(1) utilization of median regression residual data, i.e., the estimate of b
minimizes
N
Z [yl - bx~[,
I=1
and (2) the two-sample rank tests introduced by Mielke et al. (1981b) where
nl=n, n2 = N - n , g=2, v = l , Ci=ni/N for i = l and 2, and fr is the
previously defined score function with w = 0, 1 and 2. As pointed out by Huber
(1974), a median regression line is resistant against arbitrary residuals with
large magnitudes. In addition, an extremely efficient algorithm for obtaining
median regression residual data has been recently developed by Bloomfield
and Steiger (1980). As previously emphasized, the underlying analysis space
associated with v = 1 is a simple Euclidean space (i.e., does not involve either
the non-metric or distorted metric spaces associated with other choices of v).
The primary motivation for preferring the methodology of Mielke et al.
(1982) over that of Mielke et al. (1981c) is the physical interpretation.
Meteorologists generally believe that cloud seeding will be most effective only
when relatively small precipitation amounts are involved (i.e., cases associated
with small residuals). The basis for this belief is that the large precipitation
amounts are associated with synoptic storms which are endowed with an
abundance of natural ice crystals (i.e., there is no need for additional ice
crystals induced by cloud seeding). Since a few large residuals may severely
influence the position of a least squares regression line, the immediate con-
sequence is that the residual data considered most important by meteorologists
(small values) may be distorted in a totally unreasonable manner by a very small
subset of the residual data that is considered least important. With respect to
analysis techniques, it is intrinsically assumed by investigators in most any
discipline that the underlying analysis space of a chosen statistical technique is
congruent with the perceived Euclidean space of the data being analyzed. Since
very few investigators recognize the complexity of the underlying analysis space
associated with either linear rank tests or closely related techniques based on
v = 2, a reasonable demand is that a statistical technique must possess an
underlying analysis space (i.e., v = 1) which is compatible with the perceived
data space in question.
Table 1 presents selected two-sided P-value comparisons, involving experi-
mental units with warm 500 mb temperatures, between the old methodology of
Mielke et al. (1981c) and the new methodology of Mielke et al. (1982). The
comparisons of Table 1 are presented separately for the independent Climax I
and II experiments. The major differences in the results of the Climax II
experiment are caused by a few very large residuals. Further discussion and
details which include physical interpretations and the availability of data
associated with these and other comparisons are given in Mielke et al.
(1981c, 1982).
Meteorological applications of permutation techniques 821
Table 1
Two-sided P-value comparisons between the
old and new methodologies involving warm
500 mb temperatures (-20 to -llC) for the
independent Climax I and II experiments
Climax I Climax II
periods of time are needed to fill in various positions (radial belts and octants)
relative to a 'conceptual composited' storm's center and direction. The radial
belts are annuli of two degrees thickness (1 to 3 , 3 to 5 , 5 to 7 , etc.) about the
storm's center and the octants (north, northwest, west, etc.) correspond to the
storm's forward direction passing through the center of the north octant.
Comparisons of one or more responses (made commensurate by appropriate
scaling) between developing and nondeveloping storms are considered. The
classification of a developing and nondeveloping storm at a given time is
determined from the known history of each storm. Substantial differences
between one or more responses for given radial belt, octant and pressure level
combinations would yield a basis for future forecasts of development by
obtaining essential information with appropriate aircraft penetrations. Some
preliminary results based on the west Atlantic data set are given in Table 2.
The P-values of Table 2 are based on M R P P with v = 1. The results suggest
that tangential winds and height responses may be very important for sub-
sequent forecast criteria. This same procedure will also be used to develop
forecast criteria for storm direction changes and other phenomena.
The present example is one of an apparent endless number of multi-response
situations which are routinely encountered in meteorology. Another typical
example involving seasonal mean sea-level pressure pattern changes over broad
Table 2
P-values for testing response differences between developing
and nondeveloping storms with responses restricted to the 5
to 7 radial belt
Pressure level
3.1. Methodologicaldescription
Let b blocks and g treatments be associated with a randomized block design.
Let x]j = (xuj,..., xnj) denote r commensurate response measurements cor-
responding to treatment i and block j (the response measurements might again
be functions of response measurements or residuals adjusted by predictors).
The modified MRPP statistic for this situation is given by
: [
g E E a (Xij, Xik)
i=lj<k
probabilities to the
M = (g !)b
A (x, y) =
(2r
"h=l 7
Xh -- Yh]P
where p 1> l and v > 0. Since the choice of the symmetric distance function
defines the structure of the underlying analysis space of these procedures, the
discussion in Section 2 concerning this choice is equally pertinent here.
In a manner analogous to MRPP, small values of 6 imply a concentration of
the response measurements associated with each of the g treatments (i.e., over
blocks). Therefore P(6 <~6o) is again the P-value associated with 60 (the
realized value of 6). Though an efficient algorithm for calculating the exact
P-value for an observed value of 6 exists, this approach becomes prohibitively
expensive when M is large (e.g., greater than 106). Noting that b = 6 and g = 4
yields M - 1.9 x 108 or that b = 4 and g = 6 yields M - 2.7 x 10 21, the necessity
for an approximation technique is obvious for even relatively small randomized
block configurations.
As in Section 2, approximate P-values are again based on the Pearson type
III distribution to compensate for the commonly encountered substantial
skewness of the underlying permutation distribution of 6. Thus the stan-
dardized test statistic given by
T = (,~ - ~ 8 ) / o - 8
is again presumed to follow the Pearson type III distribution and the ap-
proximate P-value is obtained by the previously described approach in Sub-
section 2.1. To obtain T and the P-value for a realized value of 8, the
determination of/zs, cr2 and )'8 is again essential. If
g g
D(i, r;j, s ) = a(i, r;j, s ) - g-* ~', a(i, r;j, s ) - g - 1 E a(i, r;j, s)
i=l j=l
g g
+g-2ZA(i,r;j,s ),
i=l j=l
then/*8, ~r~ and Y~ are conveniently expressed as
1,8= g2 ZZZa(i,r;j,s),
r<s i=l j = l
: g 2 Z Z Z [o{i, j, ,
r<s i=1 j = l
"ya = K3(8)]Ov 3
and
b -3 1
{g~-2 g g ifg=2,
H(g)= ~[D(i,r;j,s)]3 ifg>~3,
r<* i = I j = l
{~61 g g g if b = 2 ,
L(b) = ~' ~' ~ ~_~ D(i, r; j, s)D(i, r; k, t)D(j, s; k, t) ifb~3,
r<s<t i=l j~l k=l
and gr<,<~ denotes the sum over all r, s and t such that 1 ~ r < s < t ~< b.
Efficient computational expressions to obtain/,a, o-~ and ya are described in
detail by Mielke and Iyer (1982). For the special case of these techniques
involving matched pairs (Mielke and Berry, 1982), empirical results indicate
that P-values based on the Pearson type lII distribution are excellent ap-
proximations.
where the corrected total sum of squares is given by SSr = (~=1Ebj = l x 2 ) - SSM,
the block sum of squares is given by S S ~ -- {/=1 b [(Eg=l xiJ)2/g]} - SS~, and
SSM ~ - ( ~ i =~ l ~ , , jb= l X i j ) 2/ b g . Thus F and 8 are equivalent under the null hypo-
826 Paul W. Mielke, Jr.
thesis since SST and SSB are invariant relative to the (g!)b p e r m u t a t i o n s of the
response m e a s u r e m e n t s . (For this and o t h e r cases involving univariate respon-
ses, the response m e a s u r e m e n t subscript is omitted, i.e., Xlo = xij.) Incidentally,
6 is equivalent to C o c h r a n ' s O test statistic if r = 1 and each xij is either 0 or 1.
Let R d e n o t e the ordinary Pearson correlation coefficient. If v = 2, b = 2 and
r = 1, then the functional relation b e t w e e n R and 8 is given by
R = (tz~ - ~)/(2S1S2)
8= Z Ixi- xjl
i<j
Table 4
Estimated power against a location shift of size 0.60- where ~r
is the standard deviation of the distribution specified and
b=20
Mielke and Berry (1982). The power comparisons of 6", 31,1, (~2,1 and 61, 2 in
Table 3 involve (1) a fixed size of b = 80, (2) five origin symmetric distributions
including the Laplace (double exponential), logistic, normal, uniform, and a
U-shaped distribution with density (3y2)/2, - 1 < y < 1, and (3) a location shift
of 0.3o- for the distribution specified. Table 4 differs from Table 3 in that the
fixed size is b = 20 and the location shift is 0.6o- for the distribution specified.
Each power estimate in Table 3 or Table 4 for 6", 61,1, 62,l and 61, 2 (correspond-
ing to each significance level, a, and each distribution) depends on 300
P-values associated with the same collection of 300 independent random
samples of 80 or 20 values, respectively, from the uniform (0, 1) distribution.
Complete details concerning these comparisons are given by Mielke and Berry
(1982). The purpose of these comparisons is to demonstrate that specific
advantages can be gained when classical tests are replaced with tests based on a
Euclidean space (in addition to the geometric appeal stressed in Section 2). The
results of Tables 3 and 4 indicate that 61.1 is a good choice when heavy-tailed
distributions are encountered and that 6zl is a seemingly outstanding choice
when light-tailed (including uniform and U-shaped) distributions are encoun-
tered. Similar comparisons involving two-sample analogs of 61,1 and 81,2 (two-
sample analogs of 8" and 82.1 not included) are given by Mielke et al. (1981b).
Meteorological applications of permutation techniques 829
References
Berry, K. J. (1982). Algorithm AS 179: Enumeration of all permutations of multi-sets with fixed
repetition numbers. Appl. Statist. 31, 169-173.
Bloomfield, P. and Steiger, W. L. (1980). Least absolute deviations curve-fitting. SIAM J. Sci.
Statist. Comput. 1, 290-301.
Brockwell, P. J., Mielke, P. W. and Robinson, J. (1982). On non-normal invariance principles for
multi-response permutation procedures. Austral. J. Statist. 24, 33-41.
Cliff, A. D. and Ord, J. K. (1973). Spatial Autocorrelation. Pion Limited, London, England.
Diaconis, P. and Graham, R. L. (1977). Spearman's footrule as a measure of disarray. J. Roy.
Statist. Soe. Ser. B 39, 262-268.
Friedman, J. H. and Rafsky, L. C. (1979). Multivariate generalizations of the Wald-Wolfowitz and
Smirnov two-sample tests. Ann. Statist. 7, 697-717.
830 Paul W. Mielke, Jr.
Gray, W. M. (1979). Hurricanes: Their formation, structure and likely role in the tropical
circulation. In: D. B. Shaw, ed., Meterology Over the Tropical Oceans. Royal Meteorological
Society, James Glaisher House, Bracknell, England, pp. 155-218.
Harter, H. L. (1969). A new table of percentage points of the Pearson type III distribution.
Technometrics 11, 177-187.
Huber, P. J. (1974). Comment on adaptive robust procedures. J. Amer. Statist. Assoc. 69, 926-927.
Mantel, N. and Valand, R. S. (1970). A technique of nonparametric multivariate analysis.
Biometrics 26, 547-558.
Mielke, P. W. (1972). Asymptotic behavior of two-sample tests based on powers of ranks for
detecting scale and location alternatives. J. Amer. Statist. Assoc. 67, 850-854.
Mielke, P. W. (1974). Squared rank test appropriate to weather modification cross-over design.
Technometrics 16, 13-16.
Mielke, P. W. (1978). Clarification and appropriate inferences for Mantel and Valand's non-
parametric multivariate analysis technique. Biometrics 34, 277-2.82.
Mielke P. W. (1979a). Comment on field experimentation in weather modification. J. Amer. Statist.
Assoc. 74, 87--88.
Mielke P. W. (1979b). On asymptotic non-normality of null distributions of MRPP statistics.
Comrnun. Statist. A 8, 1541-1550. Errata: A 10, 1795; A 11, 847.
Mielke P. W. and Berry, K. J. (1982). An extended class of permutation techniques for matched
pairs. Comrnun. Statistic. A 11, 1197-1207.
Mielke, P. W., Berry, K. J. and Brier, G. W. (1981a). Application of multiresponse permutation
procedures for examining seasonal changes in monthly mean sea-level pressure patterns. Mon.
Wea. Rev. 109, 120-126.
Mielke, P. W., Berry, K. J., Brockwell, P. J. and Williams, J. S. (1981b). A class of nonparametric
tests based on multiresponse permutation procedures. Biometrika 68, 720-724.
Mielke, P. W., Berry, K. J. and Johnson, E. S. (1976). Multi-response permutation procedures for a
priori classifications. Commun. Statist. A 5, 1409--1424.
Mielke, P. W., Berry, K. J. and Medina, J. G. (1982). Climax I and II: Distortion resistant residual
analyses. J. Appl. Meteor. 21, 788-792.
Mielke, P. W., Brier, G. W., Grant, L. O., Mulvey, G. J. and Rosenzweig, P. N. (1981c). A
statistical reanalysis of the replicated Climax I and II wintertime orographic cloud seeding
experiments. J. Appl. Meteor. 20, 643--659.
Mielke, P. W., Grant, L. O. and Chappell, C. F. (1971). An independent replication of the Climax
wintertime orographic cloud seeding experiment. J. Appl. Meteor. 10, 1198--1212. Corrigendum:
15, 801.
Mielke, P. W. and Iyer, H. K. (1982). Permutation techniques for analyzing multiresponse data
from randomized block experiments. Commun. Statist. A 11, 1427-1437.
Mielke, P. W. and Sen, P. K. (1981). On asymptotic non-normal null distributions for locally most
powerful rank test statistics. Cornmun. Statist. A 10, 1079-1094.
O'Reilly, F. J. and Mielke, P. W. (1980). Asymptotic normality of MRPP statistics from invariance
principles of U-statistics. Commun. Statist. A 9, 629-637.
Sen, P. K. (1970). The Hfijek-Rrnyi inequality for sampling from a finite population. Sankhy5 Ser.
A 32, 181-188.
Sen, P. K. (1972). Finite population sampling and weak convergence to a Brownian bridge.
Sankhy5 Ser. A 34, 85-90.
'Whaley, F. S. (1983). The equivalence of three independently derived permutation procedures for
testing the homogeneity of multidimensional samples. Biometrics 39, 741-745.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 "~
*b.J q ~ /
Elsevier Science Publishers (1984) 831-871
1. Introduction
explanatory variables, we shall treat the data as many binomials. The examples
at the end of this chapter illustrate the preceding discussion.
2. Discrimination information
3. MDI estimation
Suppose the statistician initially holds the viewpoint that the appropriate
distribution is rr(w). However, given a judgement that the distribution is a
m e m b e r of a family P of distributions satisfying the linearly independent
m o m e n t constraints
where we use the symbol /2 to represent the space as well as the number of
cells and the rank of the model matrix C is r + 1 ~< O. The matrix relation (2)
may be written as
4. L o g l i n e a r r e p r e s e n t a t i o n
Straightforward application of the calculus yields the fact that the MDI
estimate p*(~o) is
It may be readily determined from (4) that ro ~.vln M(r,, r2,..., %) where
The fact that I(p : ~r) in (t) is a convex function of p(w) insures a unique MDI
estimate p*(~o). The representation (4) is an exponential family which has also
been represented as a multiplicative model. The representation (5) is known as
a loglinear model. Loglinear models are particularly appropriate for the
analysis of contingency tables or more generally, categorical count data.
Extensive bibliographies and applications may be found among others in
Bishop et al. (1975), Fienberg (1977), Gokhale and Kullback (1978a, 1978b),
Haberman (1974), Ku and Kullback (1974), Plackett (1974).
In loglinear models the logarithm of cell estimates is expressed as a linear
combination of main effect and interaction parameters associated with various
characteristics (variables) and their levels. The partial association between a
pair of characteristics for all values of associated variables (covariates) is a sum
834 S. Kullback and J. C. Keegel
and difference of the logarithms of appropriate cell estimates and thus a linear
combination of the parameters. The average value of these partial associations
is also a linear combination of the parameters. Since, as we shall see, it is
possible to determine the covariance matrix of the parameters, the variances of
the partial associations and of the average partial association may be calculated
and confidence intervals determined. We illustrate these ideas in some of the
examples at the end of this chapter.
It may be shown that for any p ~ , that is, satisfying (3) the Pythagorean
type property
statistic. This is however not true for the ECP case, although the MDI
estimates are BAN.
6. Analysis of information
The analysis in (10) is an additive analysis into components which are MDI
statistics with additivity relations for the associated degrees of freedom. The
component
measures the effect of the constraints in x*b(w) which are not included in x*(o~).
In the algorithms used i n the computer programs to determine the MDI
estimate and other associated values the arbitrary distribution ~-(w) in ICP is
usually taken as the uniform distribution. For the ICP case, since measures of
the form 2I(x:X*a) may also be interpreted as measures of the variation
unexplained by the MDI estimate x* the additive relationship (10) leads to the
interpretation of the ratio
2 I ( x ~ : x) = 2 I ( x ~ : x T) + 2 I ( x T : x) (14)
7. Covariance matrices
Sll S12 I
S= &l $22
8. Confidence intervals
Since the asymptotic covariance matrix of the natural parameters, the r~, is
available one can determine asymptotic simultaneous sets of confidence inter-
vals for a set of estimates of the natural parameters, or for their linear
combinations, in the model selected for detailed analysis. We describe joint
confidence intervals based on the Bonferroni inequality (Miller, 1966). With
probability / > 1 - a, joint confidence intervals for k variables are given by
where cri is the standard deviation of the i-th variable and the function
z ( 1 - a / 2 k ) is the lO0(a/2k) percentile of the standard normal distribution, that
is,
l - c~/2k = _~ (1/2~r) m exp(-u2/2) du
Categorical data problems 837
Table 1
k z k z k z
9. Outliers
In the ICP case various procedures have been suggested for measuring the
adequacy of a loglinear model with respect to specific cells in a contingency
table. A cell which does not fit the model is called an O U T L I E R . These
O U T L I E R S may lead one to reject a model which fits the other cells. In other
cases even though a model seems to fit, the O U T L I E R S contribute much m o r e
than reasonable to the measure of deviation between the observed data and the
fitted value of the model.
Many writers have used the difference between the test statistic computed
for some fitted loglinear model and the test statistic using the same model but
ignoring several preselected cells. A separate O U T L I E R computation for each
cell is time consuming. W e now indicate a quick and easy approximation using
M D I techniques (Ireland, 1972). The output of various computer programs we
use for estimating models includes a listing for each cell called O U T L I E R . The
value of O U T L I E R given for each cell is a lower bound for the decrease in the
corresponding 2I(x :x*) if that cell were not included in the estimation pro-
cedure. Large values of O U T L I E R are those which are at least as large as very
significant chi-square values for one degree of freedom. The basis for the
O U T L I E R interpretation follows. Let x* denote the M D I estimate subject to
certain internal constraints. Let x ; denote the M D I estimate subject to the
same internal constraints as x* except that the value x(wl), say, is not included,
so that x;(~ol)= x(~01). The basic additive property of the M D I statistics is (10)
or
Using the convexity property of the information function it may be shown that
838 S. KuUback and J. C. Keegel
The last value can be computed and is listed as the O U T L I E R entry for each
cell of the complete computer output for the M D I estimate x*. The ratio as in
(12) may also be used to indicate the percentage of the unexplained variation
due to the O U T L I E R cell (Gokhale and Kullback, 1978a, 1978b).
Note that if x(wl)= 0 the O U T L I E R value for that cell as computed from
the right side of (17) is 2Nln(1/(1-X*a(O)l)/N)) and for large N is ap-
proximately 2X*a(O)l). W e have Table 2 to help interpretation.
Table 2
- 2 In X*a(Wl)+ 2X*(t01).
Table 3
Poisson probability
Table 4
Analysis of information
E a r l i e r w e h a d s t a t e d t h a t d e p e n d i n g on t h e design of t h e d a t a collection
p r o c e c u r e we shall l o o k u p o n t h e cross-classification of t h e c o u n t d a t a as a
840 S. Kullback and J. C. Keegel
z 0 -.. 0
W2 ... 0
W=
(20)
0 ... Wk
N o t e that a P ( w ) = 1, Z n H ( o ) ) = 1, I ( p : It) = Z a P ( w ) l n ( P ( w ) / H ( o ) ) ) =
I ( P :H), B p = B W - 1 W p = C P = 0. In terms of the canonical t r a n s f o r m a t i o n
Categorical data problems 841
12. M D I estimate
T h e h ' s are scaling constants and the z's are the exponential or natural
p a r a m e t e r s of interest. Applications of the k-sample p r o c e d u r e will be f o u n d in
the examples at the end of this chapter. W e r e m a r k that one usually takes the
weights wi -- N~/N, where the sum of the observations in the i-th sample is N/
842 S. Kullback and J. C. Keegel
and N = Na + N 2 + " " + Nk, although the analyst may select any values con-
sistent with his view of the problem.
13. Marginais
The marginais of contingency tables have long played a critical role in their
analysis. In the case of smoothing or fitting the aim of the analysis usually is to
get a good fit to the observed table using a minimal or parsimonious number of
natural parameters in the loglinear model depending only on some of the
observed marginals. This shows how much of the total information is contained
in a summary consisting only of sets of marginals. The observed distribution
can then be said to be explainable in terms of a smaller number of linear
functions of cell frequencies. The observed distribution, the full or complete
model requires a maximum number of parameters for its description, whereas
the uniform distribution requires the least. When sets of marginals are fitted
the values of the ci(to) in (8) are either O's or l's since the sum Ea ci(to)x(w) is a
marginal value if and only if ci(w) is a 1 for every cell that enters into the
marginal and is a 0 otherwise.
14. Notation
To relate the tau or natural parameters with the associated marginal value
notationally and avoid a possible need for record keeping, we shall use
superscripts and subscripts on the taus. The superscripts indicate the variables
or factors involved and the subscripts refer to the levels or categories of the
AB will correspond to the two-way marginal for variables A
vhriables. Thus ~'11
and B each at level 1, "J2a3-BcDwill correspond to the three-way marginal for
variables B, C,/9, respectively a t level 2, level 1, and level 3. To insure linear
independence in the C-matrix we follow the convention of setting every tau
with any subscript equal to the last level or category value as zero. Any level
could have been selected for the reference value. Other parameterizations use
one similar to the analysis of variance in which sums of the parameters over all
levels are set equal to zero.
When the moment constraints in (8) involve only sets of marginals the
Deming-Stephan iterative proportional fitting algorithm may be used to
determine the MDI estimate x*(w) satisfying (8) and then the natural parameters
are determined from the ioglinear representation. The proportional fitting
algorithm may be described as successively cycling through adjustments of the
marginals of interest starting with the marginals of the ~-(t0) distribution until a
Categorical data problems 843
tO 1 2 3 4 5 6
A 1 1 2 2 3 3
B 1 2 1 2 1 2
1 1 1 1 1 1
1 1 0 0 0 0
0 0 1 1 0 0
C= (28)
1 0 1 0 1 0
1 0 0 0 0 0
0 0 1 0 0 0
because this output includes the values of the taus for the model and their
covariance matrix. One can then infer whether certain individual taus are not
significantly different from zero and get a new estimate using this fact. We may
therefore determine a final model that provides an acceptable estimate with a
simpler structure and fewer parameters. Such models may be nonhierarchical
in structure. Walls and Weeks (1969) have given proofs that the variance of
least square estimates of regression coefficients increases when variables are
added to a regression equation. Although a similar result has not been proven
for loglinear models, this property has been observed in enough cases to lead
one to believe it to be so. Also Bishop et al. (1975, p. 313) state: " A model with
fewer parameters may improve the precision of the estimates. Suppose we have
two models for predicting the cell frequencies in a table of counts, both of
which are compatible with the observed data, one model being a special case of
and having fewer parameters than the other. Then the overall variability of the
estimates from the simpler model about the 'true' values for the cells is smaller
than the 'overall variability' for the model with more parameters requiring
estimation. We have no general proof of this theorem, although it is specifically
true for many regression problems and we believe it is true in a rather general
way". The model building philosophy in the examples at the end of this chapter
is in accordance with the preceding empirical theorems.
We quote some of the negative comments about nonhierarchical loglinear
models because we believe that our proposed approach and methodology and
the examples tend to refute these comments. "In larger tables with non-
hierarchical structure a possible strategy is to partition and look at smaller
sections of data . . . . We can either consider the structure of each set of tables
separately or rearrange the cells to form new compound variables" (Bishop et
al., 1975, p. 38). "As in conventional analysis of variance, interpretation of
parameters in non-hierarchical models appears difficult; consequently, the
usefulness of non-hierarchical models is not clear" (Haberman, 1974, p. 200). "It
is possible to consider fitting nonhierarchical models to data, but we cannot
then compute the estimated expected values directly via our iterative propor-
tional fitting procedure. Rather, we need to transform the table, interchanging
cells, so that the non-hierarchical model for the original table becomes a
hierarchical model for the transformed table" (Fienberg, 1977, p. 39).
The examples introduce some additional concepts related to the previous
discussion but in our opinion such that a meaningful presentation is more easily
accomplished in terms of specific data rather than theory.
This example considers the MDI analysis of a case-control study of two types
of exposure. The count data were cross-classified into a 3 x 2 2 2 con-
tingency table. The log-odds or logit rePresentation for the observed data is
indicated. A nonhierarchical model is derived and its interpretation indicated.
Categorical data problems 847
The data are from a study made by Professor Julius Schachter and used as an
illustration by Heilbron (1981) for an analysis of ratios of odds ratios. The study
relates to the possible role of the herpes simplex virus Type 2 (HSV-2), and of
another common sexually-transmitted agent, Chlamydia trachomatis, in the
etiology of cervical dysplasia in women. Exposure to HSV-2 is here taken to be
indicated by
x(hill)x(hi22)/x(hi12)x(hi21) f o r h = 1, 2, 3, i = 1, 2 .
We shall treat the data as 12 samples of the binomial variable Group over
the twelve combinations of the explanatory variables Partners x T i t e r x
H.Ratio. The log-odds or logit representation for the observed data (complete
Table 5
Observed and estimated data
Characteristic Index I 2 3
model) is
Table 6a
Analysis of information
Table 6b
Analysis of information
Table 7
Parameters of x~
Parameters of x*
nonhierarchical model x* fitting the constraints x(hij .), x(... 1), x(. 1-1),
x(1 11). The log-odds or logit representation for the model x* is
Using the Newton-Raphson type algorithm there were obtained the values
of x*(hijk) listed in Table 5, along with the cross-product ratios or odds ratios
x*(hill)x*(hi22)/x*(hi12)x*(hi21) for h -- 1, 2, 3, i = 1, 2.
The tau parameters in the log-odds or logit representation (3) are listed in
Table 7, and their covariance matrix is listed in Table 8.
Table 8
Covariance matrix, parameters of x*
7111
0.016489 -0.015695 -0.005370
-0.015695 0.021743 0.000291
-0.005370 0.000291 0.034339
Table 9
Odds factors, x*
j=l j=2
there is no difference among Partners for values four or more. Other things
being equal the odds of Dysplasia to Control are 1.75 to 1 for P a r t n e r s x
H.Ratio (1 - 3) x (t>0.85) as compared to any other combination.
We may use (3) to get
by level and location in the handling of discipline problems. The sample size
upon which the percentages were based was also given.
We shall present our statistical analysis as an application of the principle of
minimum discrimination information estimation (MDIE).
Accordingly the data in Table B-6.6 were converted to the form given as our
Table 1. W e indicate the n u m b e r of YES responses to 'very much' support and
the n u m b e r of N O responses.
Since the joint responses to the question of support from the community
sources are not available, we must examine the data in Table 10 as five
contingency tables, one for each of the community sources.
852 S. Kullback and J. C. Keegel
~m
0
U9
O_o t ~-
n~
~0~-~ ~ r~ un r-- -~ r~
~D
~I~ ~ ~ ~ ~
~J
~I ~I ~ ~I I~ ~I ~
.=.
oo
0
. . . . ~ ~ ~ I ~ ~ ~I~
o
rT~
04
~ ~o ~i~ ~
0
0
o
Categorical data problems 853
The different values for the Level x Location total for the different com-
munity sources are a consequence of missing reports. In most cases the
differences are small and did not affect the analysis. H o w e v e r the total 352 for
Elementary x Suburban area for L O C A L C O U R T S implies about 25%
((464-352)/464) not reporting. This seems to have affected the analysis for
L O C A L C O U R T S as compared with other community sources. We shall
c o m m e n t on this later.
For each of the community sources we denote the observed occurrences in
Table 10 by x(ijk).
To eliminate some of the ' r a n d o m noise' from the original observations, and
obtain a simple structural model relating the response, Support, with the
explanatory variables, Level, Location, we fitted a loglinear model to the data
for each of the community sources by M D I E . The models are derived by fitting
certain marginals or combinations of marginals, that is, the estimates are
constrained to have some set of marginals or combination of marginals equal to
those of the original observed values for each community source. The model is
not necessarily the same in detail for each community source. We present later
in Section 21 details about the fitting procedure but at this point we shall
consider some of the implications of the models.
We denote the estimated occurrences by x*(ijk) and list the estimated values
for each community source in Table 11.
The loglinear model can be reformulated as a multiplicative model and the
odds (YES/NO) expressed as a product of three factors: a base value factor
relative to each community source, a factor depending on Level, and a factor
depending on Location. For L O C A L C O U R T S there is also a factor for
E L E M x S U B U R B S interaction. The magnitudes of the factors are an in-
dication of the relative importance of the various effects and interactions. The
odds can also be obtained as the ratio of Y E S / N O in Table 11 but the gross
odds give no indication of the relative importance of the c o m p o n e n t effects.
The odds factors for each community source are given in Table 12.
W e calculate from Table 12 that the odds (YES/NO) for community source,
P A R E N T S , for E L E M E N T A R Y , S M A L L C I T I E S is the product 0.5886 x
1.5878 x 1.4993 = 1.4012. From Table 11 we see that the corresponding ratio is
144.131/102.869=1.4011. The odds (YES/NO) for community source,
S C H O O L SYSTEM C E N T R A L O F F I C E , for J U N I O R H I G H , R U R A L is
the product 2.8867 x 1.0000 1.0000 = 2.8867. From Table 11 we see that the
corresponding ratio is 265.149/91.851 = 2.8867. The odds (YES/NO) for com-
munity source, L O C A L C O U R T S , for E L E M E N T A R Y , S U B U R B S is the
product 0.2204 x 1.0000 x 0.6659 x 1.6019 = 0.2351. From Table 11 we see that
the corresponding ratio is 67.000/285.000 = 0.2351.
For community source P A R E N T S we note by examining the odds factors that
the best odds for support are for E L E M E N T A R Y , S U B U R B . The poorest
odds are the same for S E N I O R H I G H , L A R G E C I T I E S and S E N I O R H I G H ,
RURAL.
For community source L O C A L P O L I C E we note by examining the odds
854 S. Kullback and J. C. Keegel
Table 11
Estimated values based on appropriate loglinear models
Table 11 (continued)
SENIOR HIGH
Table 12
Odds Factors
PARENTS
0.5886 1. ELEM 1.5878 1. LG. CITIES 1.0000
2. JR. HI 1.2572 2. SM.CITIES 1.4993
3. SR. HI 1.0000 3. SUBURBS 1.6341
4. RURAL 1.0000
L O C A L POLICE
0.8624 1. ELEM. 0.5741 1. LG. CITIES 0.8042
2. JR. HI 0.8228 2. SM. CITIES 1.2944
3. SR. HI 1.0000 3. SUBURBS 1.4351
4. RURAL 1.0000
L O C A L COURTS
0.2204 1. ELEM. 1.0000 1. LG. CITIES 0.3961
2. JR. HI 1.0000 2. SM.CITIES 0.6659
3. SR. HI 1.0000 3. SUBURBS 0.6659
4. RURAL 1.000
J
1. LG. CITIES 2. SM. CITIES 3. SUBURBS 4. RURAL
SCHOOL B O A R D
2.2234 1. ELEM 0.7450 1. LG. CITIES 0.1771
2. JR. HI 1.0000 2. SM. CITIES 0.5126
3. SR. HI 1.0000 3. SUBURBS 0.8282
4. RURAL 1.0000
SCHOOL S Y S T E M C E N T R A L OFFICE
2.8867 1. ELEM 0.7976 1. LG. CITIES 0.1863
2. JR. HI 1.0000 2. SM. CITIES 0.5261
3. SR. HI 1.0000 3. SUBURBS 0.7331
4. RURAL 1.0000
not significantly different from zero or from each other. For the community
source LOCAL COURTS second-order interaction also seems to be present.
W e s u m m a r i z e t h e s e r e s u l t s in T a b l e 13. ( N o t e t h a t w e h a v e u s e d t h e i n d i c e s as
superscripts to represent variables.)
T h e e s t i m a t e s l i s t e d in T a b l e 11 a n d t h e v a l u e s in T a b l e 12 w e r e o b t a i n e d b y
Categorical data problems 857
Table 13
SCHOOL SYSTEM
CENTRAL OFFICE jk _ --0.056957
T21 --
--0.7180
Table 14
LOCAL COURTS
C o m p o n e n t due to Information D.F.
x(ij .), (x. 11), x(- 21) + x(- 31) 2I(x : x~) = 20.892 9
2I(x*:x~) = 0.094 1
x(ij .), x(.jk) 2I(x :x*~) = 20.798 8
2 I ( x * :x~) = 8.939 1
x(ij "), x(" 11), x(" 2 1 ) + x(-31), x(131) 2I(x :x*) = 11.953 8
rerunning the data with the tau parameters as above set equal to zero, or to
each other. For L O C A L C O U R T S the constraint x*(131)= x(131) was also
used.
We give in Table 14 the Analysis of Information values comparing the x*
models with the final x* models all of which fit their respective data sets well.
The data set for this example originates from a study on the incidence of
coronary heart disease (CHD) done by the Medical Bureau for Occupational
Diseases in Johannesburg. We are grateful to the director of the bureau, Dr. F.
J. Wiles for permission to use the data, and to Dr. T. J. Hastie and Dr. June
Juritz for making the data available to us.
I. T h e data
In a sample of 2012 miners studied by the Medical Bureau for Occupational
Diseases of the Chamber of Mines, there were 108 who suffered from coronary
heart disease (CHD). The problem was to relate the occurrence of coronary
heart disease on the factors serum cholesterol, systolic blood pressure, and
smoking habits. The cross-classification of the observed data is represented by
a 3 x 3 x 3 x 2 contingency table x ( j k l m ) where the values of the cell occur-
rences x ( j k l m ) are given in lexicographic order in Table 15. Note that the ages
of the miners was not furnished with the data.
2. T h e analysis
In order to obtain a first overview of the possible relationship of the
dependent variable Coronary Heart Disease (D) on the explanatory variables
Serum Cholesterol (C), Systolic Blood Pressure (P), Smoking Habits (H), a
sequence of nested marginals was fitted using the Deming-Stephan algorithm or
iterative proportional fitting procedure (Gokhale and Kullback, 1978a, pp. 214-
216; Ku and Kullback, 1974, p. 116). Summary results for the initial model X*a
and the set of marginals selected as a potential final model x ; are shown in the
Analysis of Information Table 16.
The log-odds (logit) representation for the estimate x ; is
Table 15
Observed and estimated occurrences
Characteristic Index 1 2 3
Table 16
Analysis of information
of the characteristics. These odds did not show the monotonic behavior one
w o u l d e x p e c t . A n e x a m i n a t i o n o f t h e s t a n d a r d i z e d v a l u e s i n T a b l e 17 s e e m e d
to imply that only four of the parameters were significantly different from zero,
t h a t is, ~'~, ~'ucD, "121-CPD'~'11/4D"T h e n o n h i e r a r c h i c a l p a r s i m o n i o u s m o d e l x * w a s
o b t a i n e d b y f i t t i n g t h e m a r g i n a l s X'c: x ( j k l . ) , x(...1), x(1..1), x(12.1),
860 S. Kullback and J. C. Keegel
Table 17
Parameters of x~
x ( " 11). The estimate X*c has the log-odds (logit) representation
ln(x*(jkla)/x*(jkl2)) : r~ + rco + T12
_cPD.
~ ~- r ~ D. (2)
W e consider the data as 27 binomials of the binary variable C H D (D). To use
the N e w t o n - R a p h s o n type k-samples iterative algorithm (Gokhale and Kull-
back, 1978a, pp. 199-205, 211-212, 245) the appropriate 31 x 54 B-design-matrix
is set up as follows (cf. Gokhale and Kullback, 1978b, 1002):
(a) The 54 columns correspond respectively to the 54 cells in lexicographic
order as in Table 15;
(b) Rows 1 to 27 each contain two ones, one each respectively in the
columns corresponding to the cells jkll and jkl2 and zeros elsewhere for
j = 1, 2, 3, k = 1, 2, 3, l = 1, 2, 3, in lexicographic order;
(c) Row 28 has a one in every column in which m = 1 and zeros elsewhere;
(d) Row 29 has a one in every column in which j = 1 and m = 1, and zeros
elsewhere;
(e) R o w 30 has a one in every column in which j = 1 and k = 2 and m = 1,
and zeros elsewhere;
(f) Row 31 has a one in every column in which l = 1 and m = 1, and zeros
elsewhere. The first 27 rows of the matrix correspond to the constraints
x*(jkl.)= x(jkl.), and the next four rows of the matrix correspond to the
respecitve constraints
x*(.-.1)=x(.--1), ~; x*(1..1)=x(1..a), ~;
x * ( 1 2 - 1 ) = x(12.1), _CeD. HD
X'c('" 11)= X('" 11), ~'11
'~121 ~
The values of the M D I estimate x* are also listed in Table 15. In accordance
with the entries in Analysis of Information Table 16 we note that 2I(x : x * ) =
29.126 with 23 D.F. implies that x* is a good fit to the observed data (the 0.1
significance level of the tabulated chi-square for 23 D.F. is 32.0). The values of
the parameters for x*~ are listed in Table 18 and the covariance matrix of these
parameters is given in Table 19.
Categorical data problems 861
T a b l e 18
P a r a m e t e r s of x*
T a b l e 19
C o v a r i a n c e m a t r i x x~
D CD CPD HD
T1 Tll TI21 ~11
(1) (2) (3) (4)
T a b l e 20
Odds factors x*(jkll)/x*(jkl2)
T a b l e 21
O D D S x~(jkll)/x*(jkl2) N o C H D / C H D
SYSTOLIC B L O O D P R E S S U R E
1. Introduction
In an illuminating paper Fuchs (1979) presented an example of real data in
which biased inferences about the relationship between two variables, while
controlling for the effect of one or several covariables, may result from the use
of insufficient covariables in the analysis. Fuchs (1979) notes: " O n e of the basic
assumptions in testing the average partial association is that the investigator is
aware of the relevant covariables that may influence the response profiles of
the dependent variable within the subpopulations". Fuchs (1979) cites tests
proposed by Cochran (1954), Mantel and Haenszel (1959), Hopkins and Gross
(1971), Sugiura and O t a k e (1974), and Landis, H e y m a n and Koch (1977) for
testing the average partial association between the dependent variable and the
subpopulations.
Fuchs (1979) computed average partial associations by collapsing the original
table over various combinations of the covariables. H e noted discrepancies in
the assessments of the average partial association when different covariables
are used. We shall use the same data as Fuchs (1979) and find a suitable
loglinear model fitting the data. We shall compute the average partial asso-
ciation using the fitted model.
2. The data
As indicated by Fuchs (1979) two groups of drivers (D) are compared in the
analysis. One group is a simple random sample of the entire population of
drivers in Wisconsin (Control). The other group includes drivers with known
cardiovascular deficiencies (Cardiovascular) and was obtained by subdividing,
according to the type of existing condition, a simple random sample of drivers
with several medical conditions. The dependent variable is the number of
traffic violations (V) within a one-year period (1974). For each driver, the
available data also include information on the age interval (A), sex (S), and
place of residence (R). The cross-classification of the observed data is
represented by a 2 x 5 x 3 x 2 x 2 contingency table x(ghiflc). The values of the
cell occurrences x(ghijk ) are given in Table 22. In this analysis it is important to
assess which group of drivers has a better record.
Categorical data problems 863
Table 22
Observed and estimated occurrences
1 2 3 4 5
3. The analysis
In order to obtain a first overview of the possible relationship of the
dependent variable Violations (V) on the explanatory variables Driver Group
(D), Residence (R), Age (A), Sex (S), a sequence of nested marginals was fitted
using the Deming-Stephan algorithm or iterative proportional fitting procedure
(Gokhale and Kullback, 1978a, pp. 214-216; Ku and Kullback, 1974, p. 116).
The fitting constrains the estimated tables denoted by X'a, X~, etc., to have the
same marginal values, for the fitted set, as does the observed table. Summary
results for the first six sets of fitted marginals are shown in the Analysis of
Information Table 23. The information numbers for the subsequent sets of
fitted marginals are not included since they implied no further significant
interactions. We remark that information numbers of the form
2I(x :x*) = 2 ~, x(ghijk) ln(x(ghijk)/x*a(ghijk))
and
2I(x~ :Xa*) = 2 ~ x~(ghijk) ln(x~(ghijk)/x*(ghijk))
Table 23
Analysis of information
Information D.F.
for this case are the same as log-likelihood-ratio statistics. Also the M D I
estimate x * ( g h i j k ) has the explicit representation in terms of the marginals as
x * ( g h i j k ) = x ( g h i ] . ) x ( . . . . k ) / N , N = x ( . . . . . ) (Bishop et al., 1975; Gokhale
and Kullback, 1978a, 1978b). In Table 23 the value 2 I ( x : x * ) = 190.211, 59 D.F.,
implies that the binary variable Violations (V) is not h o m o g e n e o u s over the 60
binomials of the combinations of the explanatory variables (Kullback, 1959,
Chapter 8) and we seek a model to account for the behavior. The conditional
effects in Table 23, last part, suggest that main effects (two-factor interactions) of
Driver G r o u p (D), Age ( A ) , Sex (S), on Violations (17) are significant and that the
interaction Driver G r o u p x R e s i d e n c e on Violations (three-factor interactions)is
significant. Note that the results suggest no Residence (R) main effect but a Driver
G r o u p Residence interaction.
To obtain the M D I estimates of the nonhierarchical parsimonious loglinear
model implied by the preceding analysis, its parameters, and their covariance
matrix, we consider the data as 60 binomials of the binary variable Violations
(V). If we denote the M D I estimate satisfying the suggested constraints by
x * ( g h i j k ) , then we have the log-odds (logit) representation for x * : x ( g h i j .),
x ( g " " k ), x ( " i " k ), x(" " j k ), x ( g h " k ),
ln(x*(ghijl)/x*(ghij2)) = r v + r Dv
gl - TAV + rSlV + "1"ghDRV1 (1)
Ty; T DV
gl
for g = 1; r AV for i = 1, 2;
flh = l n ( x * ( l h i j l ) / x * ( l h i j 2 ) ) - In(x*(2hijl)/x*(2hij2))
(2)
. x*(lhijl)x*(2hij2) DV DRY
=In x . ( l h i j 2 ) x . ( 2 h i j l ) = r u +rlh 1 , h =1, 2, 3, 4, 5.
We recall that parameters with subscript g = 2 are zero. Note that for the
model x* the partial association between Driver G r o u p (D) and Violations (V)
is related to the Residence (R) categories but not with Age (A) or Sex (S).
Using the values of the parameters in Table 24 we compute the values of the
partial associations. Their variances may be obtained from Table 25, using the
fact that
Var(flh) = V a r ( r ~ v) + V a r ( r 1~ v) + 2 C o v ( r ~ v, ~lhl
_DRY,)" (3)
It is found that
Categorical data problems 867
I [
II
II N II II
tt~
II
%
I
II II il II
I
t)
868 S. Kullback and J. C. Keegel
Using the Bonferroni inequality (Miller, 1966, p. 8) we can obtain joint 95%
confidence intervals for the/3's in (4) as
From the results in (6) we may infer that the Control Group has a better
driving record in Residence category Urban 1 but that there is no significant
difference in driving records for the other Residence categories. The average
partial association between Driver Group and Violations is
8 = ( 8 1 -}- /32 -1- 8 3 -1- /34 -}- /35)/5 = T lDl V "}" z~TllD R1V -~-- "i121
-DRV .
n- --DRV
7131 ..~ DRV
'7"141 )/5 .
(7)
Using the values of the/3's in (4) we compute/3 = -0.212450. The variance of/3
is given by
4 4
Var(fl) = Var(z Dr) + ~'~ Var(zg] v) + 2 ~, COv(TDV, _DRV~
"~1hl )
h=l h=l
4 4
+ 2~ Z E CV0"lm
Rv, "rDRV~
lnl 2,
m ~ n (8)
m = l n=l
Table 26
Odds factors
Acknowledgements
W e are grateful to Dr. Carlyle E. Maw for his interest, support, and making
available data tapes provided in conjunction with Violent S c h o o l s - S a f e
Schools, The Safe School Study R e p o r t to the Congress, U.S. D e p a r t m e n t of
H E W , National Institute of Education. Some of the examples are based on
these data tapes. The work for Section 19 was supported under grant NIE-9-76-
0091 which we gratefully acknowledge.
Computations for the examples were performed using programs on file at the
C o m p u t e r Center of The G e o r g e Washington University.
Information regarding the procurement of these computer programs and
appropriate instructions may be obtained by writing to Dr. J. C. Keegel,
D e p a r t m e n t of Statistics, The G e o r g e Washington University, Washington
D.C. 20052, U.S.A.
References
Agresti, A. (1977). Considerations in measuring partial association for ordinal categorical data. J.
Amer. Statist, Assoc. 72 (357), 37-45.
870 S. Kullback and J. C. Keegel
Bishop, Y. M. M., Fienberg, S. E. and Holland, P. W. (1975). Discrete Multivariate Analysis. MIT
Press, Cambridge, MA.
Cochran, W. G. (1954). Some methods for strenghtening the common X2 test. Biometrics 10,
417-451.
Fienberg, S. E. (1977). The Analysis of Cross-Classified Categorical Data. MIT Press, Cambridge,
MA.
Fuchs, C. (1979). Possible biased inferences in tests for average partial association. The American
Statistician 33, 120-126.
Gokhale, D. V. (1973). Approximating discrete distributions with applications. J. Amer. Statist.
Assoc. 68, (344), 1009-1012.
Gokhale, D. V. and Kullback, S. (1978a). The Information in Contingency Tables. Marcel Dekker
Inc., New York.
Gokhale, D. V. and Kullback, S. (1978b). The minimum discrimination information approach in
analyzing categorical data. Communications in Statistics - Theory andMethods A7, 987-1005.
Goodman, L. A. (1970). The multivariate analysis of qualitative data: Interaction among multiple
classifications, J. Amer. Statist. Assoc., 65, 226-256.
Haberman, S. J. (1974). The Analysis of Frequency Data. University of Chicago Press, Chicago, IL.
Heilbron, D. C. (1981). The analysis of ratios of odds ratios in stratified contingency tables.
Biometrics 37, 55-66.
Hopkins, C. E. and Gross, A. J. (1971). A generalization of Cochran's procedure for the combining
of r x c contingency tables. Statistica Neerlandica 25, 57-62.
Ireland, C. T. (1972). Sequential cell deletion in contingency tables. Statistics Department, The
George Washington University.
Ireland, C. T. and Kullback, S. (1968). Contingency tables with given marginals. Biometrika 55,
179-188.
Johnson, R. W. (1979). Axiomatic characterization of the directed divergences and their linear
combinations. I E E E Trans. Inf. Theory 25 (6), 709-716.
Keegel, J. C. (1981). Iterative fitting procedures for minimum discrimination information estima-
tion in contingency table analysis. In: G. A. Fleischer, ed., Contingency Table Analysis
for Road Safety Studies. Sijtboff & Noordhoff, NATO Advanced Study Institute Series, pp.
107-117.
Kotz, S. and Johnson, N. L. (1983), editors. Encyclopedia of Statistical Sciences, Vol. IV. Article
"Kullback Information". Wiley, New York, pp. 421-425.
Ku, H. H. and Kullback, S. (1974). Loglinear models in contingency table analysis. The American
Statistician 28, 115-122.
Kullback, S. (1959). Information Theory and Statistics. Wiley, New York, 1968; Dover, Inc., New
York, 1978; Peter Smith Publisher, Inc., Magnolia, MA.
Kullback, S. (1983). Kullback Information. In: S. Kotz and N. L. Johnson, eds., Encyclopedia of
Statistical Sciences, Vol. IV. Wiley, New York, pp. 421-425.
Landis, J. R., Heyman, E. R. and Koch, G. G. (1978). Average partial association in three-way
contingency tables: A review and discussion of alternative tests. International Statistical Review
46, 237-254.
Larntz, K. (1978). Small-sample comparisons of exact levels for chi-squared goodness-of-fit
statistics. J. Amer. Statist. Assn., 73, 253-263.
Mantel, N. and Haenszel, W. M. (1959). Statistical aspects of the analyses of data from retrospec-
tive studies of disease. Journal of the National Cancer Institute 22, 719-748.
Miller, R. G., Jr. (1966)+ Simultaneous Statistical Inference. McGraw-Hill, New York, p. 8.
Plackett, R. L. (1974). The Analysis of Categorical Data, Griffins Statist. Monographs and Courses,
No. 35.
Rao, C. R. (1965). Linear Statistical Inference and Its Applications. Wiley, New York, 2nd Edition,
1973.
Shore, J. E. and Johnson, R. W. (1980). Axiomatic derivation of the principle of maximum
entropy and the principle of minimum cross-entropy. IEEE Trans. Inf. Theory, 26 (1), 26-37.
Categorical data problems 871
Smith, K. W. (1976). Marginal standardization and table shrinking: Aids inthe traditional analysis
of contingency tables. Social Forces 54, 669-693.
Sugiura, N. and Otake, M. (1974). An extension of the Mantel-Haenszel procedure to k 2 c
contingency tables and the relation to the logit model. Communications in Statistics 3, 829-842.
Walls and Weeks (1969). The American Statistician 23, 24-26.
Worcester, J. (1971). The relative odds in the 23-contingency table. American Journal of Epide-
miology 93 (3), 145-149.
874 P. R. Krishnaiah and P. K. Sen
with density
e x p ( - x ) x -1
g(x;O)= I"(o) , x~O. (2.1)
Also, let Xk,n denote k-th order statistic when the xi's are arranged in ascending
order, that is, xL, <~ <~Xk,n. T h e r-th moment of Xk,n is given by
n~
~b(k, n, r, O, v) = r(O)(k - 1)!(n - k)! {G(v; 0)} k-~
G(v, O) = e x p ( - v ) v ~ vJ (2.5)
= (O-+j)!"
Table 1
Moments of gamma o r d e r s t a t i s t i c s ; 0 = 0.5
,|
Table 1 (continued)
0 = 2.5
Table 1 (continued)
0 = 3.5
Table 1 (continued)
0 =4.5
t s
. k ,~,(k,r,) ~(k,n) /~{k,n) pt(k,n)
Table 1 (continued)
0 = 5.5
Table 1 (continued)
0 = 6.5
Table 1 (continued)
0 = 7.5
Table 1 (continued)
0= 8.5
Table 1 (continued)
O= 9.5
i i
Table 1 (continued)
0 = 10.5
n k /,t(k,-I
Table la
Moments of order statistics from exponential population
The entries in Table 1 are reproduced from Breiter and Krishnaiah (1968)
whereas the entries in Table la are reproduced from Gupta (1960).
Let Xl, x2 I n be a sample from a normal population with mean zero and
. . . . .
Variance one. Also, let x0) >/ ~> x(, ) be order statistics from the above popu-
lation when the observations are arranged in descending order of magnitude.
The r-th moment of Xck) is given by
, n) = (k - 1)!(n
/z,(k, n! - k)! i _
~ x[1 - F(x)]k-ltF(x)l"-kf(x) dx. (3.1)
Table 2
Expected values of order statistics from normal populations
2 3 4 6 6 ~ 8 9
10 II 12 13 14 13 16 17 18 19
1 i'53875 1"58644 1"62923 1'56799 1'7033~ 1"73591 1'70509 1'70304 1'92008 1'8444t
2 !'00130 1'00192 1'11573 1"16408 1'20790 1"24704 1'28474 1'31873 1'35041 1'87904
3 0"~5606 0'72084 0.792~4 0'84983 0"90113 0-94769 0'00027 1"02940 1-00573 1"01~4~
.37578 '4519~ .53684 -60285 -66176 .71488 .76317 0"80738 0.94813 0"886~
II "13207 .22481; .312'~5 "88S33 -456~7 'b1670 '67001 '01946 ,067g "70001
21 22 23 24 ~5 2b 27 28 20
1 1.86748 1.88917 1.90069 1.82916 1.94707 1.96531 1.98210 1.99827 8.01871 g.02982
1.40760 1"43202 1-45~16 1.48137 1.~0338 1.~2430 1'64423 1"66326 1"58146 1'69~8
1.13095 1.16047 1'18824 1.21445 1'23924 1'20276 1"28511 1,30041 1,32874 |'34610
0.92098 0'95380 0,08469 1.01300 1'04091 1.00079 1'09135 1'11471 1'1209~ 1"16088
74638 "7fl150 "81527 0"84087 0'87082 0'90501 0"93171 0'06706 0"98116 1'00416
6 0'59030 0'62982 0.66067 0.70115 0-73354 0'76405 0"70289 0"82021 0.34618 0-87084
7 "44833 '46148 -53157 '56898 -80399 .63600 '66794 "00727 .72501 "76150
8 "31493 '36203 .40559 .44609 '40391 .51936 '55267 '68411 '61386 "64206
9 "18690 "23841 '23570 "32985 "37047 .40860 "44430 "47801 ,60977 "63983
lO " O d ~ O 0 '11880 '10907 "21766 "20183 "30288 '84106 "37706 .410t~ '4~1[
~O 31 32 33 34 35 36 37 38 39
1 9'04376 2'05646 2'06967 2"08241 2'09471 2"10061 2'11812 2"12928 2"14000 B'16069
] 1.61660 1,63100 1.84712 1.68200 1.67636 1-69023 1-70303 1.71060 1-7301i 1"74131
3 1'38481 1'38208 !'39983 1.41037 1'43228 1.44762 1-46244 1-47070 1"49061 1"~409
1'17865 1'19803 1'21072 1.23468 1"2fi190 1.20860 1'28406 1"80010 1'31614 1"83~64
$ 1,02609 1.04709 1.00721 1,0~868 1'10609 1.12806 1'14010 1'16877 1'172~$0 1'1~
6 0'99439 0.01688 0"93941 0"95906 0"97886 0'90790 1.01624 1'03290 1'06096 1"0074]
7 "77660 .80006 "82369 '84655 -86660 .88881 0"90626 0"92498 0'94200 0"91104!
8 " 0 6 8 8 f i .80438 '71876 -74204 "76435 "78574 '80629 '82606 '84508 .a634~
9 "66834 "69546 "02129 "84606 '00964 -69214 .71382 "73406 "75408 .77891
I0 "47829 "60200 '62843 "666G2 '68043 "00627 "02710 "6490~ "07009 .690~
11 0'$8226 0"41287 0"44185 0'46042 0"49572 0'52084 0.54488 0'66793 0"50005 0.01131
12 "20440 '32986 '35756 "38060 '41444 "44091 "46520 '48042 "61868 .6369:
13 .2,0806 .24322 .27573 .$0664 "33582 .36371 .34M132 "41578 .44013 .4@34~
14 "12473 "18128 "19572 "22832 "25924 '28863 "31608 "34830 '80892 .8934~
I& '04~148 "080~7 "11096 "16147 '18416 '21616 '3~ '3737$ "8~64,
Table 2 (continued)
41 oi 4:~ ,~ t4 45 4* 41 41 41
I 2.16078 J.170~8 $.1803~ |.18009 2.10882 2.20772 2.211110 9.22486 1.12211 1.24111
2 1'70611 1.78408 I,77571 1.73854 1,7~707 1.80733 1.~1732 1.3370~ 1.86660 1.1~601
i.51701 1.62964 1.04183 1.0.5377 I.+',6633. 1.57868 1.58754 1-691120 1.~41860 !
1'01874
i.34138 1"20728 1.37048 1'38329 1,310674 1'40784 1'41962 1'43108 1"44224 1"4611
0 1.30100 1.117112 1.22190 1"14660 i.'+0881 1.27170 1"21~12 1"294141 1.200~7 | . I l H
1.06312 1.00372 I.113~4 1.12810 1.14213 1.13676 1'10800 1.10188 !'19422 1.1041151
*P 0.S)7722 0,99348 1,00032 1.02443 1.03824 1'063.58 1'0~76 1.08104 1.00420 1.10701
*~i114 .10082~5 0,91480 0.23(182 0.94834 0.98130 0.07509 0'99010 1.00300 1.0171~
"79369 .81046 '827152 "84472 "80097 .874173 'fl ~OI .90084 0'92120 0"t16620
10 .7011113 .72871 '74800 '76441 - 7 8 1 4 1 5 "79700 '81601 '82060 "84441 ,Ik$1M)|
II 0.08177 0.00149 0.67002 0.68889 0.711680 0.7238/5 0.74040 0.75063 0.77228 0.7874|
12 .06716 .077D0 .69788 .61707 *834~61 .86303 '07083 .60768 .70397 .71071
13 .48601 .60749 -62827 .64830 .68703 . b m O 3 l . 6 ~ 4 3 B '021fl0 '63881 .66021
14 .41000 .43944 .40114 .48204 *b0220 .62180 '64040 .65806 .67626 .69361
10 .14070 . 1 1 7 2 3 7 .39004 -4178, .43886 -46912 '47608 ,4076~ .61688 , ~
16 0-28422 0.30890 0.332~57 0.30033 0.37723 0.39833 0.41868 0'43034 0-46784 0"4707|
17 "11980 .24fi00 -27043 '29411 .31701 .33898 '36010 .$8060 .40034 .4194|
10 '16044 .18346 .20931 '23411 .2fi792 . 2 8 0 1 0 1 .30280 -32410 .34400 .3~441
Ig .09302 .12192 .14807 '17488 . 1 1 5 0 7 2 .22308 "34662 '30862 .28902 .31049
M "0,1111'/ .0~086 .08917 '11826 . 1 4 2 1 1 5 .10707 .19097 .11|96 -23010 .t5741
t
I S.34907 2"2~67fl +43| 3.271611 2.2869+ 3.20391 3.3+70 2.20010 I.III2M
2 i'86487 i-86371 23+ i.88080 1.8971t 1.9(1&0~ 1.111182 1.02041 1'90780
21 1'02803 1.03821; 773 1,110896 1.67474 1"68340 |.09186 1.10013 1.7011Jl
i'46374 1'47409 ~42(~ 1.49407 I-.5131~ 1.02237 1"61140 1+040k 1.64~0
i 1.&3100 1.84207 279 !'30326 1.3834( 1"39323 i.40378 1.41118 !.4,111117
Table 2 (continued)
\ 60 61 66 67 68 69
k
.i
I 2.31028 2.32556 2-35532 2,36097 2.36652 2.37198
1'93516 1.94232 1.97618 1.98260 1,08891 1,99510
1"71616 1.72394 1.76071 1.78767 1.77451 1.78123
4 1'55736 1'5fi567 1-60487 1,61228 1,61955 1.62870
1'43023 1.43900 1.48036 1.48817 1"49584 1'50338
70 [ 71 76 77 78 79
Table 2 (continued)
70 71 72 75 74 75 76 77 78 79
"/6 0"34fi 0'35061' 0"37202 0.38507 0'30873 0'41122 0.42343 )'43540 )'44711 0"45850
27 '30~ '3222~ '33506 .34934 "36242 .37521 "38772 39097 '41196 '42371
28 '271 ~8541 -29045 31317 "32(157 .;13968 ' 3 5 2 5 0 ' .36504 "37731 ,38034
234 .2489{I '26333 2 7 7 4 0 "21) 114 ,30457 [ ' 3 1 7 7 0 ,33055 "3431 l .35542
M -197 .21277 [ ' 2 2 7 5 0 -24199 .25608 "26984 "28329 -29645 ,30031 .32190
I
31 0"i61 0.17690 I 0'19208 0"206~.H 0'22133 0'23543 0.24922 1.26269 ~'27588 0"28875
~2 '125 .1412/: "156~3 17202 '18684 '20130 "21543 2 2 9 2 3 '24272 '25501
~k~ .08U '1057~ "12178 13737 .15257 .16740 '18188 '19002 "20983 '22334
~14 .053 .0704~ 'Ott6S8 102S[~ '11848 '13370 "14854 '10303 "17713 '19101
~5 '017 .0352C "05209 '00852 ,118453 ! .10014 '11536 '13021 '14471 '15888
-- 0.0000~ 0.01736 0"03424 0"05068 [ 0"06070 0"08231 ~'00754 b11240 0"12601
37 -- -- __ 0 0 0 0 0 .I)1689 .03333 "04935 '06497 '08020 '09507
. . . . (~0o00 "01(144 0 3 2 4 7 '04809 "00333
;$9 -- -- -- -- ,00000 .01602 '03165
40 -- - - . , -- . . . . . 00000
I
Table 2 is reproduced from Biometrika with the kind permission of the Biometrika Trust.
x~ = V T ~- c~ yi - ciy0 (4.1)
Zp 1/2 + H N
P [ m a x ( x l , . . . , xN)~< H I = f_oo{ F ( ( 1 - , O ) 1/2)} f ( x ) d z (4.2)
where cii = p,
Using (4.2), Gupta (1963) computed the values of the right side of (4.2) for
p = 0.100, 0.125, 0.200, 0.250, 0.300, , 0.375, 0.400, 0.500, 0.600, -:3,0.700, 0.750,
0.800, 0.875, 0.900, N = 1(1)12, and H = - 3 . 5 0 ( 0 . 1 0 ) 3 . 5 0 . In Table 4, we
reproduce the values for p = 0.125, 2, 0.500, ~, 0.800, 0.900. Gupta, Nagel and
Panchapakesan (1973) constructed the percentage points of the maximum of
equi-correlated normal variables.
892 P. R. Krishnaiah and P. K. Sen
Table 3
V a r i a n c e s a n d c o v a r i a n c e s of o r d e r statistics in s a m p l e s of sizes u p to 20 f r o m a n o r m a l p o p u l a t i o n
m 4 ~ V*Im* I j v~
2 | t .l~1600113~)
2 . 3 1 ~ 1 3 .210721~ 4 ,170~,a~lM,M
2 2 .(~lOUO1139 4 6 .136001~
3 1 1 ./$,504073038 b 6 .11~71842
'~ .~b~144477 4 ,21044tJ~H6 6 6 .1~101'~114
3 , It},4~4M 1 .37~71434 10 I 1 .344343ff333
2 2 .448~711046 2 I Ig~073~97 2 ,171 ~1620d}00
4 1 1 .491716~9 3 3 . 1 1 ~
4 ,0~47~'~'~7
3 .I b ~ O l 5 .074766024~ 6 .07()741 ~4'7'/
4 . !040~4~00 6 6 .0683t~/1~
2 .3604 ~x3434 7 7 .04~2G6279
3 .7,36943aU3~ t~ .03~363073 .04 ltkt44~O
6 1 | .447634009| 2 .23~0104~
2 ~4330t~im 3 163 I t ~ 7 2 7 I0 . ~ I
3 .14~14772,~2 4 .1 ~ 3 ~ 3 3 1 7 2 .'I146241430
4 .10b7710776 b .007M47193 3 .14tmzm|~
b ~07421b~t~ 6 4 .1117016961
2 .311~18~1 7 .(.i~2401H i ~ 6 . Ut~J74~l124,
3 .2Ut~!64440 3 . ~o~n6~Tt~O 0 .07419t~414
4 . i 49U42{J~i4t 4 .15~312 7 . ~
3 3 .2~t~Ci3~t 6 6 12tmO37M6 .0~30073Z;
6 t 1 .41~2710~0 6 .0U7~17131~ 9 . OqC3,~11b6l
2 .2{~ggit~/3 4 18711162196 3 .17~0~f~4
3 ,13t)43~ b 1491764908 4 . l ~
4 . 1 0 ' ~ ;,q~3040 1 .3~73M3264 b .107744M~
b .077~37830 1781434240 0 . Ot~Z,tMOi~
6 .~14M4 3 |~07454442 7 .0749 lt~l~4.
2 .2"/t~777392 4 .00130714O0 .06,30;13244
3 . it~t~O 5 .07~422364 4 .16793~144
4 .139t~ 6 ,t~4aat~
& ,t O b ~ O b 4 ~ 7 .049ff/a~R~! 6 .I067h~I0
3 3 .246'~I~ ) 4 t~ . 0 4 ~ 7 7 .0~4~
4 ,I~J~7~7~)7~ 9 .ffdt06621~ 6 .161(~
7 1 I .3919177761 2 .27./169~778
2 .t ~1900~ 3 . IMt ltl~/~ II ! .33324744'~
3 .13211bb~l I 4 I 17~1~ 2 . It~04771
4 .0~48~g~07 .0~4477394
.07~t~t346 6 .07t~461431 4 . (.kq/~ 17(~q
6 .O ~ J 1 8 7 1 2 4 7 .0~Z164~
7 .0448~106 8 .(~1714(M)01 O .0672007a~
2 .2~7328862 3 I lg~JS~ 1~CJ 7 .04~7M0~
3 .1744h~33274 4 i 421)779776 8 .~1~2347~
4 .1307~ 5 I 137t~I0176
6 O03,~2.Mm~ I0 .0'~4 I ~
6 ,079~1174~ 7 .077~351806 II .02331 ~'~a~
Tables for order statistics 893
Table 3 (continued)
il 12
2 2 2051975798 4 1212063211 8 .0~8922143~
3 140309~11 5 .0~2~602 9 .0~14460445 ~
4 , 1071492~)9,5 6 .0822228461 10 .0440637542
5 .08644302~7 7 0701213964 11 .0300643799
O .07193ck5024 8 .0604384621 4 4 .1330111820
7 .0608869~62 9 .0~II & .I0~12067
10 .04~367615
9 ,0442&1945~i 4 .13981OO406 7 .0780173339
10 .0371029977 . 1136687821 8 .0677217143
3 3 .1657242880 6 .0061646279 9 .0691628729
4 .1269672925 7 .0~12410810 I0 .0617328060
5 .1026407291 8 .070079~32 5 6 ,1232t~:~
6 .0~178832 9 .0~0~20674 6 .1037367701
7 .07247410,50 .13061373~9 7 .0890434764l
8 .0618873278 0 . I0~122,47 8 .0773~2864
9 .0527~00~9 7 .0936961~20 9 .0676230994
4 4 . i479546~ 8 .OO08972960 6 6 . I18317~26
.1198752801 6 .1266377911 7 .1016824204
6 .1(X)0346,58~ 7 . 1083946831 8 .0884194610
7 .0848765182 13 I .3162~1842 7 7 .1167989950
8 .0725451434 2 .1567Ti~004 14 I 1 .3077301026
5 5 ,1:1964lt.~iOt 3 .10~9U8842 2 .15172W,~62
6 . l 167449806 4 .0~0736 3 .1031719531
7 .0991935960 5 .06&4634400 4 .0788715016
6 6 .13716243,~ 6 .0648221797 5 .06~|9(~7428
12 1 1 .3236363870 7 .046883,3088 6 .0537064714
2 .160~373762 8 .0406132548 7 .046069918~
3 .1089309641 O .03~122~462 8 .040114168~
4 .O830686767 I0 .030932"~44 9 .03&214170C
5 .0070884464 II .0268~372~0 I0 .03103711~
6 .0,%9933694 12 .0228868068 II .02733628~
7 .0476620974 13 .018434822ff 12 . O'xJg061001
8 .04102U8554 2 .1904130721 13 .020608026~
9 .03644390~ .130'~29 14 .0106279801
lO .0~)~012~91 4 .0997~6~606 2 2 .18442002~
11 .0"2,57945392 .00~7860,38 3 .12607019~
12 . 0206221233 6 .0078146832 4 .096~240~
2 2 .19726460"30 7 .06804672~ 5 .0786202~1
3 .134~T203~8 8 ,0603107946 6 . ~
4 .10310~y0206 9 .0439005067 7 .06(168967 II
5 .0836045822 I0 .038300179~ 8 .049370614{
6 .06978~668 II .0333147706 9 .04336171fig
7 .0~94590~2 12 .0284018130 10 .038233740~
8 .0512113198 3 . 1513017013 II .0336863ZI]
9 .0442747124 4 .1162698131 12 .029468131~
I0 .0381191478 5 .0044~X~ 13 .02~28~'~2~
11 .032'2/K~340 6 .O7929'22993 a 3 .145704,5~}~
3 3 .1579786877 7 .O679282364 4 .111081fib7
894 P. R. Krishnaiah and P. K. Sen
Table 3 (continued)
t4 3 15 16
fi .(~Jl 1181271 6 .064330081~ 3 .008a000764
6 07fio7fi4957 7 .0~4074400 4 .07540400~
7 { ~ 9 0 ~ 4 ~ 2 f i 8 .O484238833 5 .06130~724
8 0,574341188 9 .0427294113 6 . Ofi16624963
9 ofioq 07780"J 10 .0379177516 7 .0445503706
It) .0-1,15109192 11 .0337161721 8 .0300194716
11 1~19'~2310 12 . ff29915234 7 9 .034537811~t
12 t~.13:i22071 13 .f f 2 6 3 3 0 , ~ 6 10 .03078100V3
4 4 1272273070 14 . ff227213594 11 .02753fi3612
lIBtJ931 I08 3 3 .140732"2A02 12 . O'246479007
6 ,0873562483 4 .1082138452 13 ,0219956756
7 .0751519909 5 .M160675fi 14 .0194fi,Sfi037
8 .0655311~36 6 .0743~A36 15 .016871ff289
9 .06761209fi7 7 .06405~183 16 .0138287378
10 tk~t~,02"240 8 .056013612"2 2 2 .1743940788
il 0-I 48243409 9 .04944~100 3 .1191400287
,5 5 .1171012461 10 .0438960670 4 .0914350018
6 .0987747&50 11 .0390420915 5 .0744591145
7 12 .0346~ 13382 6 .002809'3~]0
8 .0742181416 13 . ~ 7 .0542033941
9 0 6 5 2 ~ 7 7 7 1 t 4 4 .12"2"2328270 8 .ffl75000760
10 .0576,101464 5 .00973231M 1 9 .0420638230
6 6 1115324579 6 .U~I4170fi~6 10 .0375018250
7 . olin i 4ofi595 7 .072fi0.16~9 11 . Of
135fi74012
S .t~IbVSl7110 8 .063617~qT/ 12 .0300461298
0 .0730O09221 O .06tD99061 l 13 .0268180679
7 7 1O90260480 10 .0498187838 14 .0237301~62
8 11 .0443247452 15 .ff3057~433
15 1 1 .3010-115703 ]2 .0393501S20 3 3 .136338~12
2 14812077{J~ 5 5 .11186989~ 4 .1048700750
3 1OO72"23449 6 .0045~D4 5 . U855189036
4 O77O594O60 7 .0815891122 6 .072r207508~
5 .06258458fil 8 .07 L4331681 7 .062356~515
6 .0fi26530120 9 .0631224388 8 .0546749107
7 I0 .0~0719306~ 9 .048436~000
8 , O395736673 11 .O499127743 10 .0431079377
9 ,03-191~905 6 0 . lOA~m6366 1! .03a~52906
10 .0309614122 7 .0914~204 12 .03402772~
li .0275211039 I$ .06014075,59 13 .030914913fi
12 .{Y244126313 9 .070~m82000 14 .0273fiOfi371q
13 ,0214810828 I0 .0629824402 4 4 . 1178~75/~I
14 .0185333'~3 7 7 . I ff.~916923 5 .096251~13
15 .01.51137071 el .09t0.t90064 6 .08 L,'B80445
2 2 1791215291 9 .0796738323 7 .0703000DI 1
3 12241769,53 8 ~ 1016946521 8 .0616728M
4 0939067144 16 1 1 .295~J~90 9 .0546505021
5 .0763912337 2 .1448881689 10 .048764774H
Tables for order statistics 895
Table 3 (continued)
t:fi
,I
Value it i V~lue m i J Value
l 17 17 0
0.136607328 .0413028102 ~2 .04781 ~2~9
.t~flH I 12f~}9 1X170349110 7 7 .09290317fl1~
. O;.t,t9'25:~749 1X~32040~92 8 .0~1~194607
i, . iO73517tl~9 . O`299082825 9 .0728154074
1D(~2:12622 .0'270170370 IO 0662667274
.078,~80532 .0242386812 li .1K~7626219
. ~k~t}488~r2 . O`215459,396 8 8 . O`J0736 l l t ~
.061136-1182 .01876,58306 9 .11~'~00"~ 7
0,M ,r~i:~!)41 1324207975 10 .07245099~1
.0.IN~i84327 1018792434 9 9 . ~,)11},11},5814
.04378~2~J59 .0~1421716 18 1 1 .28.1 to301297
( . l 0 |l).4(Jl 9(1~ 070285ff1(XI 2 1392~01620
.1}~74tt27156 .01Kff964413 3 .(D46172637
fat~Z1961t8 .05342U8202 4 .0724851730
.0~81545640 .04745f~87 5 05~274
.0~08NH805 .IN2472UStgl 6 0 4 0 S e O 0 6 ~
.1kM5210724 . ff,t~ I 0 ' 2 , ~ 7 7 .0431302310
7 .11~1740'2,6~ i 3 .0344194567 8 .037ff260 i 95
.08~$181,ql 6 .0310047771 l) . tk337388141
.O76O015677 . ff278210708 10 .0302610667
06789319T2 . O`247342005 11 O272938041
{~J572 I:~t107 11400681{}7 12 0"2470(J247l
08,r~y~J121H 0931620,339 13 0 ~2,3801573
t .2896,331~7 .0788266621 14 0 2 0 2 5 3 7 4 2 1
14 IIH24629 7 .06~2298009 15 .01~2488619
09~t748737 8 0 5 9 ~ 2 6 0 9 2 16 016285044 l
.07388,1~15 O .Ik~Ob7575 17 .Ol4L~68875
06012723O'2 10 Uq7723~J73 18 .011771U064
. r0,507326948 11 .ff1~1261816 2 2 166~.~
.04:1~23Cq91 12 ff,18604 '263O 3 113r~58132
t L ~ 1 6 7 2 8 3 4 13 .03484124030 4 ,{J~71507604
. (Kt414410,Mi 14 .0312881~1 5 .07 ! 0 8 2 5 9 9 0
0 ; ~ t 8 0 5 4 8 5 5 10341J04377 6 .01~t1975754
.0'2"/4465527 6 .0875729930 7 .05~0217,123
,O`247237144 7 .07586345,34 8 .045761Kt62~
, tr2'~,~6:,10771 . O6672O4245 9 .0407317967
.01096tK1~1 O .0693187706 10 .0365451U04
.0177476891 10 .0~31 ~A17771 11 0329704804
.01&15,52071 II .0-177987292 12 . ff298442464
. {I1'~/"~4751 12 . ff130970703 13 0270462261
! .1701426762 13 03883751167 14 .O2448O635O
1161866734 6 ~/68824tHt0 15 . O`221}607 ! l l
.0~9191~2557 7 0 ~ 9 8 1 1 7 3 8 16 .0196804667
072(19703Ni 8 .07391302t~ 17 .01721540"2,~
.0613091~450 9 0 6 5 7 4 4 2 7 3 6 3 3 12881198043
05;10761573 10 .05~9030403 4 .09918285311
.046614~18 11 .0~10137~5 /i 0 S 0 ~ 9 9 7
896 P. R. Krishnaiah and P. K. Sen
Table 3 (continued)
18 3 18 19 3
6 { ~ x 3 2 4 700 8 8 .0$64900639 8 .0611MI411
7 ~ ',~59~Y2 9 .077176~ 9 .04M2~8814
06224 ~,413 lO .000~91332 10 .04 l(Yd6~21
9 .0405162123 11 .O0~f/1109~ 11 .03713464~
10 o,I17473296 9 9 i0t~3127880 12 .033739117:
11 0370730~7 10 .0767442321 13 .030721~H
12 .03,110~K)I7 [ 19 1 1 .2799368O6O 14 .0"279~f360"~
13 .0~,)157060 2 .1367768168 15 .0'2~44~1~
14 .0279~7b014 3 .09',~)061763 16 .02301960~1
. (Y/~22".44784} 4 .071190242~ 17 .02tN~2146,4~
16 .022~161109 5 .O68OO94835 4 .I 0 7 4 7 4 ~
4 4 .1105(RK)331 6 .04904(0678 5 .0~79051~
5 .0903973787 7 .0424705246 6 .0745033878
6 0"/6557U277 8 .0374006329 7 .06464061~
7 .006362',~6 9 .0333319305 8 .0671X~32284
,0b~4310521 I0 . ff299634144 9 .0b0~72~8
9 .0520394281 II .0'271011338 10 .045757654~
10 .t~6718~t)4 12 .0'246129452 li .0414165001
11 .042160-~I 13 .0'224037540 12 .03763~7M
12 .03~ 1~94i32 14 .0204007370 13 .03427(~
13 .0~46102645 15 .0185431530 14 .031226~J41
14 031 :?452497 16 .0i07731147 15 .(Y~3944527
0 ~ 2 5 4 8 2 M , 17 .0150223067 lO .ff~14~
.09990~4321 18 .0131789904 5 .096794474~
6 19 .01093~2527
7 .O734460811 2 2 .16278/~651 7 .07127~74~
.0647101858 3 .11105~)145 8 .0628~700~
9 .O576543520 4 ,Of~29310f~ 9 .0561272ff2~
10 .06177~75 5 .069,5970759 I0 .050514163~
11 0,167468133 Q .O58891O196 I1 .0457330144
12 7 .0~103~1003 12 .0415~1234
13 0383932046 8 .0449~T247 13 .037863(~fll
14 .O347682770 9 .040t~917M 14 .0344996~1
6 .0032407331 10 .0360490040 15 .031375~q)2fi
7 0809~02644 11 .0326137544 6 .0i~021~03
.0713338046 12 .02962~236 7 .07~20'290~
9 .06368296~ 13 .02~9716~2 .069O294300
10 .0~7119728~ 14 .024~41909 9 .061633@g~
11 .05158~2 15 .02".~3,3068~ I0 . O554877906
12 0167370896 16 .(Y~02017247 11 .~931~
13 ~! 2~79846 17 .0180952193 12 .0450~4841
7 7 0~9016707,5 18 .0158767294 13 .0416203~
8 07~6179677 3 3 .1 ~,7138904 14 .0379290~
9 .O7OO199O26 .0907367O97 7 .06~172~1
10 06'~J209074 5 .079O298792 8 .075~15341~
11 .0568501034 6 .0~9273696 9 .067~33161
12 .0615190092 7 .0fi803~1~4 I0 .~703~
Tables for order statistics 897
Table 3 (continued)
19 7 2O 20 5
II .0551032224 1! .0~122405467 7 .060317575C
12 . r0501089625 12 .~960 8 .0612251420
13 ,O456621835 13 .09Mt8315105 9 .0547222521
8 8 .0~28339961 14 .02,tb479403 10 .04033742"/8
9 0740273546 l5 . O~Z4526609 11 .O447662310
lO 06~9582"29 16 .0204~2 12 .04O8014074
II 0604372723 17 .O 1 8 5 9 9 4 0 2 4 13 .037294840~
12 05497820~'~ 18 .0107136502 14 .0341351571
9 9 .~ I ~76,3,30 19 .0147107671 15 .0312332O4C
I0 O732703911 3 .! ~28134687 16 .02851092~
II .O 0 ~ 2 0 7 ~ 0 8 4 . o04tioIg010 6 6 .(}~71511254
I0 IO 0807000751 5 .077235&O08 7 .075770336(
20 I l 2756066156 6 .0654510179 8 . O0695M78~
2 1344941714 7 . ~ 7 7 9 .050~5976~
3 .0913234064 8 .r0501310269 I0 .063991063~
4 .0690879991 9 .04477632o2 II . 0 4 ~
5 0670566384 i0 .o4o3482354 12 .044670~1
6 O4827O1O93 II .O365934287 13 .04(]6.38~I
7 0 4 1 8 4 3 7 8 2 6 12 .0333397940 14 .0373845194
8 o36~o37ofm 13 .0~)4645702 15 .0342111024
9 0 3 2 9 2 9 6 3 O 2 14 . tYZ'/875~79 7 7 .06201~0~
I0 .t r 2 9 6 ~ 2 ~ 2 3 15 .(~90.4381 8 . O73O38367{
II .02688,38808 16 .0232716371 0 .06533076~
12 0 2 4 4 8 3 9 5 6 7 17 .0211277373 I0 .05803874~:
13 0 2 7 ~ 4 9 8 0 3 18 .0180874448 11 .05,~7~
14 .0204584~7 4 .1046766243 12 .048788~,5~
15 O187096782 5 . ~ 4 ~ 13 .04,1612100(
16 .0170711408 6 ,0726321560 14 .04(}645998'
17 .01549&1854 7 .0~K)731775 8 8 .0706~)0757
18 .01392T/072 8 ,0~)68~1 9" .071259100'J
IO .0127.530117 9 .0497M9273 10 .O04310~7~
2O .01020472O4 10 .0448455403 11 ,05~90731~
2 2 1595731636 11 .040~811669 12 .053264,149~
3 1088143707 12 .0370709493 13 .0487150834
4 .0835758044 13 .033879~02 O 9 ,0778118317
.06~,~,A7554 14 .031(]045146 10 .0702526464
6 06776996~ 15 ,07~17 11 .0638176734
7 .(kSOl 1O057~ 16 .025~97454 12 .05822291~
8 ,0442041191 17 .0'~0343 10 10 .07694743&~
9 .0394O03443 5 5 , O03996O007 II ,0(I~026010~
I0 . 0 3 ~ 6 .0797773755
Table 3 is reproduced from the Annals of Mathematical Statistics with the kind permission of the
Institute of Mathematical Statistics.
898 P. R. Krishnaiah and P. K. Sen
Table 4
Probability that N standard normal random variables with common correlation p are simul-
taneously less than or equal to H ; p = 0.125
Tables for order statistics 899
Table 4 (continued)
1
p=3
\N I 2 3 4 5 6 ? B 9 )0 11 12
\
3.5C .000~3 .O0000 .00000 .0~000 ,00000 .00000 .00000 ,00000 .00000 .OOOOO .00000 .00000
3.4C .00034 ,o0o0o .ooooo .ooooo ,ooooo .ooooo ,ooooo .ooooo .ooooo ,oo~)o0 .o(~oo0 *ooooo
3,3C .00~48 .ooooi ,ooo0o .ouooo .oooo0 .ooooo .coooo ,ooooo ,ooo~)0 ,~o~oo ,ooooo ,ooooo
3,2C ,000~9 ,00001 ,00000 ,0(,000 ,00000 .00000 ,00000 .O000O ,00000 ,00000 ,~:0000 ,00000
3. It .00097 .00002 .o0ooo .o~ooo .ooooo .oooo0 ,ooooo ,ooooo ,ooooo .oooo0 ,ooooo .ooooo
3,0( .00~35 .00003 .00000 ,00000 .O0000 *00000 ,00000 .O00oO ,00000 ,COC,~)() ,co000 .000o0
Z.OC *0018~ .00005 .00000 .ooo00 *~0000 .00000 .00000 .oeo00 ,0000o ,0000o ,00000 ,00000
2.BC .o0256 ,00008 .OO001 .O000O .00000 ,00000 ,00000 .OCOUO ,00000 ,O00CO ,()0000 .00000
2*70 *00}47 *0001~ *00001 *00000 .00000 ,o00oO )OCOO0 .(,0(~00 .OCO00 .0~00{3 .00000 ,O0000
2,6C ,004~6 .00020 .O000Z *00000 ,00000 .00000 .00000 .O0000 *00000 .~'000~ ,OO0~O ,00000
Z*SC ,00621 .0003[ .00004 .00001 ,0000o ,00000 ,00000 ,00000 ,coooo .00000 .00000 ,ooooo
Z,~C .008~0 ,00048 ,00007 *ooooz ,00000 .00000 ,00000 ,00000 ,ooooo ,oaooo .00000 .oo()oo
2.3c ,01072 ,00072 .00011 .00003 ,oooot ,00000 ,~0000 ~00000 *ooooo .{,000(~ ,ooooo .ooooo
Z*2C .01390 *00108 *O001e .00005 *O000Z .00001 ,OOCO0 .00000 ,OOOUO .00000 ~O0000 ,00000
~.I( .01~86 =00159 .00029 .O000e *00003 .OO001 .00001 .00000 )00000 ,O0000 ,00000 ,00000
Z.O( ,02275 ,00231 ,0004~ *O00t4 .00005 .0000~ ,0000; .OO001 *00000 .00O00 *00000 ,000~0
I,~ ,02072 .00~31 *000~ *00023 ,0000~ *OOOO4 ,OOO02 .000~ .0000~ ,O00~O *000~0 .00000
1,6( .O3593 ,O0*e9 .00~I~ .O003Z .O00t~ ,o000r .OOOO4 ,O000Z ,00001 .0000~ .00001 ,00000
1,7( .04451 .00~6 .001~ .000~9 ,O00Z~ .OO013 .OOO07 ,OOOO4 .O00U3 ,0000~ ,OOOOt .00001
1.6C ,054e0 ,0090~ .00251 ,00o93 ,0004~ .oo02z ,00012 ,00007 to000~ .00003 ,0000 ,o000z
1,50 ,0668~ .01214 .00368 .oo145 ,0006~ ,00036 .oo021 ,ooot3 .0000~ .ooo06 .oooo~ ,00003
[,4( .06076 .01~61 .00532 .0020 ,ootoo ,00059 ,00n35 .000~2 o0001~ ,00019 ,00007 .00006
1,3 .o9e8o ,02207 .O07~B ,OO330 *O01~B .000~ ,000~ .OOO3B .00O26 .OOO~B .o0o13 .oO01O
[,2( .|)50? ,02896 ,01063 ,0048b .00257 .00150 .00094 ,O00b] .00043 ,00031 ,000~3 ,000]8
|.i( ,135~7 ,03~54 .014~9 .00705 .00306 .00232 .00150 ,00102 ,00072 .000~) ,00040 .00031
|)0( .1~66 ,04807 .O~OOZ *0~00~ .00~71 *O03~J .002.~3 .00~62 .OOtl7 .00087 ,000~6 ~000~
0.9( .~640b .0~083 .02~88 .0~411 .00829 .O05~B .O0]5? .002~3 .OOt~ .0014~ .00109 *OO066
0.8( .2118~ .07b07 .035~e .OI~4B .01;05 .0077b .OO537 .OOJSO .OO29O .00~23 .00175 .OOI40
0,7( .Z41gb .094O2 .04644 .O2649 .0]~4 ,Oit|9 .00792 ,00~83 *00*43 ,OO346 ,00275 .00223
0.6( ,Z7425 ,11489 ,05~B ,0354V .0~298 .015B6 ,0114I ,00B~O ,0066~ ,00~6 ,OO425 ,0O349
0.4( .34456 16556 t09552 .060?6 t04176 ~0304 ,O~2BO .0t772 .~1413 ,01350 .0095~ .007')9
0.3( .38209 *19604 .11763 *07778 ,C5405 .04072 .03230 .0~47% .0~003 .026%1 .OI]B.~ .OIk?4
Oe~( e42074 eZZQ~ tl43~0 .09153 *07119 .05.)93 e042~4 e03]~5 .OZTB7 .023~0 .019r2 .01~92
0.1( .460|7 .2653B *i73[I *1220~ .09002 .07028 .05604 .0457b .03eO~ .0322% .0~760 .0~392
0.0( .50000 130409 s20613 . 1 4 ~ . 7a_ ~ ] 1 4 1 3 .,090~2 ,07311 ,06001 .05L13 .04375 ,03?90 ,0331~
\N l t 3 4 5 6 7 8 $ IO li It
0#00 .~0000 ,30409 .20613 .14~74 .11413 )090|2 )0731] .0606t ,DSII3 .0437~ ,03790 )03318
0,10 153983 ,34504 ,24251 *lBI21 14133 .]1377 .09364 .07093 ,06744 .05036 .0511| ,04516
0*~0 157926 .38780 *28201 .2L642 17250 .)4144 ,lIB.3 .lOlOg ,08744 ,0~654 .06768 ,06036
O*~O .6179| *43187 0324Z3 )Z5516 .z0763 ,Ir324 .14739 ,Iz737 ,11149 ,o@#sb3 ,0860~ ,079zz
0*40 .6~542 *47670 .36871 ,2971[ ,24653 .20913 .,18049 *15796 139fi3 *I2497 *lltOO ,|0Z|7
0*60 ,69|46 *52|73 *41489 ,341R0 *ZBEB7 ,24891 ,21777 .19288 .t7256 .15~74 ,14157 .12954
0,60 ,7~575 ,56638 ,4~21~ ,~8R66 ,33~$R 329~Z~ .2~897 ,2]00 ,Z097t )kOlO0 *|7~06 *I~14~
0o70 o75804 161010 *50974 .43703 t3B)A7 ,33858 ,~0371e.27501 .25100 t23061 .21310 *1978 t
o.ao ,~8814 .65236 ,55707 .4R618 *41t24 .38733 *35t40 .3~143 129605 27426 .2553& .2~880
0.90 ,81~94 *b9271 .b0344 .5~536 .48150 ,43771 ,40135 ,37062 ,34429 ,J2;45 .30144 62~377
1.10 .06433 tT&6~] 36909} ~6J0~4 ,58150 ,%4009 .50471 .47412 ,~473; *4~&O *40245 .3~34&
1.20 ,~8493 .79882 .-r3104 *b758] .62060 ,5~039 ,55641 ,~2665 .50033 ,4760] ,45570 ,43657
J.30 .90320 e6~847 376822 ,718$~ *67570 ,63902 ,60b~t )~7849 ,55~10 .5~024 *~0~51 .49062
I,40 *91924 *85~09 *80ZZ] ,7575] .71900 .6R5~9 ,6554~ ,628"16 ,~047| .50~86 ,56291 .544~B
I,S0 .03319 .~7673 .~292 ,70354 7%~12 .'~F~4~3 ,701~4 ,b76~ 0 .~5430 ,~3380 eb149] *~T48
1,60 ,945Z0 ,69946 .86026 .8Z604 .79573 ,768~8 .7~403 ,~166 ,70l|5 .662~4 .~647| ,~4841
tl?O ,9~543 .QI743 ,6~43| .05497 *82866 .804f14 ,78310, .7~31~ ,74466 .72~%Z ,71153 *69656
leBO robs07 .93263 .90518 ,fl~036 ,8~76~ ,fl3727 t~lflJl ,800~ .T844l .7~913 e7547~ ,74|2~
,00 .97~25 ,95661 ,~3021 ,9~=14 .90~33 ,8906~ ,~7~6 .~63~ .8~172 .~oI~ . ~ t .elaze
Z,10 ,@BZl4 .96~86 .~088 ,93608 )9~qot t91184 390037 .~895~ .879~ )06~4Z .86007 aO~ll~
Z,20 *96610 ,97327 ,961~4 .9~017 ,9~966 ,92913 .9203l .91t34 .90~70 .89*60 .~876 ,07928
21~0 198928 #97927 .96908 .~6IOZ .@5~t ,9.~62 *~.36~ ,~29bV .922~ .~2%~5 .~0947 690]~
~.40 ,OQlO0 *00406 .9r677 .96962 .96319 .9~6~4 .95074 )044~ ,939~3 .9.]378 ,92n~1 .9~34]
1,50 .99379 ,967e9 *98226 .97b~8 19717l .9~'~3 .9619] e957ZV ,952~ 0 t~4~45 ,94423 t9qOIZ
~.60 *99534 .990ee .986~9 ,9824~ ,97849 *97463 .97090 .~b/28 .96376 .9~O:t~ .9e701 )9~376
2.70 .99~53 .99319 .98997 .96684 .9038] .YBOO7 .97~0l .gl~z~ .972~ 0 .~6~4 .~bTZ~ .V6471
2,80 .9@744 t~497 .~92~* .99023 .987~5 oU85~3 .9n35~ ,96144 .97937 .9t734 ,~7%~5 .97339
2.90 .99813 .9~632 .99454 .~2e[ .99[1~ .98947 .987~4 )9862~ .98469 .9R]16 .9e)65 .98017
3100 *~86~ *~33 *~604 *~477 *~3%3 *99~)1 .99Ill .98993 ,0t~677 .98763 *98650 *9~40
3.10 .99903 .99806 ,99715 *99623 .99~33 .9~444 .~9357 .99270 .99185 .99K01 .990;8 .98937
3.20 .999~| *~063 .~9797 ,9973| ,996bb ,~02 ,9~%39 ,99477 .~94L5 ,~]%4 .99~94 .99~34
3,30 ,99951 ,99904 ,99857 ,99810 ,99764 ,~9716 .9~b73 ,99&29 . ' ~ .9954| .99498 ,994~5
3.40 .99966 .99933 .99900 .99867 .9983~ .99603 .99771 .99739 .9970B .99677 .99647 ,99616
3.50 .99977 .999~3 .9993l .99908 .99U85 .99063 .99~4) .Vgdl9 .99797 .9977~ .99754 .997:)2
900 P. R. Krishnaiah and P. K Sen
Table 4 (continued)
p = 0.500
1 2 J ~ 5 ~ I 0 9 [0 |1 IZ
3.50 ,000~} ,OUOOI .0o00o .00000 .00000 .0OUO0 ,00000 .00000 .00020 .00000 .00000 .00000
3,k0 .00034 .0000[ .OOOO0 .0oo00 .O0OOO .00000 ,OOUO0 .00000 .33030 .00000 .00000 ,00000
3,30 .000~0 .00002 .00000 .OuOO0 .O0000 .OOO~O .OOOO0 .00000 .00000 .O000U .00000 ,00000
3,20 .O0069 .0000J .00001 .00000 .00O00 .00030 ,00000 .000~0 .00000 .UO000 ,O0000 ,OOOOg
3 , 10 .00097 .00005 .00001 .00000 ,00000 .OOOO0 ,00000 .00000 .00033 ,00000 .00000 .00000
3.00 .001~) ,00000 .00o02 .00000 ,00000 .OO000 .03030 .O000D .00000 .00000 .00000 .00000
2.90 .OOIU/ .00011 .OOO0~ .C0001 OOO00 .00030 .00000 .00000 .00000 .00000 .00000 .OOOUO
2.80 00256 .00020 .O0006 .0000[ .O000i .00030 .00000 ,00000 .OOO00 .00000 .00000 .00000
2.10 .O0)4Y .O~u~O .00001 .00002 ,O000I .oOOOl ,00O00 .00000 .00033 .00000 .00000 .00000
2* 60 .00~06 .000~ .00011 .00004 .00002 .0000[ .03031 .00000 oOO030 .O0000 .03000 .00000
2.50 .0062[ .0006! .0001~ .00006 .O00O$ .00002 .0000[ .O003l .00000 .00000 .00000 .00000
2.**0 .00020 .00090 .OOOZl .OO0|I .O00o~ .O000~ .00002 .U003[ .00001 ,0000[ .00000 .O0000
2.30 .o1o72 ,001~1 .000~| .000[I .o0oo'~ .00o05 .oooo3 .00002 .00031 .00001 .0000| .00001
2.20 .OltvO .002~ .00062 .00021 .ooul4 .00008 .00035 .0000~ .33033 .00002 -00001 .00001
~*|0 OL~Ob .00289 . O (109J .0004~ .00022 .0001) ,00009 .00000 ,0000~ .00003 .0000} .0000~
2.00 .02275 .00605 .O01J! .0006~ .000|5 .0007[ ,00014 .000[0 .00001 .00006 .0000~ .0000~
|.90 ,02812 .00501 .00201 .0009~ .00056 .000~ .00023 .00016 ,00012 ,00009 .00001 .00006
1.00 .03593 ,03/67 .O0!B9 .00|5) .000~) .00053 .000~/ .OOUSb *000~0 .000[6 .000[~ .000|0
|.TO .0~65/ ,OIOST .00411 .OOLIO *0012~ .UOOB2 .0005! .03052 .00032 .00025 .00020 ,0001/
[.60 .05.*~0 .0[300 .00)/b .00~05 .00[05 .00[~6 .00000 .00006 .O00$L .00060 .00033 .00027
I , 50 .06001 ,0[03? .00!99 .O06~fi .002/2 .OOLR5 .ou[~] .00101 .00019 .00003 ,000~? ,000~|
1 . ~tO .08016 .02)9~ .0109~ .UObl6 .00393 ,00712 ,00200 *00153 .0312[ ,00098 .000B[ .0000|
1 ~0 .O}68O .0J094 .014f0 *00050 .00560 .00J)) .OO296 .0022~ .00182 ,00149 .O0|2** .O~LO$
1.20 [150T .03955 .019!0 .01179 .0070~ .00565 .006?? .00335 .002/0 ,00223 ,00100 ,O01b~
1.10 .|~567 .05999 ,02590 .0[000 .01092 .00191 .U0611 .00405 .003@5 .00329 .002?9 *0024[
L.O0 .[~8b6 .002~| .0~300 .0216~ .01494 .Utl09 .0~06[ .00692 .035?0 o004/9 .03..10 .00351
O. 90 I0606 .011~6 .04)4? .020J2 .02015 .01521 ,0[[93 ,0097~ .00809 oU06~6 ,0059| .00516
O.BO ,2|100 .09609 .05520 .U3095 .U2003 .02U50 .01041 .0|3~B .01113 ,00969 .008~1 ,0073~
O. 10 .24196 .[1412 .00941 .0)?02 .U)525 .027~6 .0~2|0 .0|042 ,0[562 .01348 .O|[T9 .0[04]
O. bO .2/425 ,13/57 .00619 .060bU .065/0 .u]616 .0295? .U26~1 .0Z|24 .O|B4/ .0102~ .01549
0,50 ,J085~ *[0332 .10500 ,0?6|/ .05050 ,046)~ .O~US/ .03~90 .0204? ,02695 .0221.* .0|90~
0.**0 ,]5558 .[9198 .120~1 .094~8 .0/393 .ObOl? .05041 ,U4310 .0}!62 ,0)~2) ,02969 .02677
0. ]0 ,30209 .221~9 .Ib~I~ .I|600 .0922| ,0761Z .06611 .05519 .0~932 ,06)03 .0)92** .01561
0.20 .420/4 .25711 .18103 ,[~0/5 .11~16 .09500 .00141 .07113 .O6302 ,05650 .09|15 .04661
0.|0 .660i? .294~2 .7150} .IbS?] .I]050 ,[I[?? .IU[Sb .00940 0?90| .07215 .0057** .0603~
0.0,o ~0~00 ~3))35 ,25000 .20000 *[bb6? .1~206 .12500 ,[11[| .10000 .09091 .08553 .0?692
L 2 3 ) 5 b ? 8 9 |0 11 12
50000 35333 o25000 .20000 .10661 .1~286 .12500 .L[|I| .10000 .0909| 003)3 ,07692
O. lO 51983 *31608 .28112 .234~6 .19023 .[I|97 .15[)) .|]620 .12)53 .1|307 .10622 .09013
0.20 .)1926 .61623 .J2180 .71L92 .23308 .204~5 .10240 .LbOB1 15058 l)Bb9 .12863 .[2000
0.30 .6i191 .*5931 .)7006 .)L?06 .21106 .240]Z .21b)1 .[~1[] 10129 .[6801 .|5609 .|~692
0,**0 65562 .50202 .61319 .35650 .)lilt .2Y930 .25360 .23281 .21559 .20099 .J006) .|1151
O. 5 0 69146 .56624 .45855 .)9074 .35606 .321~) .29~0) .21|01 .25J32 .2)150 .22386 .2[190
0.60 12515 .50906 .50)16 .44425 .3990[ .36509 .3)10) .3I)BO .29611 .211)2 .26260 .26911
0.70 7580& .63019 .56885 .69U4| .44605 o~[0)[ .30270 *]5020 .3)119 .32006 .30458 .29089
0.00 .7001k .61098 .59)2] .536bl .~92~3 .65288 .67896 .~045[ .38~53 36525 .3~915 .33~8)
O. 9 0 .0159.* ,70922 .6363e .58224 .5|983 .50536 .~1060 .45211 ,**)0}0 .6|Z]I .39502 .3010/
1 O0 BOt]~ .T~SZO .61110 .02010 .18600 .55751 .52610 *)0030 .61920 .~6056 .61J9** .62098
|*10 86113 .11860 ./|10! .60966 63[02 .~9913 .17[95 .54039 .52710 ,~0930 .69~19 .6/78b
I . 20 .0019] .0096| o75J73 .7[000 .67626 .666[6 .01628 .19560 .~1569 .51700 .51|06' .5~690
1.30 .90320 .g3134 .T8~66 .]4T99 ,~|5|0 o68~1J .66201
l **0 ,64151 .622~8 .605~5 .50980 .)I)59
9i92.* *06243 ,01066 .10309 ./5)26 .72~53 .10519 .08529 .66144 .6)[27 .63652 .02291
1.50
,93Y19 .0861| *04656 -0151} 1003 .16525 ,14600 .12652 ,11001 ,09690 .68119 66041
I.~0 94520 ,90627 .01|43 .86398 .820~0 .79973 .70136 .76619 .169/5 .Y3590 .22128 ~7111J
1 10
9~5..3 .92124 -89331 .86966 .86908 .8)090 .81~5} 79902 .10631 .77t80 .10236 .1~[63
|.~0 I,gO~OT .93101 .91233 .89218 *~74~0 .85810 , 8 ~ ,831~6 .01948 .80841 ,7901[ 18868
1.90
91128 .96017 .92867 .91112 .09669 .80317 .UlOO? .05958 .04914 .$3943 0]036 .0210~
2 . O0 I-9~721 .93055 .9211 .92845 .91585 .90h2 .09395 0806Z9 08F5]| .0669I .85902 .81|59
~.|0 98216 .96~|~ .95~16 .96260 o932[! .9Z~66 .91~85 .90509 .8}8~0 o~9090 .86~) .077~6
2.20 9U610 .97624 .96)flu .95..41 .96190 .9~005 .9~U~ .92391 .91~58 .91156 .90586 ,90065
98928 ,91990 .91169 .90619 .95710 .95012 .94..9[ e93930 .91611 .92911 ,92637 .91984
2*40 99180 .98459 .97809 .91215 ,96065 .96151 .956~$ .95720 .967}0 .9~3H2 .9}992 .93619
2.50
.99319 .90025 o98S21 .9r856 .9162} .9TOIf .96635 .96212 .95922 .95592 .95202 .96979
2*60
.9516 .99[[3 .98776 .90306 .980~0 .91712 .9/~11 .91125 ,9601 ,96580 .96337 .96096
2./0
996~3 .99J36 .990~) .90160 .90509 .95266 .90030 .9~80~ -9~592 .91380 .9liB8 .96990
Z.80
99146 .99509 .99288 .99001 .90886 .98097 .98~[I .98}45 .9OlBO .98020 .91065 .91716
2.90
990[] .996~0 .99616 .99321 .99L11 .99012 .980:~6 .98~b5 .98639 .98511 .90390 .98283
3.00 .99065 .99738 .996|0 .9950 .99396 .99288 .99[EI .99089 .90994 .9090| ,98012 .98126
). |0 99903 .99012 .99~2 .996[ .99560 .994~) *99600 .913~5 .99254 .991}5 .90|28 .99061
3.20 .9993| .99066 .99001 .9916~ .9960~ .99~20 .99~13 .99~20 ,99068 .99[? .9936~ .99~[9
3.30 ,99952 .99905 .99~61 .9981T .9911~ .991~5 .9~69~ .99651 .91019 .99582 .99546 .9951[
~.60
99966 .9993.* .999U2 -99fl12 .990? ,9981~ .9~10~ .99151 .991t0 .99/0 .99610 .99~52
).50 99'/17 .99954 .999~2 .99911 .99890 .9981U .99e50 *998~0 -99U1[ .99/92 .9922~ .99~$0
Tables [or order statistics 901
Table 4 (continued)
p=~2
) 4 S 1 i ! I0 II II
I*$@ ,~OOZI .OOOU+ ,OOuUl .uuuu*l ,OUOUU ,U~GOu ,O.UUO ,UOUuu ,UUOUU .tJOOuu ,UUOOU ,OUUOO
1.40 , O O . J4 ,OOt~| ,OOOOl , O~;OOU ,UUOOO , uOuOU , OuuOU ,OuuOO ,UUUUQ .~0Ouu ,UOUUQ *OUOUO
,UUU44 ,OUUUS ,oOufli .Ouou| .~UUUU ,OOOuO ,UOUOU .UOUUO ,O00QO ,OOQUU ,O~OUO ,OUOOO
I*|0 ,Ud061 ,OOUQR ,ROOU) *GUO01 ,~JU~I *~OUO ,OOOOU ,UO~UO *QOOGO ,GC~UUU ,GOflOU ,~JQU~
I.IO OOO+/ ,OQUi| ,OOoOq ,O~UQ/ ,UQOU| +U~U| ,~OuO| ,UOUUO ,UUU4~ ,OO~UU *U~QOO ,U~UdO
,Ou|I4 ,UOOTU ,UUOOl ,UUOU~ ,OOUO~ , U OCt1| .D~OO| *~UOI ,OOUOI ,~OQO0 ,U~UU ,OOUOU
|.SO ,~UIR# ,0~76 ,UOOIO oOuoOb ,~UUUI ,L~JO~ *GO~ *UU~I *~OOI ,UUQOI .GOQU| OOQO~
1.10 ,OO;4~ ,0004| ,00016 .UOUUR ,UGOQ~ 0000) *Ou~)~ *U4,)OU; *I)OCHJ| *UO~Ui *I~UUU| *GU~UUI
+, TO OOS4? ,000~) *OUUI4 *UOU|J *OOOOI ,U~OUt *UOU04 ,DUCK)] *U4JGU~ ,UU~07 *UOOUI ,~OOOt
U0~66 *UUO+| ,O00Jl *OOO~@ ,OUOi+ ,UOG~+ *l~U4k~ *OGOO~ *UO004 bOOO) *OOOOt *QOUU~
,OUmll ,OUI Jl ,OllO++ ,OoO I f l ,GUOlq ,UQOL4 .U4JO t o ,4kdUOI . O~UOt *OOGO+ U(NJO4 , UOOUt
+,60 OUITO ,OO|i+ , Ol*WlO *00U4~ * ~O0)U * GOD7 I * D~J i & *0~4) i ~ *O~OiO *~1004 , OOO0 | , O00~
+,]0 U~O~ .0U454 .UO| IR *OUU~ *U4)044 ,UGU]~ *UO~q ,~JOl+ *OOOl& *U~GI | *Ol~l I *OOO|O
,011VO .O0~U *O01ml . OUO++ * GOOd& * OOO4R *OOU)I *~GU)O *UO0++ ,UO071 *UOQII UUU|+
+, I I ,UI ll6 *GO|V1 ,OU4 )R *OUi4) * OGUVl *U~GI~ * U(~+& *UOO4S *O~O+i * U U ~ I+ (lOOT, ,QUQ+4
06~14 oOU66q .0~] |i *OO+O4 *~146 *UGIUt * O~GRJ .uOUtl ,QO04? .U~04i .00044 *QGO J~
|,+0 ,U~ll~ *OUR'1 . 0 ~)~$4 O07R~ i~O+U4 .~146 *0~|~ *O~IUU *O0~S *~uo 1) .UU06+ ,O00~6
1.10 U)~ *Ollq+ *OUl )U , OU4U+ * (~J++O * l ~ *QGi?/ *GO141 *U0|24 ,00101 *UOO~4 * UUOI4
h 10 0446~ .Oi+bl *OU~+4 o OOS6O , bU4O| *00)1S * 0G~44 *UU+i7 *GO 1 i | ,UU|+~ .UOI|+ ,GO|+4
hiO 0+440 ,O+UJq ,U1144 *OU~b$ * UOS&4 *00441 ,OQ|tO ,OUTDO *OG+SS *00+7| UO~UI ,UO|IO
l.SO U&ilI .07bit .01+11 .OIO)S .~U)I) .OORII .OU+O) .~04+$ .GOal .OoIT) .O0+ii .O0+Sq
1.40 .OiU16 .UII41 .014q0 .0| ii) .It04/ .00111 .006+4 *OOSql *~OSl$ *OU4+S .O04Ol .OOIkl
I.M ,OSilO .04464 ,01SII ,Oll+l .Ui4O) ,OtiS) .00641 ,,OILS ,OOt|i ,UUIJ) ,OOSbq ,OOS|i
lISO| *Obl|l .UJ]I| *0+11| .UIRSb .U|S|t .01111 oUIIO$ *ilOq11 .OAf4+ .OO+ib .O011t
hiO .11+61 .OR++I .04+UI .0|01+ .UI4~U .U+O0| .01104 .0141| *OlJll *U11 l q .U|UIO *OU+IO
1.00 ,16166 .01661 *ObJIl *Oiq4) ,U)+46 ,06611 *07~44 *UlqI/ .OIl+l ,01611 01441 *OIJZS
O.SO 1140~ ,04107 *O~+|S o04416 oL4~S *U|)I| *07164 ,U~$|~ *06 )|0 *O+Oq+ .Ulqi+ .OI/T|
0.I0 .+11R6 .1165+ .UiUq+ .Oh+J4 .U+U'I4 *~)7) *U)I&4 .UI|4+ .0|011 *U7~44 .OI+~| O~$JO
O* 70 .Tqiq& .|JVmO .++I~4 .01P11 .+6J/I .0441+ *m+t) 041RI *O|i|l +O|+SI .u|+iO *0)0+I
.I/474 +I~|I| .11R+I .0+441 *+|l+q .0b17+ .O~U|I .0+414 04+)+ *04+4J .06+|4 *Oil]S
.|A,~S .I11i4 .14111+ .114+I ,L'J&rq .161466 .01%I0 ,O&|q| .06+11 *0q/46 *Oq++1 .0+010
0 * 40 .144~I *22161 .I+.Iqi *lilJ/ .I1111 .IUJ14 .0'1741 .UM4iJ .Ullql .O/IR+ .U+/li *U6|I+
O* JO .JRTllq .+~&+R .1+bfl .I&I14 *I4UR+ *|74t+ .IIJ64 +IO|OI .UV+I+ .ORR6+ .UR)6U +O|IFI
0,+0 .4JU1q .IR410 +~I)+ .1+llt .161E1 .14441 *t)+|t *12411 *IL&O$ .IURb& .I04)+ *U+&I~
O* I0 .4&UI/ *Jl/U+ .lh+b/ *+2+4) .I+64+ .I/IOS .1611+ .14q|| .l)qll .II14+ *IPq)l *|&011
.4+U+I *If+L+ .+y4+~ *I6116 .++qUp .7016+ .I+}0+ ?++|11 .I&&65 .I+|12 .144)I t161|4
I [ | 4 + R 7 M q IA II 11
@,|O .~J~HI .4Jbbl ,}311 .2~q ,~641| .~41~4 ,2~)21 ,~UNI| .l&~q .ilb|O ,11f44 *iiq&S
O,&O ,?~*ll~ ,k|**~O ,$4q44 .~040| .q/bUt .44J|? .4~1~] ,4~i ,l~lU& ,1~1|1 ,|&|~l *))U4|
O*IO ,~i114 .Iul&l ,i)lii ,S~Olb ,S~iIS ,St|q4 ,~kulO ,4q1~| .4|$1u ,4~i&& .44~11 ,4)10
|*4G *41q14 .4/|41 ,illZ4 *il+OI ,/4Q+q */~644 ,?+bil ,/qlU~ .I)01~ ,/l~il ,|U+40 */UU~I
|*SO *4|llq ,4q+&4 *16J|1 *lJq+l *iZOIt ,Ifl4+O ,r+O)O ,11114 *li&4? *FSh+6 *14&ql *T|114
|*tO ,q4~ *4tO/+ ,4d64i *~&+O? .4~0|0 *Ill4q *N~41 ,DOqt/ ,FqqJq *F~Ot~ ~#lltI *~J/O
hIG *q4~4| *ribS4 *q~41I ,41/67 ,1#741 *i%qII *lqi4I *tIi47 *iT'll *17t@4 ,II+)~ *O~iJO
I*JO *S&4OI .440~4 .41|~| *+O&k| .l+J~4 .4~747 ,4~)01+ 64~,46J *8+67| .44886 ,446UI *iiS|I
|,q~ ,qlkT| *+6164 ,+|6|II ,++]+O .qiT&6 . ~|116 ,4~66 ,IRtU4 .1t006 .R~si| ,i~16R ,R66|]
+,H .v7174 ,qi||q ,64141 ,+JlqO ,v2 Ill ,q40~/+ .+t)+O .~644 ,+O~qS ,iq441 ,O+}U~ ,ia+qJ
+,IO *4141J *~N~q~O ,q+RR] *++UO+ *~746 *~JSI4 *++~&l *+64|~ ,ql+O~ ,+i4)U *~O~qU . 9 0 b #&
+,~0 *ql61~ *qf~t ,6614i *q60~| .640~ .q4i44 *46J|4 ,+tR|& *q)441 *qJO~U *qT&|4 *qJ]Ti
~*~ *i+l~# *qiliS .~1445 *+616q *~R~61 .Q540S ,4~4Rq ,q+|OI .q4)41 .~446~ ')4ii7 .q~ll+
+,q~ ,~4iIU ,qR%4& .qiOik .ql+41 .4|14q ,~b|~O *qb44J *qbi|l ,q+~41 .~%~lu ,~%|lq ,+~U|~
~.40 .+4ilq ,ql6i~ ,~141S ,+li|J ,~li4 ,~14q4 .qlJ74 .~&+l| ,q6|)8 .'*b',ll ,'bAli~ *+~11)
~.141 ,q~614 ~*)|~+ ,qRUl~ ,qa+~l ,VilOZ .4iU|O .ql6S+ ,~16~& ,V!46I ,qlT~i *ql|7~ ,V6~66
+,|0 ,++~%1 .+~llU ,q~174 ,64601 .qR|lU ,~617~ *~166| ,qS~O] *qRO~i *~)~14 *~l~lJ *ql~]
~*q~ .q+li] ,+'*b~6 .q~il ,qq)+~ ,+q+l? * ~li/i ,qqU/I *~11 *'llai+ .*PRRU~ .~t;64 "*qRi4 !
~*TO ,q~ll ,~+*~71 ,qqR|& ,+ql~l, ,+*1~/0 *q*l~l& *')461& ,+1~466 *++6%1 ,~471 .~6446 ,+v6~7
|,~0 ,q~&6 ,q'J4J& ,+++O| ,q+R6] ,4~RSM .+4115 ,+qRII ,+~1++ ,~+~17 .q~f~) +~+|)4 ,++?|k
|.~ ,qqqF|. , q + ~ +. 6 . q * 1. 4 ] & . +. ~ + | i .. + ~ + U | . , q + l t 4 . ,+4R&
. + .qqR44 .4+0|+ .qqlT~ -~ql|2 ~++/q+
9O2 P. R. Krishnaiah and P. K. Sen
Table 4 (continued)
p = O.8OO
-n
1 1 ~ It 6 f 0 I I0 II II
1.00 00011 .0g0u~ *0UUOJ .0VUUI *U0U01 .00001 *000~i *000~ *00000 .80~00 *000N *000~I
).40 . 0 0 0 )4 .00001 *00004 .g0U0Z .0000i *00NO *00001 .00031 *N001 *00021 *00ON .0~0H
2. I~ .00001 .00011 .0000+ .0UU02 .00001 .04001 .00001 .0H01 *0Hil .00001 *~01 .004~I
)*J0 .000i .0UUIF .O0001 .00002 .00004 .UO002 *0N0+ *00001 .00011 *00001 *0H01 .~0~|
0.10 +.D0011 *00U++ .00011 .00001 .00001 .OO001 .00014 .0001 .0|001 o0OO00 *00++ *00~1
)*00 ,02121 .000)1 .0001 ,0001Z .00009 ,00007 000041 00031 .00044 .0000 ,00002 *00001
l*i0 00ill ,00US* ,00011 P0O|i .0Q012 ,00011 000041 *00001 *00~G& 00001 ,0000t ,00004
J*00 .0G~S5 .00011 .UO04~ .000~I *000~0 *00011 0HI2 .000|I *00010 .0OO00 .0~000 .000~I
) . 10 00341 .00110 *00060 .00041 *000J0 .00014 .000+0 *0001| .00011 *0OO1 $ .00011 .0~011
+.i0 .03410 .00iS .+0011 .000++ *00001 00021 .0~0X .0NIl .00001 .00010 *OOQII .OO011
+.i0 .00ill .0001/ .G011+ .00014 .000~2 *000Sl .0~ii .0~110 .0~02) .00~10 .00011 .08014
1.40 .00110 .00ZVV .00111 .00i++ .000+4 .00016 .0004 *lOiS& .00041 .00044 .10040 *00011
I* 20 *010 I+ .OUiUS .UOii+ .00112 *GOt 14 .00110 *03011 *08011 .00011 *00014 *00HI *000t+
~1,+0 .Gilt0 .0Oi+l .00121 .00101 .00150 ,00111 .00122 *01115 .D0104 *00014 .000iS *0~It
I* I0 .01111 .00141 *00411 .001)I .001iS .00110 .00111 .0011& .001+I *00124 *00112 .00111
).00 .0+J1+ *00Vl) .U0+Z9 +00410 .00111 .00201 .00+ill .00122 *00131 .001+0 *00111 *001&I
1.90 ,0ll Ii *01iV0 .0011 .0UII4 .00500 ,0~421 ,002il ,0010S *S0112 .00Z11 ,00~40 *00~11
1.00 .01101 .01111 *0110+ .001JI .00110 *OOSFi *00S0) .01441 *00401 .00210 *0014| .00|11
I . I0 .04421 .00110 .014S1 *01114 .00011 *00111 .Nil+ .00110 .00114 *00101 .00+II .00000
*0+410 .01121 .011+I *014&2 .OliOS *01019 .00911 .001+2 .0012+ *00~91 *00141 .00~01
$*~0 *01111 .01~i1 .0042t .0110! *OISll .01212 .01111 *01000 .0100S *001IU .00116 ,00012
.01011 .0411i *01101 .0Z+$1 .01014 .01196 .011N .0|410 .01)11 0102S *0111S .01011
l*~0 *01110 *0S411 *02+11 .01111 *O+iSS ,Ui]il 00001 .0111~ 01141 .01ill *01+14 .01011
I.+0 *II+01 *01101 .04110 .01111 .0]MI .01101 .00111 014S+ *000ii *Dill' 01150 *O|lll
1.10 ,lJS&| .OaUVi *OiOSl *04V01 .04+&) ,021;0 *02411 ,0|120 ,01014 *00711 ,OiSli *00421
1.0~ .12111 .0II+ .01419 *01141 *0120Z *0~151 .04210 .0)ll *0)|05 *02410 *0)+H .02101
*II+01 lll)J *09010 .0tS|i .0~$I0 .0+HI .0SJI .01550 *0461 *04)+ .0lIS .UIHI
0* 00 .01116 .lllJl oi014i *05151 .01006 .0106T *01115 *01151 *01012 .01010 .00010 *011i~
0.10 .lit& *II020 .i+++S .II001 .01151 *04111 *01110 .0|i+l .011|I .01010 *0&it1. .0II
0.0 .17001 *li+i .11//I .I1121 *II f11 .10t0] .0510 .01111 *01121 .01101 *01114 *01110
I. I0 *|0121 .1112 *IIIII .II+02 .1)I~i *i+I04 .ill,0 .llli| .IDa01 .10100 *091/i *09101
0. 0 .141SI .J+0Jl .+0111 .11005 .li+ .iS1/* *14i11 .illtl .11/04 *l+l)l *IISI .li+l|
0.~0 .2110 *+I,I+ *+)I0 .11114 .II021 olFI+I .I1711 *i2004 *|$019 .14441 *I1001 .11401
O, +0 .4J0/i +)+040 *010+ .04101 *010/I *i0110 .19021 .III$I .11/12 .I1014 olill *II101
0 10 .41011 .I$III .)011+ .+PI+0 .+0519 +)011 +Jill .+l)+i .00512 .II+I .I024 .llli5
0.00 .IN0~ .)V|II *)I,M .|+fly .+liD0 .012.2 .Ill0' .+1001 *)1101 .01014 .+/115 *llill
0.00 . ~',)uoo . jv75e ,3*bJil ,.Zz 3V9 , ;tglO0 ,27)$4 ,1s965 ,2dl~lZ3 .~3ui| o Z;zO~* ,21:~1Q ga167
O,lO * DJ'J{II ,qJl0* , .lUsUG .3913U ,32841 131Oll 19545 * Ill/I *1?311 *21Ill * 2 ~lil? ,24913
(~ , , ~ 0 * ~'SV~+I , * YtlV* . +*~C*'+] * 192.Sl ,~766 * ]*l~& .313]; , ilOilll o 30v0o * 30o~9 ,ivi*2 ,lilt+l
o,:1o .+17vi , P~,'~ i c , **+ l b 5 ,,,~]*~ ,*00,31 ,31qt*sZ , i?30il ,350~0 * 341~70 , ~in,~** ,3~)o*! , i11111
la*qQ * I~5542 ,%lk119 , %d+~l+{J 14?~l* * q +1+~1~ , 4 ~1~}1 9 *lill| I * 4 I~lt i.laQ | ] , J1PVl 0 , .J)Oi5 ,Jill+iS
(~ I i l o .~914 i ,i~ i4*l ,~51~ .~l Y]2 ,i~1ii ,47~)] , ~.%11 ] , *lli~ilO ,i]O Y~ .t20~ w41 I'll , 4 0 3 4 +.
I ~ , v+O #~t+Yil , ~,a~ib . ~v,~ JO ,~09~0 ,~**0 ,.~1,7? , *Vlil , *l,lVil **l~ll , *4,~92 ,*b]i , Q*~blmS
OllO +tMUI 4 .7iqi~ .~7i04 *biODI ,6)727 , il~841 , bil) i 4 , ~ l ~ *5~d~l 7 1 5 4 M .~il , b~(~]3 *~11~
~,90 *015~4 .7*~++ I . "/U 7 ' ~ ,lY011 ,650~1 *129ii * 614l I *ill 115 * 410~ ,5~(J* r , ~ 8 I ~40 I lY~il
1,00 ,1~*1 ~ * , ?U(J J 3 ,7,~7"+ * 71 bill * I*~02 ,IlYUI I ,6,~++1 *IblYl *1~4 * O ) .1.+I I ~1, ,~2305 , I il34
20 , ~'UiV~ *~6~4 U0~.14 ,74124*** * Yb,+hO , Y~Oli , ).~ ? i l ' ) , }~1,*02 * /I 7,,~ ,7010'~'~ ,~OI~V ,4,'04]01
1,40 I.VI~24 * I~OllZ .i9761 *e]V] 0 ,1J2067 *Bl~5| ,IS02 I0 *YQ~2 * ?N4QS ,7170 *YY|I] ,711111
1,50 ,'/])IV , ~4J i ~ 1 , H P'~UI , ill )ill * l~Vls0 *U~VYl ,U]O~S *l~ll ,I141~4 . MO~i , IS02I? , }~i?~
i*UO ,*~biOY , ~,~+t *~14 *~12102 *vi2.3 ,UODli ,41~176 ,I~1] Ii , I~INI04 ,4~10 J * ] ,IT,ll *lYl]U
[*90 *vll~l i +,~47 ,~*,20 , Qjb)i * 9~OI *V~I?~ *0ii)~ *V|IiS , VO?Oi * VOJOi ,149V )i , UVil~il
I,(~0 V?Y,+b , .+~'S , 2 ,'+~*Vl4 , "~* l i l ' ~ ,Viii) .9)1,,4 *9314Q ,0~7]] v1)b? ,~2014 ,V*iVO ,VS40,1
~*10 *~+n?l* ,V~I~U ',,+*0,+ ,')~ ;VO ,9"~21 ~ , V,la34 , ~044,13 , ~,*IOV.) V];Y5 . v3,1113 ,9]t4 ,,1Via
2,40 '~'~* U O ,*~U~O .'~U.~ J ~Y~9 ,'+14*6 I ,slY*all *9120* V?OOil , Viklll5 ,960~V * ~*~04 , ~113~0
~,50 ' J ' J ) 7'~ * ',U*lY~ .910~63 *'~m*O& * 04~1 n S *~y~I ,9711ill ,9YilSl ,'+1%12 ,91)78 , QYt~i *~?l]t
&ilO * + + ~ J,+ **~*'~3 , ~u~UI * ++(I Y S O * vii408 fl,4~1 .V413l ] 9111 i 4 k ,91O70 .~ Ivl,~ , ~Yl4, I ,VYGY
Z,TO *'~*+l, h 3 *')*~4 I Y v~JzJI ,VVO14 ,91~9.~9 91~811 ,~Yog , ~l+l I O ,~e'v* 0 ,')104 ~2 , (* ~PJ~ ,~II~Y
i,ao .,ivy*** . ,~,~ ? . ,~,~.~ vv]o~ . ~,v~oo .gvlo? ,~0+1 ,9og,~l ,iNI/I ~3 . vl+lsOd, .~743 . V41~.4
3.(~0 , '~'~Ut+'~ * '*'+ 11~7 ,vv~a WIll V ~+V~V gvilOS ,9'V4~ ~V410 , WlI]I,? . ++V3241 * V~+;VO * +/'0milil
3.10 *'~V'~Q] , ,~*++u.I,: ,v~Y~ .vv?i] vv4.yil , vvl~3? ,9v41.00 , ')~:.4,5 .vgil3] ,'+V~02 ,VV,I 1~ ,V~4411
3,20 *vv~JI ~'~u4sG ,.#vlsJ~ , vVl~O v v Yi~T VV)~6 , VVOV V~il] , V~iltaV ,VV&]I ,11)94111 ,9~10~4
]]G **~Vb4 *~'Vl~ . *+'~OU,~ , VV++DY * VVO)3 ,+~Vlll I * ~,97~10 *99771 , V'~ y i l 4 ,e9711 ,v@721 , v~110~
3.40 ' , ~1,1, . VVV*IO , v v v * *t , ~,VU*~ * '~,1'411 ~VI~6 . ')V*+DO , VV41~) ? 99112+I ,VVUl I . ~'Vll O0 * V +/iV
3.1~ VV'~I Y , VVv~II . ,,vV* J 9VViV , v v ' ~ l I~ . vv+,+05 , v011V~ , V'kl4114 , V',~I 7 . v V i l l , l~ , vVlslO , V'illllO
Tables for order statistics 903
Table 4 (continued)
p =0,900
C
2 3 4 5 ~ ? 8 g I0 |l IZ
3.~0 ,ooea3 ,oo009 .oooQ~ *qo00~ ,oo0o3 .00003 ,OOOOZ .00o0~ .ooooa *ooooZ ,oooo~ .ooooz
),40 ,00034 *00014 *00009 *OOUO~ ,0000~ *00004 ,00004 *00003 *00803 *o0oo~ ,0ooo3 ,0ooo~
3.30 .00048 *000~0 .000|3 *00010 .o000e .00007 .00006 *00005 *OOOO5 *0000~ .0000~ *0000~
3.~0 ,O0069 *000~9 ,0OO19 ,00014 ,0003~ *00010 ,00009 .O000a ,00007 ,00007 ,00006 .0000~
3.10 .00097 *000~3 .00028 *OOO~| .00018 *O001S *00013 .0001~ .00011 *00010 .00009 .00009
3*00 *0013~ ,000~i *00041 ,00031 *O00Z6 *O002Z .000~0 *0001~ *00016 *00015 ,00014 ,00013
a.9o ,00187 *OOUfi7 ,00059 *00086 *00038 *00033 *00029 *000~6 *00024 *000~3 *O00Zl ,00020
2.80 ,0025~ ,00[~2 *0000~ ,000~6 *00055 .000~8 ,00043 *00039 *00036 .00033 ,0003| *00029
*70 *00347 .00]~0 ,001|9 *0u094 ,00079 ,000~9 ,0006~ ,O005& .008~2 .00048 .0004~ ,00043
Z.60 *00466 .00235 *O016b *00132 *oolia *00098 .00088 .oooao .00074 .00069 .00068 .00062
~.30 10062[ *00:|~2 *00230 ,00i~4 *00t57 .00138 *00[~4 *OO|14 ,00106 .00099 *00093 *0C08~
2,40 .00820 .00~36 *00313 .00255 *00218 .00193 .0017~ *00160 .001,9 .001~0 *0013~ *0012~
~,30 ,01072 .00585 *00~Z8 *00349 *00300 ,00266 .30~4Z *002=3 *00~08 *00~95 *001~ *O01~b
2*20 ,01390 ,00778 *OD~7~ .00~7~ .00~09 ,00365 ,00332 *00307 *00~87 *OOaTO *00236 *0024~
~.JO ,01786 *01024 *007~8 *00~35 *00552 *00494 *00452 ,00418 *0039~ *00370 *003~| *00335
2,00 ,0Z~75 ,0|3~6 *030|3 ,00844 *00738 *00~63 .00608 *0056~ ,005]0 *00501 ,00~77 *004~
|*90 *0~872 *0~727 .013a5 .OIIl| .00976 *00882 *00811 .00755 *OO~lO *00673 *00~4~ .00~|~
1~80 ,03593 *022|1 .017|5 *Oi~SO *01Z80 *0][60 ,0107| ,01000 *00943 ,00895 *0085~ eO08~O
i,70 *0~437 *02806 *0~201 *01873 *01662 *01513 *0[400 *013i1 .0]a39 .01179 *Oll2~ *0[884
1.~0 .05480 .03527 ,0~797 *02397 .0~138 ,01953 *01814 *01703 ,Ol6l] ,01538 .01473 .0|418
1.50 ,0~81 ,0~393 .035~2 *03039 *02724 ,02499 .08327 ,02191 .020?9 *0i986 *0|90~ ,01~37
1.40 *08076 *034~7 ,0~395 =3]B38 *03429 *03[~6 ,0~9~7 *08/91 *0265~ *02541 *0Z4~3 .0235m
[,30 .09600 *066~I *05]~ .04752 *04300 *039]~ ,0~723 *0352] *03359 *03280 .0310| *02998
].20 ,|1307 ,08056 *O~bb9 *058~0 *0~3~9 .0~94Z ,04~44 ,04406 *04209 .04043 .03900 *03?76
|.00 *i5866 .11549 ,0973~ .0U615 ,07961 *0743b ,07028 .06700 ,06*2? *0~9~ *08996 ,O5822
0190 ,1840~ *|3652 ,lib15 ,10415 ,09&80 *08998 *08529 *08|49 .0?833 *0?565 .0?]33 *07130
0,80 .21i8~ *16o02 ,13740 ,I~]95 *114~4 ,I079i *10~57 ,09823 *09~62 ,09153 ,00881 ,08652
0.70 *24196 *18602 ,|6116 ,1462~ *I3SgS .12029 .1~286 *~[736 .113~6 *[097~ *|067~ *|040S
0.60 ,2742~ ,~1448 ,]8?46 *[?lO? ~13970 1~][8 [444? .1]~99 *[3439 *[3046 *|Z703 ,12402
0.~0 ,_%08~4 ,24533 .2[~24 .1984~ *18~01 *17&65 .169~S ,;~3|i .l~a08 .15370 *14989 *14652
0*40 .344~8 ,27~41 ,24743 *22829 ,214~4 .2046? ,[9659 ,18995 *|8435 .1/954 ,|7533 *|?lbl
0~30 *38209 *31352 *~80B8 *~6051 ,246il *23517 *~2645 *2[926 .21318 *20794 *28335 *|9929
0,~0 .~207~ *35040 ,31636 ,~949~ *~79b? ,2~03 ,Z587[ *25100 *2444? *Z38~3 ,23388 .2~949
0.|0 .4~017 ,38~75 .3~3~1 ,]3129 ,3[330 830304 *29319 ,~8~08 *27808 ,~7207 ,2~m80 *26218
0.00 .soo08 .4~822 *39233 ,36931 ,3527~ ,3399~ .329&? ,32110 ,31380 .30?4? ,30|B9 ,~9~V3
3 * 5 6 7 a 9 ]O II IZ
0.00 50000 .42822 .39c33 .369][ .35z74 .33~96 .J2967 .32[18 .am300 .J0747 .30189 .29693
0.10 *53983 *~6041 *3213 .40(366 .39185 .37~40 .38783 .35894 .35135 .34473 .33092 *33373
0.20 .579Z& .50892 *87263 .~4894 *~3)~ *~I8Z5 .40734 *39822 .3~0~0 *36359 *37758 .372Z8
0.40 ,6554~ .589~3 .551~05 *53008 ,~[3~ *~OOr'L .~08~4 .4795:J ,~7131 ,46450 *45829 .4~@
O,SO *69146 ,b~8~5 .5981Z .57129 ,~3~31 ,5~CQ@ *52998 .~072 ,51273 *~057~ *49950 ,49390
0,60 *725~S *~6597 ,63323 *6[I*~ *fq'~eZ ,%0i5( ,~7879 *561&n .35381. ,%46n9 .e~07a ,9~517
0.?0 .738G~ .?0~09 .67090 .6~'982 .~33(33 *~130 .~1086 .~OZOO .59~33 .~075~ .58)3[ =57~07
O,RO ,7~81~ *73~31 ,70708 ,6n703 *b?te9 ,~%9;J0 ,~970 ,6~124 ,63303 *b2720 .8Z1~3 ,61615
0890 e81~9~ e76f~40 e?~Id~ ,7?2~I .70~31~ ,~9070 .GPTIS .67905 *~7397 ,66~70 ,6&009 *~502
[,00 *8*|~q *79818 .77317 *?~bY? ,7{,2al *?~l~ .7227~ .71508 *70839 .7024~ .69734 .6023~
I,[0 e 8 ~ eS~Sa e9077~ *7{~07~ *77A4~ *7~t~a .7661~0 ,7=900 .7~. 0 ,73725 .73~Z6 *72773
[.20 *88~93 ,8504Z ,@~988 .81333 *OOqL[ .79~00 .78734 ,78075 .77~,97 *76982 *76319 *7~000
l*~O *903~3 .87281 *~3~.9 ,S~[41 *031~7 .92500 .8[~03 *'fOol *BO~?a *80000 ~79573 *79|8~
1.40 *9|98~ *89275 *~7~58 .9649~ *8~h87 *04['~ .8~17 *83~7~ *83195 *88767 ,82380 *eZO~8
|,DO .93~19 ,91033 ,89bEO *8869~ ,87793 *Q713z* ,~57~ .8~389 ,85~b0 ,O~aT~ ,09929 *eqb[]
[,bO *~5~3 ,9Z368 ,911~5 ,908~3 .~9751 ,e')lll *.OGZ8 *88248 .87808 ,e?3n? ,87~18 ,e~936
i.70 *93543 *93093 .928~7 .9~07~ ,9]~69 ,909~'3 *~033~ ,9815) *eO~D~ ,$9~6 *89234 .80005
1.80 ,96~0~ *95025 *99139 .934~3 *~2%02 ,92~28 ,E~2356 *91838 *91~q~ *01288 ,@1045 *908~8
[*90 ,971~8 ,9398~ .957~1 ,9~,687 .9',~5 ,9387'~ ,93~58 *93280 49303~ ,92308 .92~04 .98417
a.oo ,97725 .9~8~ .9~17o .9~7os .9~33~ ,9%o~5 ,9~757 ,9.521 .9~310 .9~X20 .939~ *93786
~,IO !*982~ *91q%1 .9~986 .96563 ,9~235 ,95,;,3% . 9 5 7 7 1 ,95573 .933~6 ,9523~ .9~090 .9~95~,
~,~0 ,98~[0 ,g?')~7 *9750~ .'37273 ,~7020 .96~0~ .9~620 *~b~ *9O300 *9~]?~ *96054 *959~]
a,30 .98928 ,98~a0 *985[0 .978~7 ,97O~0 197876 *973~ 192189 =~?o&S *~9~8 i96858 i~7~ 4
a,4~ .99180 .90796 *9ft533 .98J31 ,gUt6G gSO~ *9790t ,9779~ *97~93 .97~03 .975Z1 *974~5
~,50 ,99379 *~90~0 .913813 ,~Ct]2 *90~0 ,9~7 ,90368 ,93281 ,9(]20l *901~9 *90003 ,93001
a,60 *9Q534 ,~30] ,99192 ,9~013 *9091i ,909~2 *90784 ,90&74 ,986[~ ,90553 *90300 ,98~31
Z,70 .9~5] ,'99477 *9935w .992~4 ,T9173 *99103 99042 ,00987 ,~937 ,988~32 ,90853 ,980ii
a.80 .99744 .99611 .99516 .99~1 .9037B .9932~ ,902~6 .9923~ .90195 .99tC9 .99J~7 ,99096
Z,90 *99833 *99718 *9964~ .993133 *99337 *9989~ ,90439 .9O~2~ *99390 *99368 *993~3 *99319
~.00 .998&5 ,g~-191 ,9~)737 *996r1~ *996~8 ,9~627 *9959'# ,9q57#, *@Q%51 *99530 .99510 e99492
3.[0 ,09903 ,990~9 .9,)~09 .99777 *997fi0 ,99727 .9~706 .~607 *99669 ,9~3659 ,99~39 .9962b
3.Z3 *9993t .9909Z .99S63 ,99839 .9~1~ ,9~,r,o~ .(;qTU~ .99777 *9')~%) ,997~ *9%~73~ *9'}/?~
3*30 *99952 *999~3 ,9'#()0~ ,~,)[t~ ,99~170 .99~t,7 ,9()841, .99~36 .9~82~ *908]? lg~)~o ~) *9~)AO I
3,~0 j999~6 .9~9~ *99~31 .99919 .99~)08 *99~99 .99590 m99~83 ,99B7~ .9w8~9 .99~3 *9{)8~
~,~'0 *99977 .9'~963 *9993Z *999~ *99935 .99929 ',999~3 .999i7 .99~1~ ,99907 *99003 .qgf199
Table 4 is reproduced from the Annals ofMathema~cal Sta~s~cs with the kind permission of the
Institute of Mathematical Statistics.
904 P. R. Krishnaiah and P. K. Sen
(~ ~(p, a, z ) f ( z ) d z (5.1)
J-o~
where
, a, z) = j - W---Z-=
j
dddddooo~odddddgdgJdd
O ~ - O ~
0 0 0 0 0 0 0 0 0 0 0 ~ 0 0 ~ ~
dggddddgddd
~ 0 ~ 0 ~ 0 ~ 0 0 ~ ~
~ 0 ~ 0 ~
~m~mON
. ~ ~~:~...
O 0 0 0 O O 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
~ 0 ~ 0
~ 0 ~
0 0 0 0 0 0 ~
0 0 0 0 0 ~m~ ~ m
~0 gddg~dddgddgdgdd~
. . . ~ ~ ~ . . . . . . . . . . . . . . .
~ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~
~
;=
zj;;g~gj~dgddgddddgdddgdgdgddgg
H
906 P. R. Krishnaiah and P. K. Sen
.OODffg,,,~
ddgd dddddddd~g~dddgdddg~gd ~ 0 0 0 ~ 0 0 0 ~ 0 0 0 0
t~he~l
0 0 ~
dd~d ggggdggdggggg
,..~e~, 0
O,Oe'.~
~ ~ ~
e l l ~ i t l l i ~-~-~~~~
I i 1 1 ~ i 1 1 1 ~ i I i l I
Tables for order statistics 907
~ ~ O ~ m ~ m N ~ o
ddd~d o o o o d o g o d d d g d d d d d d d d d d d ~ d d d d ~ d g d g
d~dd ~ g d d g d g d g g d ~ d g d g g g d g g g , ; g ~ d d d d , ; d d d g
ggdo d g d g g d g d d o d d d g d g d d d g d & d g d d d d d g g d d g
ggog d g g d g o gddo g g g g g g d g g g g d g g g g g g g g g d g d
~O ~0 a3 ~0
&g~;o g g g d g d d g g g g d g g g g g g g d g g d g
~ " .
C~ O0 0 C ~ O
0 4 ~
Q~Nm
gg~d &ddo d g g g d g g g g g g d ~ d d d g g g g d g d g
~ e e
~ w m ~
e e ~ l l S ~ l e ~ l e l
ggod ooooooooooggggggdgggg;gg
e t e e
0 0 0 0 ~ 0 0 0 0 (
~o~ gdgodg~ggdgddgdgdgdg~dddggdd
e e e e e e e e * e u ~ b e e o e e e e J e , e e o e o * e e e e e e e e m
908 P. R. Krishnaiah and P. K. Sen
o l o o i o
0000000000 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
,i.m,oI0ml~~,.,04 ~0~00~0~-~ ~ o ~ o ~ m ~ ~ o ~ m o
eeee eoee
0 0 0 0 0 ) 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
~ O ~ O m
ooooo.
000000
R ~ O 0 ~
e o e 4 e *
0000000000 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
NO~
~o~'~o
0 0 0 0 0 0 0 0 0 0 ~ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0
I t l t l t O O i l * * e e * ~ e e e ~ e e . e e e ~ e e e e 0 o . e e . e
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
m O N ~
0 0 0 0 0 0 0 ~
dggggg ggg;ggggggdggdggddg;gg
O ~ ~ m ~
~ ~ N ~ m ~ O ~
0 0 0 0 0 ~
~dgd~dg~dg dg~g;g~ggdg gggOg 0.0. .0. ~ g 0.0. .0. 0 gO*O
~ O ~
g~dd ~ggddddgdggdgddddgdggg
0 0 0 0 ~ 0 0 0 ~ 0 C) O 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~ 0 0 ~ 0 ~ 0 0 0 0 0 0
~ 0 ~
gdgddg
~r
Tables [or order statistics 909
+~,+t+"
++t++++++~++++~++ 0*0, O,
g d d gdgdgdddgddgd gg~dd
.. ~ . ~ . ~ . ~ .
0 0 0 0 0 0 0 ~ 0 0 0 0 0 0 ~ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~ 0 0 0 0 0 0 0 0 0
ddgdgg~ddgdddd~d~ddd~ggd~g~dgggdg~dgg~gd
~ 0 ~ ~
,:dddd
,~d,;dd
~ * l o
000
:~[o
gdd
~ t~,4~f~fq
ddd
,;,J,+
910 P. R. Krishnaiah and P. K. Sen
d~gd~dgdg~dgd~
gdddgdddg~dgd~gggd gdgg~ggggg~dd~d
~ e e e e e e o o e e e e e e e O t e e e e e e e t e e e e O e e o
0 0 0 0 0 0 0 0 0 ~ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
e e e o e o e e e e e o e O e e e e e * e e e * e e ~ e * o o * e o
0 0 0 ~ 0 ~ 0 0 0 0 0 0 0 0 0 0 ~ 0 0
#m
gdgg~gg~d~gd~gdgg~ ggd~dggdgddddd~
~ ~ 0 ~ ~ ~
d~ddd~d~ddgd~dg
O~
"~ WOg
~ O 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
Tables for order statistics 911
Ov-v,-~)
ddd~;d~;d
0 0 ~
0 ~ 0 ~ 0 ~ 0 0 ~ O 0 0 0 ~ ~ N ~lr~mr~tm4. ~lr4,4, t~t~u~f~t~O~
ddddd~ddd ~gdddd~dgdg
.oo~ ~
O ~ N ~ ~ O ~ m ~ m ~
~~ O ~
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
O 0 ~ N ~ O 0 ~ O ~
g~gddgdddddgd~ddd;gd ddgddggdd&dddggddd
~ ~ 0 ~ ~ ~
0 0 0 0 0 0 0 0 0 0 0 0 ~ 0 0 0 0 0 0 ~
f 1 4 ~ ~ e 4 4 ~ m N m N N 4
e o e e o e o e e e e e e e o e e e e e e t e e o e e e e e o e o e e e ~ *
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
+ ~ ~ m ~ o ~ m m ~
~=g~~'~~=~,.
O~m
0 0 0
g ; ~ d g ~ d d d ~'~dc;gdgggc;g g ~ g d d g d g g d ~ g ; d d g g d
gddddgg~d~g g~ddddggdddddgdgd4
..~.~ * e e e e e e e e
0 0 0 ~ 0 0 0 0 0 0 0 ~ 0 0 0 0 ~ 0 0 0 0 0 ~ 0 0 0 0 0
0
912 P. R. Krishnaiah and P. K. Sen
~~. ., .~ ! ~ ! ~ ! ~~ ~~ i ~ ~ i i ~ ~
~
~ ~ ~ ~ ~ ~ ~ ~
f*..~O(P,~if~l',..~eOIf~f'..l',..4"O'...4f"-~,.~'.4~f'~
0
e e o e
0 0 0 0 0 0
o e *
0 ~ 0 0
e e e o
0 0 0 0 0 0 0
*
'JO0000000000000000000
~.~.~.-.!~!~.
~ ~ ~ ~ 0 ~
~ O ~ O ~ O ~ I ~
gdg~dddddddgddg~gddg~ d~dgdgddddddgdgddd
~ 0 ~ ~ ~ ~
o o o o o o o o o o o o o o o o o o
gdddd~ddddgdddd4d~ggg ddd~gdddd~dddgdggd
~ ~ 0
~d~gdgdgdd~ddd~ggd~g ddddddddd~dgdgdddd
gdddd~ddddgdd d.dddgd~dddd~dgddddddddd~d
e o o e e o e o o e o e e e o Q e e w o o o e o o e o e o o o o 6 e o o o o e
Tables for order statistics 913
~ o ~
dddddddd d~ddd
~ 0 ~ 0
dgddddddgddgdddggg~ddgddd
I,O,~*O~O.-*Sf~m
,i',OmO,..*;*q4"W r-O0, O,,-*
0 0 0 0 0
e e o e ~ o o o ~ o e e e ~ e e o e ~ o e e e e
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~ 0 0 0 ~ 0 0 0 0 0 0 0 0 0 0 0 0 0 ~ 0
~ m ~ O ~ O O m ~ ~ m ~ m ~ ~
~-~o~
dd~ddgd~ ddddg gddgddd~dggdgddgddddgggdg
g d d d d g , ; g g g g ; g g g g l ; d ; g d g d d d ; . ; d d . ; g ; g ; d g g.g
ddddd gddgdgdddgggggdgddggggddd
dddddddddddddddddd~ddd~ddddd
)O00001;)O0:'300,.'~."+,"*,--+,'-+~lr,+NN+qr~
,;~;ddddd,;~,d~;ddd~gddddddd d d d d g d g d d g d d d d o
0 0 0 0 0 0 0 0 0 0 ~ 0 0 0 0
dd~dddd~;gdgggg;dgdddg
dd~dd~g~dddd~d~d~dg~
d ~ d d d d d ~ d d d d d d d d d d d d d ~ d d g d d d gddddd&~do
gdd~g~ddg~dd~dddddddgdddgddgd;dg~ddd~
d~;~ddddd~gdd~d~gddd~d~d~dg~ggdddd~
0
Tables for order statistics 915
m,ON
~;;;c;c;
~ 0 ~
~DO.-*O
~ O m ~ r..o*~.4* ~ 0
. ~ .
O 0 0 0 0 0 0 0 0 0 0 0 Q O 0 0 0 0 0 0 0 ~ O 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
Nm~
dddd~ddddgdd~d~ddddggdddd gdddd
m~Om ~ 4 ~ 0
m ~ m
. . . . . ~ ~
O O Q G O O Q Q O 0 0 O O 0 0 0 0 0 0 0 ~ O 0 0 0 0 0 0 0 0 0 0 0 0 G O 0 0
ql~OlOaO
dd;c;
e 0 0 .'-*
e e e s g d g d .~dc;g
,;dd;d 0 0 0
r,- P. o a0
916 P. R. Krishnaiah and P. K. Sen
~ ~ ~ ~ ~ ~ - ~
gdddgdddddddddddddddddddddd~dddddd
~0~
~ggdddgddddddddddddggdddddgd
0<~
~d~d~. dgdddd
0 0 ~ + ~ ~ + ~ 0 + ~ + ~
~ ~ ~ ~ . ~ ~
~ m O + ~
* e e e o o o e * e e o e e e * ~ e o o @
O 0 0 O O 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
f".~OCl"r,.~'~
~I.OQI~OP..
0 0 0 0 gdddd~ddddddddddddg~dN~dggd
~ ~ 0 ~ 0 ~ ~ ~
dddgddd d g d d d d d d d d d d d g d d d d g ~ g d d d ; g g
t3 t ~ dddd~dd d d d d d g d d ~ d d ~ d ~ d d g d d d d d d d g d d
Tables for order statistics 917
~ N N N N
m m ~ o
0 0 0 0 0 0 0 0 0 0 ~ N ~ m
~d~d~gd~dgg~dgg~dd~ g~dgddd d~ggdd
~ 0
++&++~g++g+d4+++++++ dgddgddddgg
~ O N ~ m ~ O ~ m m m ~ O ~ m m
ggg~dddd~dggdddddggg gdg~;gdgdgg
~d~dd
0 ~ e m e m
g~d~gddd d~g~d
N ~ O m ~
918 P. R. Krishnaiah and P. K. Sen
~'"~ ~ ~i:~ ~~
~dddddd~ddd~dd~ddd~d~dgd~dd~d dgodo~d
m~
~ON~
oo=ooooooooooooooo,~o,',o ..::,::g,;;,:;:;~;
ggdgd~ddgddggg~gddg~g~;~gggggdgg ggdggdg
~ ~ N O ~ N ~ m
~ O ~ m ~ O ~ ~
~ m ~ O ~ N ~ m ~ m ~ O 0
ggdg~gdggdgggdggdgg ~ggdggggggg~g
~ m ~ ~ O N ~0
~ m m m ~ m m m m m m m m m ~
~ggo~dgdgddgg~g
* 0 ~
-Of"-::.- ~.-~
dgdgggggdg;g~;;gggg
~dddgg~ddgdgg~ggAd~ ~gdAggJgddgggddgdggd
gd~gg~gd~gddddggddggg~gd~d~ddggdg~g
Tables for order statistics 919
~ N ~
~ 0 0 ~ ~ ~
d~ddddddd~g~d~ddgd gdgddddd~d~g~d
ddgg~;gggdgggddddg gddggddd;gddgg
gd4 ....
0~00 dgggd~,~gggg ggggg~ggdg.; gdg~dgggg
d d d & d ; d ~ ~dd ; 4 g d d gd ; d ; d ; d g d ~ d d ;
t
920 P. R. Krishnaiah and P. K. Sen
0 ~ 0 0 ~ 0
mm++m-++0 0 0 0
~dddd~d~ddd dddddd
~ ~ O N ~
~ 0 ~
ddg~gdddddd ddddgddddd
~4,0e'-,4',0
0 0 0 0 0 0 ~
,, . ~ e e t e e e |1 eee
0 0 0 0 0 0 0 0 0 0 O 0 O O 0 0 0 0 0 0
.... ~ ~
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 o
0 0 0 0 0 0 0 0 0 0 0 O 0 0 0 0 C , ~ O O 0
O ~ m ~ m ~
dddgdddddd dddddd
ddd~gddddd ddddgd
~mm
m ~ m m ~ m ~ m ~
dgd~gggdd
O~
dddd d d d d d d d d d d d d d d d d d d d ~ d ~ d d ~ d ~ d d d ~ d d d d ~ d d d
~ ~ o ~
gd~ddddgg dgdddg
d d d d d ~ d d d ~ d d d d d d d d d d d d d d g d g g dddddd
dd~dddddd
922 P. R. Krishnaiah and P. K. Sen
~ ~ 0 ~ ~ 0 ~ ~ 0 ~
+++++++++++++++++++
g+gddddgd++g d~d~
~0~0
g++++d+g+gd+ ++d+
dggdgggggggg &dggggdggdgggddggggggdd
~ m ~ O ~ ~ O ~ ~
+++ o+++~+++o
ggggdggdgggdggdd ggdggggdddd;~gggd~g
O 0 ~ ~ m ~ ~
gdggggggg~gggdgg dggggg~d~d~g~gdggg
gggdgdggddggggdg g+dgdgg++g+ggggdggd
~ ~ + ~ + , ~ N ~ + ~
~ O ~ m ~ ~ O ~
~ m ~ o ~ ~ O ~ m
~m
++g+g++,;++g+gdg+ +++++++++++d+++++d+
+~o-~-+-++~+;~++~++
++.+.+++
+,.;gg++g+dgd+d+d+
+-
"~_.,
+.,,
,+++~ d & m + d + + + + + + + + + ~ +
Tables for order statistics 923
dd++dgddddddddddddddddd+ddddd+dd+ddddd
+++,+=+o+... )~+)+ + +)
.+~+o -+++o~+7~
d~ddddd~dddd++~+d+ddddddd+dddd+ddddd
o o o o -
i i i i i
'''**i
i i i i i
i':~
~ttt
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~ 0
:::::=:+m++-+---~-++:++=+~+m--o~++++++o~.+o.++o
+~.
+ ~ . + N . - - O . + O +
+~o+o-~++---+:m+++++++mm+m==m+++-+~+~.
++.. . + .+
~ 0 ~ ~ g*(l~N41f~if~,..e44*~mlol~mO*Nif~*om4f~lm~m~m~m
,,4DOT,,+
-+.+~++!+++++~,~,++++~++++++++++++.~
dddddd++ d d d d d d d , ~ d d d d d d d d d d d d d ~ , d ~ , d g d d d d d
I~P,*h*lDmm(OlO
ddddddd,~ dgddgddddd~dgdddd~ddddd
m ~ O ~ m ~
0,1,..010++1~10~.,,*0,0101,,.4. I,,.10
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
~ ~ m ~ ~ o ~ ~ ~ ~
o.++o--'+:++++~+
:+++m -+- ~++++-++. .++
+:+:+++++~+~;~+++|+
gdddddgd
+++++++++++++!+~++++++!+
dddddddddddgddd
dddgd~d~dd~dd~d~ddddddd
c:
0
Tables [or order statistics 925
dgddgdd~dddddddd~dddddd;dd
dddddddgddgd ddddddddddddddddddddgddddd
I I l l o I
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 O 0 0 0 0 Q O 0 0 0 0 0 0 O O 0 0 0 0 O0
0,0,
~ ~ ~ . . . .
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~ 0 0 0 0 0 0 0 O0
~ 0 ~ ~W
* * * e , l e o * * e , o * ~ e q e
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~ 0 0 0 0 ~ 0 0 0
dddddggdddd d~
mm~00
o~ o , ~ o
d;,gdddddd;=d
o i i 0-0 ~
0"0'
dd
O"
d d d g g ~ g g d g d d g d ~ d d ~ g d d d d d g d d d d d d d d d d d dd
.o0
N~
dd~gdg~dddd gdddd~dddddddgdddd~dddddd
~ m ~ m m ~ m m m ~.1 ,-e
,.-e ~,.e
926 P. R. Krishnaiah and P. K. Sen
+++o+:++~+~
,,Ir4"4,1r* II'~11~,,0,4
+:+++==~ff+
+ddd,~dddddd ddddddgddgd
---+ ++++
d+dddd~+ddd
m O m O ~
e e o
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
o e e t e e e e
+!++
0 0 0 ~ 0 0 0 0 0 0 0
. ~ . ~ ~.
i l l l l I I I I I
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
f e l l l i f o 11
0 ~ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ~
gdddddd~dddgddgddddddd
ddd~d~dddd~gddd~ddg~gd
N ~ ~ O ~ ~ 4 ~ ~
0
2~
Tables for order statistics 927
,;~;;(;~;,;;(;;~;;
~ ~ N
~dgdd~''ddgoo
~gdg~
o m m o 4 ~ 0 ~
~ 0 ~
ddgdggggddgdgdg;dddggd dddgdgdddgg
~.~. ~. ,:
0 0 0 0 0 0 0 ~* O 0 0 O00 f3 ~ 0 0 ~'~ C~ O 0 O00 ~ 0
gdggdgddggggddddgggggg gdddgddgdgdgdd~g~gdd
~ddg~d~gddd~gdg~g~gdd~
0 ~ ' 0 0 0 * 0 0 0 ~ : . 0 0 0 0 f 0 0 0 0 0 0 0
928 P. R. Krishnaiah and P. K. Sen
d~gddddd~d~ddddgdd~dd dddddddddddd
ddddddddddddd~d~dd~ddd
O O O 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 O O O O 0 0 0 0 0 0 0 0 0 0 0 0
~ ~ ~ 0 ~ ~
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
e e o e e e o e t e o e e e e o e 4 e o o o e e e e e e o t e e e e
~ O O O 0 0 0 0 0 0 0 ~ O 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
dddgddggd~d~dd~g~ddd
dddd~ddd~dddd~d~ddddd dd~ddddddddd
dgddgdddddd~ddddg~d~d
~d~dddg~gdd~
~J
r~
0
Tables for order statistics 929
~m~O~
+ddd+ddd+dddd+ddd+dddd++ ddddgddddddddd
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 O 0 0 0 O O 0 0 0 0 0 0 0 0
04
~o~m ~ N~N
~ 0 ~ ~ ~
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
,.- ,- ,-. ;=
l t ! I I i I I
~0
e e e o e e e e o e e e l e e
~ . ~ ~
OQOOOOOOOOQOOOOOQQOO~QOQ 0 0 0 0 0 ~ 0 0 0 0 0 0 ~
ee+ e t v e
o+~ ~'~
g~gdddddd~dgdd
<~
,++++~ ~ g ~ d ~ d d d ~ g d d ~ d g d d d g g d d ~ dddd~gdd;ddddd
,.~ ~L
930 P. R. Krishnaiah and P. K. Sen
me~mq~Ol,,.4.1.-.-eom,om
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
;~t~-+++
I I I I I I
0 0 0 0 ~ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
+l +l +i +l
!++i+ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
O~ W~
~+z+++ m++- o.
.+++i++ "+
O 0 0 0 0
O~r~
. , + N I M l'~i'n~P ~'+~l'illl
O"O,O'. 0'0"0'0' 0'0"I'
d d d d d d ,;dmd,,;~,,:;dd<:;,:;,,:; d g d g d d d g d ~ d g d d d d d d d d d
++)+++=+~++
+d+ddd+++dgd+d+ddd+++
,,4P ',I' Vi
e. ~. ~.
O00
ii'~ *l's ~
. . . .
0000
4
O00
i!!+"+! !!!!!ii
lee
0o00
. e * * *
00~00 oo
-
+')+))
,,'+0
l',,.*i'k
,,+.41'
0 , ~ 0~
d~d,,;dd ,,; d ,,;d,,;;dd ,~,,;,,;,,;,;+;~,,,;ooooo'ooooo 0 0 0 0 0 0 0
Tables for order statistics 931
~ ' ~ ~ ~
dddddddddd ddddgd
e e e e ~ o o e e * e e e o e o e e e ~ e o e e o o e e e e e e e e o e e o
00000000000000000000000000000000 000000
~*ee~
~gddddgddgdddddddd~dgd 00000 d~g~dddgdd
dddddddddddddddddddd ddd~;ddd~;dd;;ddd,;,;~
dddddddddddddd,;ddddd d~d~dddd~ddddddd
~g~ddddgdddggddggdg dgd~gddgg~g~dggdd
ddgd;ggdgggddg~dddgd ddgdg~ggddgddddgdd
ddgdd-;d+dggg++dgg~gd dgd~dggdd,;dd~d~d~
0 ~ ~
dddddd~ggddddggddddd
dgdddd;gd~dddddddddd ddddd~dddddddddddd
mm~r<r
dd~ddg~dgddgdddddgd
0
Tables for order statistics 933
~ 4 ~ 0 ~ ~ 4 ~ 0 ~ 0
+++++?++
d~ddd~ddd~ddddddddd ddd~ddddddddd~ddddd~dd~
ddddd~dddddddddddddd
dddddddddd~dg~ddgddd + d d + + + d d m g d d d d g d g d d + g g d
dddd++dddd+dgdd+ddd ddd+d++++&d+dddd~d+dgg++
d~dd~gd~ggg~g~ddggdg d d g g d g g d g d d d d d d d d d g d d g ;
~ NN
dgdddddddd~dd~ddddd
+++':++'"++++++'++++,,+
+++,"+ ++++++. ' ~ ~ ~
~d++dddddd+dddd+ddd
,,;,:;.;d,:;.;.;,;
~ P ~ e e
ddddd~d~ddddd~dddddddd~gdg~dddddg
~ ~ . ~ ~ ~ + ~ ~ ~ ~ l l Q l l e +
934 P. R. Krishnaiah and P. K. Sen
ddddddddddddddd dddddddddd~dd~dd~gd
dgddddddddddgddddddddddgdddddddd~d
gddddddd~ddgddddddddddddddddgddgdd
dddgddgddg~ggddggggdggddgddggdgdd~
o e o e @ e e o o ~ e o o o e o e o ~ o e e O e e e o o t o e o o 9
~d~ddgddgddddgdd~ddddd~ddddddddddd
~dddgddd~dddd~dd~gd~ddddd~ddddddgg
* e J e o e o o o e o e e e e e e e o e e e o e e e o e o e e e e o
Tables for order statistics 935
References
[1] Breiter, M. C. and Krishnaiah, P. R. (1968). Tables for the moments of gamma order statistics.
Sankhya Set. B 30, 59-72.
[2] Dunnett, C. W. and Sobel, M. (1954). A bivariate generalization of student's t-distribution,
with tables for certain cases. Biometrika 41, 153-169.
[3] Gupta, S. S. (!960). Order statistics from the gamma distribution. Technometrics 2, 243-262.
[4] Gupta, S. S. (1963). Probability integrals of multivariate normal and multivariate t. Ann.
Math. Statist. 34, 792-828.
[5] Gupta, S. S., Nagel, K. and Panchapakesan, S. (1973). On the order statistics from equally
correlated normal random variables. Biometrika 60, 403--413.
[6] Harter, H. L. (1961). Expected values of normal order statistics. Biometrika 48, 151-165.
[7] Harter, H. L. (1964). Expected values of exponential, Weibull, and gamma order statistics.
ARL 64-31. Wright-Patterson Air Force Base, Ohio.
[8] Krishnaiah, P. R. (1980). Computations of some multivariate distributions. In: P. R. Krish-
naiah, ed., Handbook of Statistics, Vol. 1. North-Holland, Amsterdam.
[9] Krishnaiah, P. R. and Armitage, J. V. (1965). Tables for the distributions of the maximum of
correlated chi-square variates with one degree of freedom. ARL 65-136. Wright-Patte~'son Air
Force Base, Ohio.
[10] Krishnaiah, P. R. and Rizvi, M. H. (1967). A note on moments of gamma order statistics.
Technometrics 9, 315-318. f
[11] Sarhan, A. E. and Greenberg, B. G. (1956). Estimation of location and scale parameters by
order statistics from singly and doubly censored samples. Ann. Math. Statist. 27, 427-451.
[12] Teichroew, D. (1956). Tables of expected values of order statistics and products of order
statistics for samples of size twenty and less from the normal distribution. Ann. Math. Statist.
27, 410-426.
P. R. Krishnaiah and P. K. Sen, eds., Handbook of Statistics, Vol. 4 "~7
Elsevier Science Publishers (1984)937-958 .J/
1. Introduction
937
938 P. K. Sen and P. R. Krishnaiah
to adopt some of their entries in this set of tables presented in this chapter.
These adaptations from the original sources are made with the kind per-
missions granted by the following Societies and Publishers: Academic Press,
American Mathematical Society, American Statistical Association, Biometrika
Trustees, Institute of Mathematical Statistics, Marcel Dekker Inc. and Wiley.
where R + is the rank of IX~l among ]Xll . . . . . Ix, I, for i = 1 , . . . , n; by virtue of the
assumed continuity of the underlying distribution, ties are neglected with
probability 1. Note that under the null hypothesis that the distribution of X~ is
symmetric about 0, W* is also symmetrically distributed around 0. The mean
of W* is therefore 0 and its variance is equal to n ( n + 1)(2n + 1)/6. For large
values of n, one may use the normalized statistic
and use the standard normal distribution tables for computing the critical
values. In fact, this approximation works out quite well for n greater than 35.
For this reason, we shall consider only the range: 5 ~< n ~<35. Note that if ci is
equal to 1 or 0 according as Xi is nonnegative or not, and if
W * = 2 W . - n ( n + 1)/2. (2.4)
Table 2.1
Selected critical values for the Wilcoxon signed rank statistics
n k, P { ~ ~k}
5 O, 0.0313
6 O, 0.0156 1, 0.0313
7 O, 0.0078 2, 0.0234 3, 0.0391
8 O, 0.0039 1, 0.0078 3, 0.0195 5, 0.0391
9 1, 0.0039 3, 0.0098 5, 0.0195 8, 0.0488
10 3, 0.0049 5, 0.0098 8, 0.0244 10, 0.0420
11 5, 0.0049 7, 0.0093 10, 0.0210 13, 0.0415
12 7, 0.0046 9, 0.0081 13, 0.0212 17, 0.0461
13 9, 0.0040 12, 0.0085 17, 0.0239 21, 0.0471
14 12, 0.0043 15, 0.0083 21, 0.0247 25, 0.0453
15 15, 0.0042 19, 0.0090 25, 0.0240 30, 0.0473
16 19, 0.0046 23, 0.0091 29, 0.0222 35, 0.0467
17 23, 0.0047 27, 0.0087 34, 0.0224 41, 0.0492
18 27, 0.0045 32, 0.0091 40, 0.0241 47, 0.0494
19 32, 0.0047 37, 0.0090 46, 0.0247 53, 0.0478
20 37, 0.0047 43, 0.0096 52, 0.0242 60, 0.0487
21 42, 0.0045 49, 0.0097 58, 0.0230 67, 0.0479
22 48, 0.0046 55, 0.0095 66, 0.0250 75, 0.0492
23 54, 0.0046 62, 0.0098 73, 0.0242 83, 0.0490
24 61, 0.0048 69, 0.0097 81, 0.0245 91, 0.0475
25 68, 0.0048 76, 0.0094 89, 0.0241 100, 0.0479
26 75, 0.0047 84, 0.0095 98, 0.0247 110, 0.0497
27 83, 0.0048 93, 0.0100 107, 0.0246 119, 0.0477
28 91, 0.0048 101, 0.0096 116, 0.0239 130, 0.0496
29 100, 0.0049 110, 0.0095 126, 0.0240 140, 0.0482
30 109, 0.0050 120, 0.0098 137, 0.0249 151, 0.0481
31 118, 0.0049 130, 0.0099 147, 0.0239 163, 0.0491
32 127, 0.0047 140, 0.0097 159, 0.0249 175, 0.0492
33 138, 0.0049 151, 0.0099 170, 0.0242 187, 0.0485
34 148, 0.0048 162, 0.0098 182, 0.0242 200, 0.0488
35 159, 0.0048 174, 0.0100 195, 0.0247 213, 0.0484
the probability masses are multiples of 2-". A s such, for small values of n, we
may not have entries for all values of a (vizl, 0.005, 0.01, 0.025 and 0.05). For
example, for n = 5, the smallest nonzero significance level is 0.0313 cor-
responding to k = 0, while for k = 1, the significance value 0.0625 is greater
than 0.05. Thus, some caution must be used in choosing a level of significance
when n is small. It is recommended that instead of the conventional level, one
should use the exact level provided by the tables. For n = 35, using continuity-
correction and normal approximation, we have for k = 213, P{W, ~<k}=
0.0480 compared to the exact value 0.0484. Thus, the error is less than 0.0005.
This picture holds for all n >t 36.
For two samples of sizes m and n, we denote the sample observations by
X 1 , . . . , X m and Y1. . . . . Y,, respectively. Let R1 . . . . . Rm be the ranks of
X1 . . . . . Xm in the combined sample of size N = m + n; without any loss of
generality, we take m ~< n. Here also, ties among the observations are neglect-
940 P. K/Sen and P. R. Krishnaiah
U n d e r the null hypothesis (H0) that the two samples are drawn i n d e p e n d e n t l y
f r o m the s a m e population, we have EWm,,= m ( N + l ) / 2 and Var(Wmn) =
m n ( N + 1)/12. Also, for large values of (m, n), under H0,
n m W1, W2 a n d Olmn
3 3 6, 15 (0.0500)
4 3 6, 18 (0.0286)
4 10, 26 (0.0143) 11, 25 (0.0286)
5 3 6, I 21 (0.0179) 7, 20 (0.0357)
4 10, 30 (0.0079) 11, 29 (0.0159) 12, 28 (0.0317)
5 15, 40 (0.0040) 16, 39 (0.0079) 17, 38 (0.0159) 19, 36 (0.0476)
6 3 7, 23 (0.0238) 8, 22 (0.0476)
4 10, 34 (0.0048) 11, 33 (0.0095) 12, 32 (0.0190) 13, 31 (0.0333)
5 16, 44 (0.0043) 17, 43 (0.0087) 18, 42 (0.0152) 20, 40 (0.0411)
6 23, 55 (0.0043) 24, 54 (0.0076) 26, 52 (0.0206) 28, 50 (0.0465)
7 3 6, 27 (0.0083) 7, 26 (0.0167) 8, 25 (0.0333)
4 10, 38 (0.0030) 11, 37 (0.0061) 13, 35 (0.0212) 14, 34 (0.0364)
5 16, 49 (0.0025) 18, 47 (0.0088) 20, 45 (0.0240) 21, 44, (0.0366)
6 24, 60 (0.0040) 25, 59 (0.0070) 27, 57 (0.0175) 29, 55 (0.0367)
7 32, 73 (0.0035) 34, 71 (0.0087) 36, 69 (0.0189) 39, 66 (0.0487)
8 3 6, 30 (0.0061) 8, 28 (0.0242) 9, 27 (0.0424)
4 11, 41 (0.0040) 12, 40 (0.0081) 14, 38 (0.0242) 15, 37 (0.0364)
5 17, 53 (0.0031) 19, 51 (0.0093) 21, 49 (0.0225) 23, 47 (0.0466)
6 25, 65 (0.0040) 27, 63 (0.0100) 29, 61 (0.0213) 31, 59 (0.0406)
7 34, 78 (0.0047) 35, 77 (0.0070) 38, 74 (0.0200) 41, 71 (0.0469)
Selected tables for nonparametric statistics 941
n m wb w2 and O:mn
14 8 54, 130 (0.0041) 58, 126 (0.0098) 62, 122 (0.0211) 67, 117 (0.0475)
9 67, 149 (0.0043) 71, 145 (0.0096) 76, 140 (0.0228) 81, 135 (0.0478)
10 81, 169 (0.0044) 85, 165 (0.0093) 91, 159 (0.0242) "96, 154 (0.0478)
11 96, 190 (0.0045) 100, 186 (0.0090) 106, 180 (0.0221) 112, 174 (0.0477)
12 112, 212 0.0046) 116, 208 (0.0087) 123, 201 (0.0232) 129, 195 (0.0475)
13 129, 235 (0.0046) 134, 230 (0.0097) 141, 223 (0.0241) 147, 217 (0.0472)
14 147, 259 (0.0046) 152, 254 (0.0093) 160, 246 (0.0249) 166, 240 (0.0469)
15 3 8, 49 (0.0049) 9, 48 (0.0086) 11, 46 (0.0196) 13, 44 (0.0380)
4 15, 65 (0.0046) 17, 63 (0.0098) 20, 60 (0.0243) 22, 58 (0.0400)
5 23, 82 (0.0039) 26, 79 (0.0097) 29, 76 (0.0209) 33, 72 (0.0491)
6 33, 99 (0.0042) 36, 96 (0.0092) 40, 92 (0.0224) 44, 88 (0.0474)
7 44, 117 (0.0043) 47, 114 (0.0086) 52, 109 (0.0233) 56, 105 (0.0455)
8 56, 136 (0.0042) 60, 132 (0.0097) 65, 127 (0.0237) 69, 123 (0.0437)
9 70, 155 (0.0050) 73, 152 (0.0089) 79, 146 (0.0238) 84, 141 (0.0478)
10 84, 176 (0.0048) 88, 172 (0.0096) 94, 166 (0.0238) 99, 161 (0.0455)
11 99, 198 (0.0046) 103, 194 (0.0088) 110, 187 (0.0236) 116, 181 (0.0486)
12 115, 221 (0.0044) 120, 216 (0.0093) !27, 209 (0.0234) 133, 203 (0.0463)
13 133, 244 (0.0048) 138, 239 (0.0097) 145, 232 (0.0232) 152, 225 (0.0489)
14 151, 269 (0.0046) 156, 264 (0.0089) 164, 256 (0.0229) 171, 249 (0.0466)
15 171, 294 (0.0049) 176, 289 (0.0093) 184, 281 (0.0227) 192, 273 (0.0488)
16 4 15, 69 (0.0037) 17, 67 (0.0078) 21, 63 (0.0250) 24, 60 (0.0497)
5 24, 86 (0.0041) 27, 83 (0.0097) 31, 79 (0.0250) 34, 76 (0.0455)
6 34, 104 (0.0040) 37, 101 (0.0085) 42, 96 (0.0244) 46, 92 (0.0490)
7 46, 122 (0.0048) 49, 119 (0.0092) 54, 114 (0.0234) 58, 110 (0.0443)
8 58, 142 (0.0044) 62, 138 (0.0096) 67, 133 (0.0224) 72, 128 (0.0463)
9 72, 162 (0.0048) 76, 158 (0.0098) 82, 152 (0.0247) 87, 147 (O.O477)
10 86, 184 (0.0043) 91, 179 (0.0099) 97, 173 (0.0234) 103, 167 (o.o487)
11 102, 206 (0.0046) 107, 201 (0.0099) 114, 194 (0.0250) 120, 188 (0.0494)
12 119, 229 (0.0049) 124, 224 (0.0099) 131, 217 (0.0236) 138, 210 (0.0500)
13 137, 253 (0.0050) 142, 248 (0.0098) 150, 250 (0.0250) 156, 234 (0.0458)
14 155, 279 (0.0045) 161, 273 (0.0097) 169, 265 (0.0236) 176, 258 (0.0463)
15 175, 305 (0.0047) 181, 299 (0.0096) 190, 290 (0.0247) 197, 283 (0.0466)
16 196, 332 (0.0048) 202, 326 (0.0095) 211, 317 (0.0234) 219, 309 (0.0469)
17 5 25, 90 (0.0043) 28, 87 (0.0096) 32, 83 (0.0238) 35, 80 (0.0425)
6 36, 180 (0.0049) 39, 105 (0.0099) 43, 101 (0.0219) 47, 97 (-0.0433)
7 47, 128 (0.0043) 51, 124 (0.0097) 56, 119 (0.0236) 61, 114 (0.0497)
8 60, 148 (0.0045) 64, 144 (0.0095) 70, 138 (0.0247) 75, 133 (0.0487)
9 74, 169 (0.0046) 78, 165 (0.0091) 84, 159 (0.0223) 90, 153 (0.0476)
10 89, 191 (0.0047) 93, 187 (0.0088) 100, 180 (0.0230) 106, 174 (0.0465)
11 105, 214 (0.0047) 110, 209 (0.0096) 117, 202 (0.0235) 123, 196 (0.0453)
12 122, 238 (0.0046) 127, 233 (0.0092) 135, 225 (0.0238) 142, 218 (0.0486)
13 140, 263 (0.0046) 146, 257 (0.0098) 154, 249 (0.0240) 161, 242 (0.0472)
14 159, 289 (0.0045) 165, 283 (0.0093) 174, 274 (0.0242) 182, 266 (0.0500)
15 180, 315 (0.0050) 186, 309 (0.0099) 195, 300 (0.0243) 203, 292 (0.0485)
16 201, 343 (0.0049) 207, 337 (0.0093) 217, 327 (0.0243) 225, 319 (0.0471)
17 223, 372 (0.0047) 230, 365 (0.0098) 240, 355 (0.0243) 249, 346 (0.0493)
18 6 37, 113 (0.0047) 40, 110 (0.0091) 45, 105 (0.0236) 49, 101 (0.0448)
7 49, 133 (0.0047) 52, 130 (0.0085) 58, 124 (0.0237) 63, 119 (0.0484)
8 62, 154 (0.0046) 66, 150 (0.0094) 72, 144 (0.0235) 77, 139 (0.0452)
9 76, 176 (0.0044) 81, 171 (0.0100) 87, 165 (0.0231) 93, 159 (0.0475)
10 92, 198 (0.0050) 96, 194 (0.0090) 103, 187 (0.0226) 110, 180 (0.0493)
Selected tables for nonparametric statistics 943
18 11 108, 222 0.0047) 113, 217 (0.0094) 121, 209 (0.0247) 127, 203 (0.0461)
12 125, 247 (0.0044) 131, 241 (0.0097) 139, 233 (0.0239) 146, 226 (0.0474)
13 144, 272 (0.0048) 150, 266 (0.0099) 158, 258 (0.0232) 166, 250 (0.0485)
14 164, 298 (0.0050) 170, 292 (0.0100) 179, 283 (0.0247) 187, 275 (0.0495)
15 184, 326 (0.0047) 190, 320 (0.0091) 200, 310 (0.0239) 208, 302 (0.0465)
16 206, 354 (0.0049) 212, 348 (0.0092) 222, 338 (0.0231) 231, 329 (0.0473)
17 228, 384 (0.0046) 235, 377 (0.0093) 246, 366 (0.0243) 255, 357 (0.0479)
18 252, 414 (0.0048) 259, 407 (0.0094) 270, 396 (0.0235) 280, 386 (0.0485)
19 7 50, 139 (0.0042) 54, 135 (0.0090) 60, 129 (0.0238) 65, 124 (0.0471)
8 64, 160 (0.0047) 68, 156 (0.0093) 74, 150 (0.0224) 80, 144 (0.0475)
9 79, 182 (0.0050) 83, 178 (0.0093) 90, 171 (0.0239) 96, 165 (0.0474)
10 94, 206 (0.0045) 99, 201 (0.0093) 107, 193 (0.0250) 113, 187 (0.0472)
11 111, 230 (0.0047) 116, 225 (0.0092) 124, 217 (0.0233) 131, 210 (0.0468)
12 129, 255 (0.0048) 134, 250 (0.0090) 143, 241 (0.0240) 150, 224 (0.0463)
13 148, 281 (0.0049) 154, 275 (0.0099) 163, 266 (0.0247) 171, 258 (0.0497)
14 168, 308 (0.0050) 174, 302 (0.0096) 183, 293 (0.0230) 192, 284 (0.0489)
15 189, 336 (0.0050) 195, 330 (0.0093) 205, 320 (0.0235) 214, 311 (0.0482)
16 211, 365 (0.0050) 218, 358 (0.0100) 228, 348 (0.0239) 237, 339 (0.0474)
17 234, 395 (0.0050) 241,388 (0.0097) 252, 377 (0.0243) 262, 367 (0.0499)
18 258, 426 (0.0050) 265, 419 (0.0094) 277, 407 (0.0246) 287, 397 (0.0490)
19 283, 458 (0.0050) 291, 450 (0.0099) 303, 438 (0.0248) 313, 428 (0.0482)
20 8 66, 166 (0.0048) 70, 162 (0.0092) 77, 155 (0.0244) 83, 149 (0.0495)
9 81, 189 (0.0048) 85, 185 (0.0088) 93, 177 (0.0245) 99, 171 (0.0473)
10 97, 213 (0.0048) 102, 208 (0.0095) 110, 200 (0.0245) 117, 193 (0.0498)
11 114, 238 (0.0047) 119, 233 (0.0089) 128, 224 (0.0244) 135, 217 (0.0474)
12 132, 264 (0.0046) 138, 258 (0.0094) 147, 249 (0.0241) 155, 241 (0.0493)
13 151, 291 (0.0045) 158, 284 (0.0099) 167, 275 (0.0238) 175, 267 (0.0470)
14 172, 318 (0.0049) 178, 312 (0.0092) 188, 302 (0.0235) 197, 293 (0.0484)
15 193, 347 (0.0047) 200, 300 (0.0095) 210, 330 (0.0232) 220, 320 (0.0497)
16 215, 377 (0.0046) 223, 369 (0.0098) 234, 358 (0.0247) 2.,43, 349 (0.0475)
17 239, 407 (0.0049) 247, 399 (0.0100) 258, 388 (0.0242) 268, 378 (0.0485)
18 263, 439 (0.0047) 271, 431 (0.0094) 283, 419 (0.0238) 294, 408 (0.0495)
19 289, 471 (0.0049) 297, 463 (0.0096) 310, 450 (0.0250) 320, 440 (0.047"4)
20 315, 505 (0.0047) 324, 496 (0.0098) 337, 483 (0.0245) 348, 472 (0.0482)
21 9 83, 196 (0.0047) 88, 191 (0.0095) 95, 184 (0.0225) 102, 177 (0.0472)
10 99, 221 (0.0044) 105, 215 (0.0097) 113, 207 (0.0241) 120, 200 (0.0478)
11 117, 246 (0.0047) 123, 240 (0.0098) 131, 232 (0.0230) 139, 224 (0.0480)
12 136, 272 (0.0050) 142, 266 (0.0099) 151, 257 (0.0242) 159, 249 (0.0481)
13 155, 300 (0.0047) 162, 293 (0.0099) 171, 284 (0.0231) 180, 275 (0.0481)
14 176, 328 (0.0048) 183, 321 (0.0098) 193, 311 (0.0239) 202, 302 (0.0480)
15 198, 357 (0.0050) 205, 350 (0.0097) 216, 339 (0.0247) 225, 330 (0.0478)
16 220, 388 (0.0046) 228, 380 (0.0096) 239, 369 (0.0235) 249, 359 (0.0475)
17 244, 419 (0.0047) 252, 411 (0.0095) 264, 399 (0.0242) 274, 389 (0.0473)
18 269, 451 (0.0048) 277, 443 (0.0094) 290, 430 (0.0247) 301, 419 (0.0499)
22 9 85, 203 (0.0045) 90, 198 (0.0089) 98, 190 (0.0231) 105, 183 (0.0471)
10 102, 228 (0.0047) 108, 222 (0.0099) 116, 214 (0.0237) 123, 207 (0.0459)
11 120, 254 (0.0047) 126, 248 (0.0096) 135, 239 (0.0240) 143, 231 (0.0486)
12 139, 281 (0.0048) 145, 275 (0.0092) 155, 265 (0.0242) 163, 257 (0.0471)
13 159, 309 (0.0048) 166, 302 (0.0098) 176, 292 (0.0243) 185, 283 (0.0491)
14 180, 338 (0.0048) 187, 331 (0.0094) 198, 320 (0.0243) 207, 311 (0.0475)
15 202, 368 (0.0047) 210, 360 (0.0099) 221, 349 (0.0243) 231, 339 (0.0492)
944 P. K. Sen and P. R. Krishnaiah
22 16 225, 399 (0.0047) 233, 391 (0.0095) 245, 379 (0.0242) 255, 369 (0.0476)
17 250, 430 (0.0050) 258, 422 (0.0099) 270, 410 (0.0241) 281, 399 (0.0490)
23 10 105, 235 (0.0049) 111, 229 (0.0100) 119, 221 (0.0233) 127, 213 (0.0482)
11 123, 262 (0.0047) 129, 256 (0.0093) 139, 246 (0.0250) 147, 238 (0.0490)
12 142, 290 (0.0046) 149, 283 (0.0096) 159, 273 (0.0243) 168, 264 (0.0496)
13 163, 318 (0.0049) 170, 311 (0.0098) 180, 301 (0.0236) 190, 291 (0.0500)
14 184, 348 (0.0047) 192, 340 (0.0100) 203, 329 (0.0247) 212, 320 (0.0471)
15 207, 378 (0.0049) 214, 371 (0.0092) 226, 359 (0.0239) 236, 349 (0.0474)
16 230, 410 (0.0047) 238, 402 (0.0093) 251, 389 (0.0248) 261, 379 (0.0476)
24 10 107, 243 (0.0045) 113, 237 (0.0091) 122, 228 (0.0230) 130, 220 (0.0465)
11 126, 270 (0.0047) 132, 264 (0.0091) 142, 254 (0.0237) 151, 245 (0.0495)
12 146, 298 (0.0049) 153, 291 (0.0100) 163, 281 (0.0243) 172, 272 (0.0486)
13 167, 327 (0.0050) 174, 320 (0.0098) 185, 309 (0.0247) 194, 300 (0.0476)
14 188, 358 (0.0046) 196, 350 (0.0096) 208, 338 (0.0250) 218, 328 (0.0498)
15 211, 389 (0.0047) 219, 381 (0.0094) 231, 369 (0.0235) 242, 358 (0.0486)
16 235, 421 (0.0047) 244, 412 (0.0099) 256, 400 (0.0238) 267, 389 (0.0476)
25 10 110, 250 (0.0047) 116, 244 (0.0099) 126, 243 (0.0248) 134, 226 (0.0486)
11 129, 278 (0.0047) 136, 271 (0.0099) 146, 261 (0.0246) 155, 252 (0.0499)
12 149, 307 (0.0047) 156, 300 (0.0094) 167, 289 (0.0243) 176, 280 (0.0475)
13 170, 337 (0.0047) 178, 329 (0.0098) 189, 318 (0.0240) 199, 308 (0.0485)
14 193, 367 (0.0050) 200, 360 (0.0093) 212, 348 (0.0236) 223, 337 (0.0492)
15 216, 399 (0.0049) 224, 391 (0.0095) 237, 378 (0.0248) 248, 367 (0.0499)
16 240, 432 (0.0047) 249, 423 (0.0098) 262, 410 (0.0243) 273, 399 (0.0476)
26 11 132, 286 (0.0047) 139, 279 (0.0096) 149, 269 (0.0235) 158, 260 (0.0468)
12 153, 315 (0.0050) 160, 308 (0.0097) 171, 297 (0.0243) 181, 287 (0.0498)
13 174, 346 (0.0048) 182, 338 (0.0098) 193, 327 (0.0233) 204, 316 (0.0493)
14 197, 377 (0.0049) 205, 369 (0.0097) 217, 357 (0.0239) 228, 346 (0.0488)
15 220, 410 (0.0047) 229, 401 (0.0097) 242, 388 (0.0244) 253, 377 (0.0482)
16 245, 443 (0.0048) 254, 434 (0.0096) 268, 420 (0.0249) 279, 409 (0.0475)
27 11 135, 294 (0.0047) 142, 287 (0.0094) 153, 276 (0.0243) 162, 267 (0.0473)
12 156, 324 (0.0048) 163, 317 (0.0092) 175, 305 (0.0243) 185, 295 (0.0488)
13 178, 355 (0.0049) 186, 347 (0.0097) 198, 335 (0.0243) 208, 325 (0.0471)
14 201,387 (0.0049) 209, 379 (0.0094) 222, 336 (0.0242) 233, 355 (0.0483)
15 225, 420 (0.0049) 234, 411 (0.0098) 247, 398 (0.0241) 259, 386 (0.0493)
,16 250, 454 (0.0048) 259, 445 (0.0094) 273, 431 (0,0239) 285, 419 (0.0475)
28 ~ 12 159, 333 (0.0046) 167, 325 (0.0095) 179, 313 (0.0243) 189, 303 (0.0479)
13 182, 364 (0.0050) 190, 356 (0.0097) 202, 344 (0.0236) 213, 333 (0.0479)
14 205, 397 (0.0048) 214, 388 (0.0098) 227, 375 (0.0245) 238, 364 (0.0479)
15 230, 430 (0.0050) 239, 421 (0.0099) 252, 408 (0.0237) 264, 396 (0.0477)
29 12 163, 341 (0.0049) 171, 333 (0.0098) 183, 321 (0.0243) 194, 310 (0.0500)
13 186, 373 (0.0050) 194, 365 (0.0097) 207, 352 (0.0246) 218, 341 (0.0487)
14 209, 407 (0.0047) 218, 398 (0.0095) 232, 384 (0.0248) 243, 373 (0.0474)
15 234, 441 (0.0048) 243, 432 (0.0093) 258, 417 (0.0248) 270, 405 (0.0487)
30 13 189, 383 (0.0047) 198, 374 (0.0096) 211, 361 (0.0240) 223, 349 (0.0494)
14 213, 417 (0.0047) 223, 407 (0.0099) 236, 394 (0.0235) 249, 381 (0.0496)
15 239, 451 (0.0050) 248, 442 (0.0095) 263, 427 (0.0245) 276, 414 (0.0497)
For m and n larger than the tabulated entries, normal approximation holds quite well. Some of
the extreme unequal sample size combinations have been left out here; these may be obtained
from Wiicoxon, Katti and Wilcox (1968).
Selected tables ]:or nonparametric statistics 945
w h e r e a c o r r e s p o n d s t o t h e d e s i r e d p r o b a b i l i t y l e v e l a n d t~m, t h e a c t u a l o n e ,
b e i n g t h e l a r g e s t p o s s i b l e v a l u e a m o n g all s u c h s o l u t i o n s . T h e e n t r i e s in T a b l e s 2.1
a n d 2.2 a r e r e p r o d u c e d f r o m W i l c o x s o n , K a t t i a n d W i l c o x ( 1 9 6 8 ) w i t h t h e k i n d
permission of the American Cyanamid Company and the Department of
Statistics, Florida State University.
Table 3.1 a
Selected critical values for the three sample Kruskal-
Wallis statistics
nl n2 n3 h P ( H >i h)
2 2 2 4.571 0.0667
3 2 2 4.714 0.0476
3 3 2 5.139 0.0607
3 3 3 5.600 0.0500
4 2 1 4.821 0.0571
4 2 2 5.125 0.0524
4 3 1 5.208 0.0500
4 3 2 5.400 0.0508
4 3 3 5.727 0.0505 6.745 0.0100
4 4 1 4.867 0.0540 6.667 0.0095
4 4 2 5.236 0.0521 6.873 0.0108
4 4 3 5.576 0.0507 7.136 0.0107
4 4 4 5.692 0.0487 7.538 0.0107
5 2 1 5.000 0.0476
5 2 2 5.040 0.0556 6.533 0.0079
5 3 1 4.871 0.0516 6.400 0.0119
5 3 2 5.251 0.0492 6.822 0.0103
5 3 3 5.515 0.0507 7.079 0.0087
5 4 1 4.860 0.0556 6~840 0.0111
5 4 2 5.268 0.0505 7.118 0.0101
5 4 3 5.631 0.0503 7.445 0.0097
5 4 4 5.618 0.0503 7.760 0.0095
5 5 1 4.909 0.0534 6.836 0.0108
5 5 2 5.246 0.0511 7.269 0.0103
5 5 3 5.626 0.0508 7.543 0.0102
5 5 4 5.643 0.0502 7.823 0.0098
5 5 5 5.660 0.0509 7.980 0.0105
6 2 1 4.822 0.0478
6 3 1 4.855 0.0500 6.582 0.0119
6 3 2 5.227 0.0520 6.970 0.0091
6 3 3 5.615 0.0497 7.192 0.0102
6 4 1 4.947 0.0468 7.083 0.0104
6 4 2 5.263 0.0502 7.212 0.0108
946 P. K. Sen and P. R. Krishnaiah
nl n2 n3 h P ( H >1h)
Asymptotic
value 5.991 0.0500 9.210 0.0100
Table 3.2
Selected critical values for the four sample Kruskal-Wallis
statistics
nl n2 n3 n4 h P { H >t h}
Table 3.3
Selected critical values for the five sample Kruskal-Wallis statistics
nl n2 ns n4 n5 h P { H >I h}
1
Table 4.1
Selected critical values for the Friedman rank statistics
p n C(gp
) and a,,p
3 3 6.000 (0.0278)
4 6.500 (0.0417) 8.000 (0.0046)
5 6.400 (0.0394) 8.400 (0.0085)
6 7.000 (0.0289) 9.000 (0.0081)
7 7.143 (0.0272) 8.857 0.0084)
8 6.250 (0.0469) 9.000 (0.0099)
9 6.222 (0.0476) 8.667 (0.0103)
10 6.200 (0.0456) 9.600 (0.0075)
11 6.546 (0.0435) 9.456 (0.0065)
3 12 6.167 (0.0510) 8.667 (0.0107)
13 6.000 (0.0501) 9.385 (0.0087)
14 6.143 (0.0480) 9.000 (0.0101)
15 6.400 (0.0468) 8.933 (0.0097)
asymptotic 5.991 (0.0500) 9.210 (0.0100)
4 3 7.400 (0.0330) 9.000 (0.0017)
4 7.800 (0.0364) 9.600 (0.0067)
5 7.800 (0.0443) 9.960 (0.0087)
6 7.600 (0.0433) 10.200 (0.0096)
7 7.800 (0.0413) 10.543 (0.0090)
8 7.650 (0.0488) 10.500 (0.0094)
asymptotic 7.815 (0.0500) 11.345 (0.0100)
5 3 8.53 (0.0455) 1 0 . 1 3 (0.0078)
4 8.8 (0.0489) 11.2 (0.0079)
5 8.96 (0.049) 11.52 (0.010)
6 9.067 (0.049) 11.867 (0.0099)
7 9.143 (0.049) 12.114 (0.0100)
8 9.200 (0.050) 12.300 (0.0099)
asymptotic 9.488 (0.050) 13.277 (0.0100)
Selected tables for nonparametric statistics 949
p n ! ~.,,
"(~ and ~.j,
Note that for various combinations of (p, n), the entries were
computed by different workers and they have different degrees
of accuracy; viz, the en:ries (5, 4) and (5, 5). We may note that
across the table, the actual right hand tails of the exact dis-
tribution of X2 is dominated by that of the chi-square dis-
tribution (with the appropriate degrees of freedom), so that the
use of the asymptotic critical values usually results in a more
conservative test.
p { x 2 >~c(,,)
,~,j1. = a,,p ~
_< a < P { X ~ >~ d } . (4.2)
T h e tabulated entries relate to ~,'(~),.pand a,.p for a = 0.05 and 0.01. F o r large
values of n, u n d e r the null hypothesis, X 2 has closely the central chi-square
distribution with p - 1 degrees of f r e e d o m , so that the a p p r o x i m a t e critical
values can be o b t a i n e d f r o m the chi-square distributional tables.
and
Table 5.1
Table for the critical values of the one-sample Kolmogorov-Smirnov
statistic D.
n A a n d a~
and, actually, the right hand sides of (5.4) and (5.5) provide upper bounds for
any finite sample size. T h e s e approximations are quite good for n ~> 31. H e n c e ,
we provide the entries only for n ~< 30.
For two samples of equal sizes n, if F, and G , stand for the empirical
distributions, then one may define the one and two-sided K o l m o g o r o v -
Smirnov statistics as in (5.1) and (5.2) with F being replaced by G,. In this case,
(5.4) and (5.5) hold when we replace n 1/2D+ and n m D , by (n/2)l/2D + and (n/2)mD,,
respectively. For this two-sample case, B i r n b a u m and Hall (1960) have tabu-
lated the distributions for specific values of n, and we adopt their tables to
provide the critical values for specific level of significances. F o r the case of
m o r e than two samples, we refer to Section 7 for some tabulation of the
asymptotic critical values, mostly due to Kiefer (1959). In Tables 5.1 and 5.2,
the entries refer to A and a,, for (5.3) in the one and two-sample cases.
Table 5.2
Table for the critical values of two-sample D,+ and D, for some specific n
n D,+ Dn
K = n ( n - 1)(1 + t ) / 4 . (6.2)
Thus, for every t* ( - 1 ~< t* ~< 1), there exists a K* (0 ~< K* ~< (~)), such that
In Table 6.1, the values of K* for which the probabilities in (6.3) are close to
typical significance levels, along with their exact levels, are presented. This
table has kindly been provided by Professor Dana Quade who has used a
program of his" own to record these values up to four decimal places of
accuracy.
The Spearman rank correlation coefficient (rho) is defined by
where R i (and Si) are the ranks of X i (and Y/) among X1 . . . . ,Xn (and
Y 1 , . . . , ]I,) respectively, for i = 1 . . . . . . n (ties neglected). If, we define
then, we have
r* = 1 - 6 S * / ( n 3 - n ) . (6.6)
Table 6.1
Critical values and probability levels for the Kendall tau statistics
n K * ( P { K <~K*})
4 0 0.0417
5 0 0.0083 0 0.0083 1 ,0.0417
6 0 0.0014 1 0.0083 1 0.0083 2 0.0278
7 1 0.0014 2 0.0054 3 0.0151 4 0.0345
8 3 0.0028 4 0.0071 5 0.0156 6 0.0305
9 j 5 0.0029 6 0.0063 8 0.0223 9 0.0376
10 8 0.0046 9 0.0083 11 0.0233 12 0.0363
11 11 0.0050 12 0.0083 14 0.0203 16 0.0433
12 14 0.0044 15 0.0069 18 0.0224 20 0.0432
13 17 0.0033 19 0.0075 22 0.0211 25 0.0500
14 22 0.0049 24 0.0096 27 0.0236 29 0.0397
15 26 0.0041 28 0.0078 32 0.0231 35 0.0463
16 31 0.0043 34 0.0099 37 0.0206 41 0.0480
17 36 0.0040 39 0.0086 43 0.0211 47 0.0457
18 42 0.0043 45 0.0086 50 0.0239 54 0.0479
19 48 0.0041 52 0.0097 57 0.0245 61 0.0466
20 55 0.0045 59 0.0099 64 0.0234 69 0.0492
21 62 0.0045 66 0.0093 72 0.0244 77 0.0485
22 70 0.0050 74 0.0097 80 0.0237 85 0.0454
23 77 0.0043 82 0.0094 89 0.0249 94 0.0456
24 86 0.0048 91 0.0099 98 0.0246 104 0.0484
25 95 0.0049 100 0.0098 107 0.0232 114 0.0488
26 104 0.0048 109 0.0092 117 0.0233 124 0.0470
27 113 0.0044 119 0.0092 128 0.0247 135 0.0478
28 124 0.0049 130 0.0099 139 0.0249 146 0.0466
29 134 0.0046 140 0.0090 150 0.0241 158 0.0476
30 145 0.0047 152 0.0097 162 0.0245 170 0.0468
31 157 0.0050 164 0.0099 174 0.0240 183 0.0480
32 168 0.0046 176 0.0098 187 0.0246 196 0.0475
33 181 0.0049 188 0.0092 200 0.0243 210 0.0488
34 193 0.0046 202 0.0100 214 0.0249 224 0.0485
35 207 0.0049 215 0.0095 228 0.0247 239 0.0499
Table 6.2
Critical values and probability levels for the Spearman rho statistics
n s (P{S* ~<s})
4 0, 0.0417
5 0, 0.0083 0, 0.0083 2, 0.0417
6 0, 0.0014 2, 0.0083 4, 0.0167 6, 0.0292
7 4, 0.0034 6, 0.0062 12, 0.0240 16, 0.0440
8 10, 0.0036 14, 0.0077 22, 0.0229 30, 0.0481
9 20, 0.0041 26, 0.0086 36, 0.0216 48, 0.0484
10 34, 0.0044 42, 0.0087 58, 0:0245 72, 0.0481
11 54, 0.0049 64, 0.0091 84, 0.0239 102, 0.0470
12 78, 0.0048 92, 0.0093 118, 0.0244 142, 0.0495
13 108, 0.0047 128, 0.0097 160, 0.0249 188, 0.0485
14 146, 0.0047 170, 0.0095 210, 0.0250 244, 0.0486
15 194, 0.0050 222, 0.0097 268, 0.0244 310, 0.0486
16 248,0.0049 284, 0.0100 338, 0.0247 388, 0.0493
k
K1 : -{sup E
x ,1-
ni[S(inl(X) - - 2"11/2'
SN(x)] l
(7.1)
and
Table 7,1
Table for the percentile points of Ak(') and Bk('), k <~5
k A~(a) B~l(a)
a = 0.01 0.05 0.10 a = 0.01 0.05 0.10
Table 7.2
Critical values for K~ for k ~<7 and some typical levels of
significance
k
ot 1 2 3 4 5 6 7
II
a,
II
If
~5
H
_=
II
II
II
"-d
H
e.
rZ ~
Selected tables for nonparametric statistics 957
Also, for the truncated version of the Kolmogorov-Smirnov test, one encoun-
ters a test statistic whose asymptotic null-distribution agrees with that of
sup{(t(1 t))-mB(t): el ~< t ~< 1 - e2}, where el and e2 are positive numbers, and
B~, is the tied-down Bessel process, defined as in before. For k = 1, we may
also have a one-sided version, wherein B(t) is replaced by W(t). Note that if
we let T = ( 1 - e 2 ) / e l (>1), then, for this statistic too, the distribution is the
same as in (7.7). For this reason, we use the tables in D e L o n g (1981) and
provide some critical values of BS(T) for various T and k. For k = 1, the
one-sided entries are also presented.
We conclude this section with the remark that for the unweighted one and two-
sample Kolmogorov-Smirnov statistics, under censoring, for one and two-sided
tests, one needs to consult the tables for P{sup0<,~ T W(u) <<-a} (=A+(a, T),
say) and P{supo<u~T]W(t)[ <- a} ( = A ( a , T ) , say), for various a (>/0)
and T: 0 < T~< 1, where W is a standard Brownian Bridge. By using the
basic results in Anderson (1960), Schey (1977) considered these and has
tabulated some of the entries. For example, for T = 0.1, 0.5 and 0.9, the one-sided
critical values for a = 0.05 are 0.5985, 1.133, and 1.224, respectively while the two-
sided ones are 0.6825, 1.400 and 1.480, respectively. With more adaptations from
Koziol and Byar (1975), other critical values for a = 0 t 0 for the one-sided case
corresponds to the entries for a = 0.05 for the two-sided case, stated above. For
a = 0.01, the one-sided critical values for T = 0.1, 0.5 and 0.9 are 0.916, 1.658 and
1.731, respectively, while the two-sided values are 0.851, 1.552 and 1.628,
respectively.
References
[1] Birnbaum, Z. W. (1952). Numerical tabulation of the distribution of Kolmogorov's statistics for
finite sample size. Jour. Amer. Statist. Assoc. 47, 425-441.
[2] Birnbaum, Z. W. and Hall, R. A. (1960). Small sample distribution for multi-sample statistics
of the Smirnov type. Ann Math. Statist. 31, 710-720.
[3] DeLong, D. (1980). Some asymptotic properties of a progressively censored nonparametric
test for multiple regression. Jour. Multivar. Anal. 10, 360-370.
[4] DeLong, D. (1981). Crossing probabilities for a square root boundary by a Bessel process.
Commun. Statist. Theor. Meth. All), 2197-2213.
[5] Hollander, M. and Wolfe, D. A. (1973). Nonparametric Statistical Methods Wiley, New York.
[6] Iman, R. L., Quade, D. and Alexander, D. A. (1975). Exact probability levels for the
Kruskai-wailis test. In: H. L. Harter and D. B. Owen, eds., Selected Tables in Mathematical
Statistics, Amer. Math. Soc., Providence, RI, pp. 329-384.
958 P. K. Sen and P. R. Krishnaiah
[7] Kendall, M. G. and Smith, B. (1939). The problem of m-rankings. Ann. Math. Statist. 10,
275-287.
[8] Kiefer, J. (1959). K-sample analogues of the Kolmogorov--Smirnov and Cramer-V. Mises'
tests. Ann. Math. Statist. 30, 420--447.
[9] Koziol, J. A. and Byar, D. P. (1975). Percentage points of the asymptotic distributions of one-
and two-sample K-S statistics for truncated or censored data. Technometrics 17, 507-510.
[10] Michaelis, J. (1971). Schwellenwerte des Friedman-tests. Biom. Zeit. 13, 118-129.
[11] Millar, L. H. (1956). Tables of percentage points of Kolmogorov statistics. Jour. Arner. Statist.
Assoc. 51, 111-121.
[12] Odeh, R. E. (1977). Extended tables of the distribution of Friedman's S-statistic in the
two-way layout. Commun. Statist. Set. B 6, 29--48.
[13] Otten, A. (1973). The null distribution of Spearman's S when n = 13(1)16. Statist. Neerland.
27, 19--20.
[14] Owen, D. B. (1962). Handbook of Statistical Tables. Addison-Wesley, Reading, MA.
[15] Quade, D. (1972). Average internal rank correlation. Tech. Report. Math. Centrum, Am-
sterdam.
[16] Schey, H. (1977). The asymptotic distribution of the one-sided Kolmogorov-Smirnov statistic
for truncated data. Commun. Statist. Theor. Meth. A6, 1361-1366.
[17] Wilcoxon, F., Katti, S. K. and Wilcox, R. A. (1968). Critical values and probability levels for
the Wilcoxon rank sum and the Wilcoxon signed rank test. Amer. Cynamid Co., Pearl River,
New York.
Subject Index
959
960 Subject index
IFR class, 581, 585, 589, 615, 616, 617, 621, 628 Linear spacings statistics, 587
Inclusion and exclusion, 127 Linear unbiased estimators, 373
Incomplete block designs, 299 Locally most powerful (LMP) rank tests, 21,
Independence criteria, 444 39, 561, 562, 570, 677
Induced order statistics, 383--402 Log-dose transformation, 730
Influence curve, 468 Logistic distribution, 371, 373
Inspection procedure, 624 Logits, 302
Integral of the mean-square error, 540 Log-likelihood-ratio statistics, 865
Interaction parameters, 833 Log-linear analysis of paired comparisons, 324
Interchangeable random variables, 130, 140 Loglinear representation, 833
Internal constraints problems, 834 Log-odds 0ogit) representation, 858
Intra-block permutations, 16 LOg rank test, 800
Invariance principle, 20, 473, 664 L-statistics, 504, 666
Invariant SPRT, 669 Luce choice axiom, 315
Invariant to the translation, 475
Isotonic power, 587, 589 M-, L-, and R-estimators, 463-481
Isotonic regression, 551 Maintenance policies, 624
Iterated logarithm inequalities, 686 Mann-Whitney test, 43, 562
Iterative procedure, 303 MANOCOVA, 723
MANOVA, 718
Jackknife estimator, 729
Many-one rank-sum statistics, 198
Kaplan-Meier product limit estimator, 551, Many-one sign statistics, 197
554, 555, 573, 597, 648, 772-776 Martingale, 384, 385, 596, 664
KendaU's coefficient of concordance, 826 Matched pairs, 187
Kendall's rank correlation, 570 Matching invariance, 4, 11, 12
Kendall's tau, 81, 91, 147, 887-889 Maximal correlation, 87
Kernel-type empirical density, 533, 537 Maximal deviation tests, 599
Kiefer process, 435, 439, 440 Maximal essential similar partition, 6, 26
Kiefer-Wolfowitz approximation, 525 Maximal invariants, 1, 5, 11, 12, 19, 20, 25
Kolmogorov-Smirnov statistic, 406, 546, 554, Maximum likelihood estimates, 373, 834
557, 563, 564, 599, 604, 668, 729, 741, 779- Maximum of correlated normal variables, 913,
784, 885--887 920, 925
Kolmogorov statistic, 592 Mean square convergence, 526
Kruskal-Wallis statistic, 40, 47, 551, 566, 881- Mean-square error, 539
883 Measures of concordance, 80, 85
K-step rank estimates, 265 Measures of dependence, 79
Median regression, 813
Ll-distance, 535 M-estimator, 494, 495
L2-distanee, 535 Method of n rankings, 186
Large deviations, 424 Minimal alternative, 423
Law of iterated logarithm, 416, 468, 473, 545 Minimal sufficient statistics, 5
Laws of large numbers, 146 Minimax property, 466, 469, 475, 591
Least squares regression, 819 Minimum discrimination information, 832, 835,
Lehmann alternatives, 671 841
Lehmann test, 729 Minimum distance estimates, 741, 743, 746, 747
L-estimator, 474, 496 Minimum distance test, 741, 752
Life distributions, 613 Minimum risk estimator, 489, 490
Life tests, 362 Misclassification, 547
Likelihood methods, 255, 302 Mixture of life distributions, 620
Limit distribution, 368, 369, 374, 412, 414, 535, Mixture of noncrossing distributions, 621
541, 542, 543 Mode of the distribution, 547
Linear model, 229, 259 Moment convergence, 468, 470, 473
Linear ordered (L-) estimators, 347, 348 Moment inequalities, 623
Linear rank statistics, 38, 159, 558, 586, 662, Monotone tests, 639
664, 813 Monotonic hazard rates, 377
962 Subject index
V o l u m e 2. Classification, P a t t e r n R e c o g n i t i o n a n d R e d u c t i o n of
Dimensionality
E d i t e d b y P. R . K r i s h n a i a h a n d L. N. K a n a l
1982 xxii + 903 p p .