About Testing The Hypothesis of Equality of Two Bernoulli

Mathematical Theory and Modeling www.iiste.
org
ISSN 2224-5804 (Paper) ISSN 2225-0522 (Online)
Vol.4, No.9, 2014

142
About Testing the Hypothesis of Equality of Two Bernoulli
Regression Curves
Petre Babilua (Corresponding author)
Faculty of Exact and Natural Sciences of I. Javakhishvili Tbilisi State University
2 University St., Tbilisi 0186, Georgia
Tel: +995599408383 E-mail: petre.babilua@tsu.ge

Elizbar Nadaraya
Faculty of Exact and Natural Sciences of I. Javakhishvili Tbilisi State University
Tel: +995599570555 E-mail: elizabar.nadaraya@tsu.ge

Grigol Sokhadze
I. Vekua Institute of Applied Mathematics of I. Javakhishvili Tbilisi State University
Tel: +995591313197 E-mail: grigol.sokhadze@tsu.ge
Abstract
The limiting distribution of an integral square deviation between two kernel type estimators of Bernoulli
regression functions is established in the case of two independent samples. The criterion of testing is constructed
for both simple and composite hypotheses of equality of two Bernoulli regression functions. The question of
consistency is studied. The asymptotics of behavior of the power of test is investigated for some close
alternatives.
Keywords: Bernoulli Regression Function, Power of Test, Consistency, Composite Hypothesis

1. Introduction
Let random variables
( ) i
Y , 1,2 = i , take two values 1 and 0 with probabilities
i
p (succes) and
i
p 1 ,
1,2 = i (failure), respectively. Assume that the probability of success
i
p is the function of an independent
variable | | 0,1 e x , i.e. ( )
( )
{ } x Y x p p
i
i i
| 1 = = = P ( ) 1, 2 i = (see [1]-[3]). Let
j
t , n j , 1, = , be the
devision points of the interval | | 0,1 :
2 1
= , =1, , .
2
j
j
t j n
n

Let further
( ) 1
i
Y and
( ) 2
i
Y , n i , 1, = , be mutually independent random Bernoulli variables with
( )
{ } ( )
i k i
k
i
t p t Y = | 1 = P ,
( )
{ } ( )
i k i
k
i
t p t Y 1 = | 0 = P , n i , 1, = , 1,2 = k . Using the samples
( ) ( ) 1 1
1
, ,
n
Y Y
and
( ) ( ) 2 2
1
, ,
n
Y Y we want to chek the hypothesis
( ) ( ) ( ) | | 1 , 0 , = = :
2 1 0
e x x p x p x p H ,
against the sequence of close alternatives of the form
( ) ( ) ( ) ( ) 1,2. = , = :
1
k o x u x p x p H
n k n k n
o o + +
Mathematical Theory and Modeling www.iiste.org
Vol.4, No.9, 2014

143
where 0
n
o relevantly, ( ) ( ) x u x u
2 1
= , | | 0,1 e x and ( )
n
o o uniformly in | | 0,1 e x .
The problem of comparing two Bernoulli regression functions arises in some applications, for example in
quantal biossays in pharmacology. There x denotes the dose of a drung and ( ) p x the probability of response
to the dose x .
We consider the crietrion of testing the hypothesis
0
H based on the statistic
( ) ( ) ( )
( )
( ) ( )
( )
( ) ( )
2 2
2
1 2 1 2
1 1
,
2 2
= , 1 , 0,
n n
n n n n n n n n
n n n
T nb p x p x p x dx nb p x p x dx
b b
t t
t t t t
O O
= = ( (

O > (

} }

where
( ) ( ) ( )
( )
( )
( )
1
1
1
,
1
= , =1, 2,
1
= ,
in in n
n
j i
in j
j
n n
n
i
n
i
n n
p x p x p x
x t
p x K Y i
nb b
x t
p x K
nb b
=
=
=
| |
|
\ .
| |
|
\ .

( ) x K is some distribution density and 0
n
b is a sequence of positive numbers, ( ) x p
in
is the kernel
estimator of the regression function (see [4], [5]).
2. Assumptions and Notation
We assume that a kernel ( ) 0 > x K is chosen so that it is a function of bounded variation and satisfies the
conditions: ( ) ( ) x K x K = , ( ) 0 = x K for 0 > t > x , ( ) 1 =
}
dx x K . The class of such functions is denoted by
( ) t H .
We also introduce the notation:
( )
( ) ( )
( )
( ) ( ) ( )
( ) ( )
( )
( )
( ) ( ) ( ) ( )
2
1
1 2
1 2
2 2
2
2 1 1
1
= , = , =1, , ,
2
= , , , = ,
1
= , = = 1 .
n
n
n n n n in in in
ij n i j n
n n
n k
n k i ik i i k i k i
k i k
n
T nb p x p x dx p x p x Ep x i n
x u x v
Q t t u v K K dx
b b
d d Q d d t p t p t
nb
t
t

o
O
O
= = =
(

| | | |
| |
\ . \ .
}
}

3. Auxiliary Assertions
Lemma 1 ([6]). Let ( ) ( ) K x H t e and ( ) p x , | | 0,1 x e , be a function of bounded variation. If
n
nb ,
then
( ) ( )
3 3 1 2 1 2
1
1
0
1 1 1
=
n
i i
i
i
n n n n n n n
x t y t x u y u
K K p t K K p u du O
nb b b b b b nb
v v v v v v
=
| | | | | | | | | |
+
| | | | |
\ . \ . \ . \ . \ .
}

uniformly in | | 0,1 , e y x , where { } 0 eN
i
v , 1,2,3 = i .
Lemma 2. Let ( ) ( ) t H x K e , ( ) | | 0,1
1
C x p e and ( ) x u
1
, ( ) x u
2
be continuous functions on | | 0,1 . If
Vol.4, No.9, 2014

144

2
n
nb and 0
1/2
n n
b o , then for the hypothesis
n
H
1

( ) ( ) ( ) ( ) ( )
1
2
1 2 2 2 2
0
0 2
2 1
n n
x
b p p x p x dx K x dx
t
o o
s
=
} }
(1)
and
( ) ( ) ( ) ( )
1/2 1/2 1/2
3/2
1
=
n n n n n
n
b p O b O b O
nb
o

| |
A A + +
|
\ .
, (2)
where
( )
( ) ( ) ( ) ( ) ( )
1
1 2
0 | |
0
= , = 1 ,
= , .
n n
x
ET p p x p x dx K u du
K K K is the convolution operator
t s
A A
- -
} }

Proof. We have
( )
( ) ( )
2 2 2 2
1 2
2
, 1 1
1 1 1
=
2 2
n n
n i k ik i ii
i k i
n
d d Q d Q A n A n
nb
o
= =
| |
= +
|
\ .

(3)
where
( ) ( ) ( ) ( ) ( ) ( ) ( )
1 1 2 2
1 1 , 1, , ,
k k k k k k
d d t p t p t p t p t k n = = + =
( ) ( ) ( ) ( ) = 2 1 , 1, 2
k k k n
d p t p t O k o + = , (4)
uniformly in [0,1] e
k
t .
It can be easily established that
( )
( )
2
1 2 3 2 2
2 1 2
1
1 1
2
n
i n
n n i
i
n n n
n
x t
b A n n b d K dx c c
b nb nb
o
t

=
O
| |
| | |
= s +
|
|
\ .
|
\ .
}
. (5)

From the definition of
ik
Q and (4) follows
( ) ( ) ( ) ( ) ( ) ( ) ( )
( )
( ) ( ) ( )
2
2
1
1
1
= 2 1 ,
2
= .
n
n
i i
n i i n
i
n n
n
n n
x t y t
A n nb p t p t O K K dx dy
b b
t
o
t t t
=
O
( | | | |
+
( | |
( \ . \ .
O O O
}

Further, using Lemma 1 and also taking into account that

( ) | | 1 , 0
1
C x p e and | | t t, ,
1

(

n n
b
x
b
x
for all
( ) t
n
x O e , it is easy to show that
( ) ( ) ( ) ( )
( )
( ) ( )
( )
( )
2 2
1 2 2 1/2
1 0
2
1
1
= 2 1 .
n
n
n
x
b
n n n n n
x n n
b
x y
b A n p x p x K dx dy O b O b O O
b nb
t
t
t
o o

O
+
| | | |
+ + + +
| |
\ . \ .
} }

Thus
Vol.4, No.9, 2014

145
( ) ( ) ( ) ( ) ( )
1
2
1 2 2
1 0
0 2
2 1
n
x
b A n p x p x dx K x dx
t
s

} }
. (6)
From (5) and (6) follows statement (1).
Further, using the above-mentioned method, we can write
( )
( ) ( )
( )
( ) ( ) ( ) ( )
( )
( )
( ) ( ) ( ) ( ) ( ) ( )
1 1 2
1
2
1
1
2
0
1
= = =
2
1
= 1 =
1
1 .
n
n
n
n
n
i
n n n i
i
n
x
b
n n n
x n
b
n n
n x
x t
ET nb K d t dx
b
K u p x b u p x b u du dx O O
nb
p x p x dx K x dx O b O O
nb
t
t
t
o
o
=
O
O
s
| |
A
|
\ .
(
| | (
+ +
| (
\ .
(

| |
= + + +
|
\ .
}
} }
} }

Thus
( ) ( ) ( ) ( )
1/2 1/2 1/2
3/2
1
=
n n n n n
n
b p O b O b O
nb
o

| |
A A + +
|
\ .
.
The lemma is proved.
Asymptotical Normality of the Statistic
n
T
We have the following assertion.
Theorem 1. Let ( ) ( ) t H x K e and
( ) ( ) ( ) | | 0,1 , ,
1
2 1
C x u x u x p e .
If
2
n
nb , 0
1/2
n n
b o and
0
2 1/2
c nb
n n
o , < < 0
0
c , then for the hypothesis
n
H
1

( ) ( ) ( ) ( )
1/2 1
,1
d
n n
b T p p N a o

A ,
where ( ) p A and ( ) p
2
o are defined in Lemma 2 and
d
denotes convergence in distribution and
( ) ,1 a N is a random variable having the standard normal distribution with parameters ( ) 1 , a ,
( )
( ) ( ) ( )
1
2
0
1 2
0
=
2
c
a u x u x dx
p o

}
.
Proof. We have
( ) ( ) ( ) 1 1 2
=
n n n n
T T L L + + ,
where
( )
( ) ( ) ( ) ( )
( )
( )
( ) ( )
( )
1
1 2 1 2
2
2
1 2
,
1
.
2
n
n
n n n n n n
n n n n
L nb p x p x Ep x Ep x dx
L nb Ep x Ep x dx
t
t
O
O
= ( (

= (

}
}

By the Lemma 1, it is clear
Vol.4, No.9, 2014

146
( )
( ) ( )
( )
2
1
2 1/2 1/2 2
1 2
0
1 1 1
2
n n n n
n n n
n
x t
b L nb K u t u t dt O dx
b b nb
t
o
O
| | | |
= + (
` | |

\ . \ . )
} }
. (7)
Since | | t t, ,
1

(

n n
b
x
b
x
for all ( ) t
n
x O e , then from (7) we find
( )
( ) ( ) ( ) ( )
( )
2
2 1/2 1/2 2
1 2
1 1
2
n n n n n n
n
n
b L nb K t u x b t u x b t dt O dx
nb
t
t t
o
O
( | |
= +
( |
( \ .
} }
. (8)
Further, since ( ) ( ) | | 0,1 ,
1
2 1
C x u x u e , then from (8) we have
( )
( ) ( ) ( )
1
2
2 1/2 0
1 2
0
2
n n
c
b L u t u t dt

}
. (9)
Now, we show that
( ) 1 1/2
0
n n
b L
. We have
( )
( ) ( ) ( ) ( )
( )
1 1/2 1/2
1 1 2
1
=
2
n n n n n n
n
b L nb p x Ep x Ep x dx
t
O

}
( ) ( ) ( ) ( )
( )
1/2
2 1 2
1
2
n n n n
n
nb p x Ep x Ep x dx
t O
=
}

( ) ( ) 1 2
n n
I I = + . (10)
It is clear that
( ) ( )
( )
( ) ( ) ( ) ( )
( )
( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( )
( )
( ) ( ) ( )
1/2
2
1/2
2
1 1 1/2
1 1 2
1/2
1/2
1 1 1 2 1 1 2 1 1 2 2 2 1 2
1
=
2
1
cov , ,
2
= .
n
n
n n n n n n
n n n n n n n
n
n n
E I E I nb E p x Ep x Ep x dx
nb p x p x Ep x Ep x Ep x Ep x dx dx
t
t
t
t t
O
O
(
| |
| |
(
| s =
|
| (
\ .
\ .

(
= (
(

O O O
}
}

Easily verify, that
( ) ( ) ( )
( )
( ) ( ) ( )
1 2
1 1 1 2 1 1
2
1
1
cov , 1
n
i i
n n i i
i
n n
n
x t x t
p x p x K K p t p t
b b
nb =
| | | |
=
| |
\ . \ .

and by Lemma 2 we can now write
( ) ( ) ( )
( ) ( ) ( )
( )
1 2
1 1 1 2
1
1 2
1 1
2
0
cov , =
1
1 .
n n n
n n
n
p x p x n b
x u x u
K K p u p u du O
b b
nb

| |
| | | |
| +
| |
|
\ . \ .
\ .
}

Thus
Vol.4, No.9, 2014

147
( )
( ) ( ) ( )
( ) ( )
( ) ( ) ( ) ( ) ( ) ( )
1
1 1/2 1 2
1 1
2 2
0
1 2
1/2 2
1/2
1 1 2 1 1 2 2 2 1 2 3 3
1 1 1
1
2
= 0,
n
n n
n n n
n
n n
n n n n n n
n
x u x u
E I nb K K p u p u du
b b nb
nb
nb
Ep x Ep x Ep x Ep x dx dx c n b c
n
t
o
o
o
O
(
| | | |
s + (
| |
( \ . \ .

s
`
)
} }

since, by condition,
0
2 1/2
c nb
n n
o , < < 0
0
c and
1/2 2
1/4
=
n n
n
n
nb
n
b
o
o .
So,
( ) 1
0
n
I . Analogously we can show that

( ) 2
0
n
I .
Hence
( ) 1
0
n
L . (11)
Further, to prove the theorem it remains to show
( )
( )
1
0,1
d n n
n
T
N
o
A
. (12)
Since the proof of (12) is similar to that of Theorem 1 from [7], we omit it.
Using the representation
( ) ( ) ( ) 2 1 1
=
n n n n
L L T T + + , Lemma 2, (9), (11) and (12), we find that
( )
( )
( ) ( ) ( )
1
2
1/2 0
1 2
0
,1
2 ( )
n d
n
T p c
b N u x u x dx
p p o o
| | A | |

| |
|
\ . \ .
}
.
The theorem is proved.
The conditions of Theorem 1 for
n
b and
n
o are fulfilled if we assume
o
n b b
n 0
= and
1 2 4
0 n
n
o
o o
+
= for
0 1 2 o < < .
Corollary. Let ( ) ( ) t H u K e and ( ) | | 0,1
1
C x p e . If
2
n
nb , then for the hypothesis
0
H
( ) ( ) ( ) ( )
1/2 1
0,1
d
n n
b T p p N o

A . (13)
4. Application of the Statistic
n
T for the Hypothesis Testing
As an important application of the result of the corollary let us construct the criterion of testing the simple
hypothesis
0
H : ( ) ( ) ( )
1 2
= = p x p x p x (this is the case with given ( ) p x ); the critical domain is defined by
the inequality
( ) ( ) ( )
1/2
=
n n n
T d p b p
o
o o > A + ,
Vol.4, No.9, 2014

148
And from Theorem 1 we establish that the local behavior of the power ( ) ( )
1
H n n
n
T d o > is as follows
( ) ( )
( )
( )
1
1
H n n
n
A u
T d
p
o
o
o
| |
> u |
|
\ .
,
where
( ) ( ) ( ) ( ) ( )
1
2
0
1 2 1 2
0
, ,
2
c
A u u x u x dx u u u = =
}
,
( ) =1
o
o u , ( ) u is a standard normal distribution.
Note note that in (13) the statistic function
n
T is normalized by the values ( ) p A and ( )
2
p o which depend
on ( ) p x . If ( ) p x is not defined by a hypothesis, then the parameters ( ) p A and ( )
2
p o should be replaced
respectively by
( )
( )
( )
( )
( )
( )
2
2 2 2
0
2
,
2 ,
n
n
n n
x
n n
x
x dx K x dx
x dx K x dx
t t
t t
o
O s
O s
A =
=
} }
} }

( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( )
1 1 2 2
=
n n n n n n n
x p x p x p x p x p x p x +
and we show that
( ) ( ) ( )
1/2 2 2
0,
n n n
b p p o o
A A . (14)
Let us prove (14). Since ( )
1
1
n
n
p x O
nb
| |
= +
|
\ .
uniformly in ( )
n
x t eO and ( )
4 in
p x c s , | | 0,1 x e , 1, 2 i = ,
we obtain
( )
( ) ( ) ( )
( )
( )
( ) ( ) ( )
( )
( )
( ) ( )
( )
( ) ( )
( )
1/2
1/2 1/2
2 2
1/2
5 1 1 2 2
1/2 1/2
1 2
.
n n
n n n n n
n n
n n n n
n n
b E p
c b E p x Ep x dx E p x Ep x dx
b Ep x p x dx b Ep x p x dx
t t
t t
O O

O O
A A s

s + +
`

)
+ +
} }
} }

Further, using Lemma 1 and and also taking into account that ( ) | |
1
0,1 p x C e and | |
1
, ,
n n
x x
b b
t t
(

(

for all
( )
n
x t eO , it is easy to see that
( ) ( )
1/2 1/2
3/2
1 1
n n n
n
b E p O O b O
nb n b
| |
| |
A A = + + |
|
|
\ .
\ .
.
Vol.4, No.9, 2014

149
Hence ( ) ( )
1/2
0
n n
b p
A A . Analogously, it can be shown that ( )

2 2
n
p o o .
Theorem 2. Let ( ) ( ) K x H t e and
( ) ( ) | |
1
1 2
= 0,1 p x p x C e .
If
2
n
nb , then for n
( ) ( )
1/2 1
0,1 .
d
n n n n
b T N o

A
Proof. Follows from (13) and (14).
Theorem 2 enables us to construct an asymptotical criterion of testing the composite hypothesis
( ) ( )
0 1 2
: = H p x p x , | | 0,1 x e . The critical domain for testing this hypothesis is defined by the inequality
( ) ( )
1/2
, 1
n n n n n
T d b
o o
o o o
> = A + u = . (15)
Theorem 3. Let ( ) ( ) K x H t e , ( ) ( ) | |
1
1 2
, 0,1 p x p x C e . If
2
n
nb , then for n
( )
( )
1
1
H n n
T d o >
Here the alternative hypothesis
1
H is any pair ( ) ( ) ( )
1 2
, p x p x , ( ) ( ) | |
1
1 2
, 0,1 p x p x C e , ( ) 0 1
i
p x s s ,
1, 2 i = , such that ( ) ( )
1 2
p x p x = on the set of positive measure.
Proof. Is similar to the proof of Theorem 3 from [7].
Remark. Let
i
t be the division points of the interval | | 0,1 , which are chosen so that
( )
2 1
2
j
j
H t
n
= , 1 , , j n = ,
where ( ) ( )
0
x
H x h u du =
}
, ( ) h u is some known continuous distribution density o | | 0,1 . In this case, by a
similar reasoning to the above one we can be generalize the results obtained in this paper.

Acknowledgement. The work is supported by Shota Rustaveli National Scientific Foundation, Project
No. PG/18/5-104/13

Vol.4, No.9, 2014

150
References
1. Efromovich, S. (1999), Nonparametric curve estimation. Methods, theory, and applications. Springer Series
in Statistics. New York: Springer-Verlag.
2. Copas, J. B. (1983), Plotting p gainst x . Appl. Statist. 32:1, 25-31

3. Okumura, H. & Naito, K. (2004), Weighted kernel estimators in nonparametric binomial regression. The
International Conference on Recent Trends and Directions in Nonparametric Statistics. J. Nonparametr. Stat.,
16:1-2, 39-62. DOI:10.1080/10485250310001624828,
http://www.tandfonline.com/doi/abs/10.1080/10485250310001624828
4. Nadaraya, E. A. (1964), On estimating regression. (Russian) Teor. Veroyatn. Primen., 9, 157-159
5. Watson, G. S. (1964), Smooth regression analysis. Sankhy Ser. A, 26, 359-372
6. Nadaraya, E., Babilua, P., & Sokhadze, G. (2010), Estimation of a distribution function by an indirect
sample. (Russian) Ukr. Mat. Zh., 62:12, 1642-1658; translation in Ukr. Math. J., 62:12, 1906-1924
7. Nadaraya, E. Babilua, P., & Sokhadze, G. (2013), On the integral square measure of deviation of a
nonparametric estimator of the Bernoulli regression. Teor. Veroyatn. Primen., 57:2, 322-336; translation in
Theory Probab. Appl., 57:2, 265-278.

The IISTE is a pioneer in the Open-Access hosting service and academic event
management. The aim of the firm is Accelerating Global Knowledge Sharing.

More information about the firm can be found on the homepage:
http://www.iiste.org

CALL FOR JOURNAL PAPERS
There are more than 30 peer-reviewed academic journals hosted under the hosting
platform.
Prospective authors of journals can find the submission instruction on the
following page: http://www.iiste.org/journals/ All the journals articles are available
online to the readers all over the world without financial, legal, or technical barriers
other than those inseparable from gaining access to the internet itself. Paper version
of the journals is also available upon request of readers and authors.

MORE RESOURCES
Book publication information: http://www.iiste.org/book/

IISTE Knowledge Sharing Partners
EBSCO, Index Copernicus, Ulrich's Periodicals Directory, JournalTOCS, PKP Open
Archives Harvester, Bielefeld Academic Search Engine, Elektronische
Zeitschriftenbibliothek EZB, Open J-Gate, OCLC WorldCat, Universe Digtial
Library , NewJour, Google Scholar

About Testing The Hypothesis of Equality of Two Bernoulli

Загружено:

Сведения о документе

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

About Testing The Hypothesis of Equality of Two Bernoulli

Загружено:

Авторское право:

Доступные форматы

Mathematical Theory and Modeling www.iiste.

A A . Analogously, it can be shown that ( )

Вам также может понравиться