[PPT] - IWSM 2013, the University of Georgia in Athens, Georgia, USA 18-21 PowerPoint Presentation

SLIDE 1

IWSM 2013, the University of Georgia in Athens, Georgia, USA

18-21 July, 2013

SLIDE 2

Presentation structure

1. Introduction
2. Constrained Bayesian problems of the multiple

comparisons

3. Properties of the hypotheses acceptance regions
4. The method of sequential analysis of Bayesian type
5. Investigation of the method of sequential analysis of

Bayesian type

6. Experimental research

SLIDE 3

1. Introduction

We will consider the following methods

f

sequential analysis:

Sequential methods of Wald (middle of forties of 20th

century); century);

Bayesian sequential procedures (end of forties of 20th

century);

Berger sequential tests (beginning of nineties of 20th

century);

The sequential analysis method of Bayesian type

(2010).

SLIDE 4

2. Constrained Bayesian problems of

the multiple comparisons

To minimize the averaged value of probabilities of incorrectly rejected hypotheses (1)

∑ ∫ ∑

≠ = Γ =

=

S i j j i S i i

j

d H p H p r

, 1 1

) | ( ) ( x x

δ

subject to the averaged value of incorrectly accepted hypotheses . (2)

∑ ∫ ∑

≠ = Γ = i j j i

j

, 1 1

α ≤ −

∫ ∑

Γ =

i

d H p H p

i S i i

x x ) | ( ) ( 1

1

SLIDE 5

Here is the number of tested hypotheses , supposed that the sample ( , is

bservation

space) is generated by distribution , ;

S

i i

H θ θ = : ) ,..., ( 1

n T

x x = x

n

R ∈ x

n

R

≡ = ) ,..., ; ,..., ( ) , (

1 1 i k i n i

x x p p θ θ θ x ) | (

i

H p x

S i ,..., 1 =

, ; is the vector of parameters of distribution, ; ; is

dimensional parametrical

space; in general, ; is the a priori probability of hypothesis ; is the region of acceptance of hypothesis ; is the maximum allowed level of the averaged value

f

the probabilities

f

incorrectly accepted hypotheses.

≡ = ) ,..., ; ,..., ( ) , (

1 1 k n

x x p p θ θ θ x ) | (

i

H p x

S i ,..., 1 =

) ,..., ( 1

k T

θ θ = θ

k i

Θ ∈ θ

k

Θ

k

k n ≠

) (

i

H p

i

H

i

Γ

i

H

α

S i i ,..., 1 : = ∀

SLIDE 6

Task (1), (2) is one of possible formulations of the constrained Bayesian problem. In a similar manner, we can introduce and solve different constrained Bayesian

tasks. Solutions of these tasks differ to each other but,

generally, they can be written as follows: , , (3) where , .

} ) | ( ) | ( : {

, 1

∑

≠ =

> = Γ

S j j j j j j

H p k H p k

x

x x

S j ,..., 1 =

+∞ < ≤

j

k

S ,..., 1 =

SLIDE 7

Investigation of these regions shows that they have identical properties, namely, in general case they can intersect and their union could not coincide with observation space. This fact allows us to use each of them for construction of the appropriate sequential method of testing of hypotheses. But, for simplicity, further we shall consider only the task (1), (2), though all results

btained below, after appropriate modifications, are true for all
ther

tasks also.

ther

tasks also. For concreteness let us rewrite the regions of acceptance of hypotheses (3) for the task (1), (2): , (4) where ( ) the same scalar value for all regions, is determined so that in (2) the equality takes place.

{ }

S j H p H p H p H p

j j S j i i i i j

,..., 1 , ) | ( ) ( ) | ( ) ( :

, 1

= < = Γ

∑

≠ =

x x x λ

λ

+∞ < < λ

SLIDE 8

3. Properties of the hypotheses acceptance

regions

It is known that, in traditional statements of the problem

f

statistical hypotheses testing, their acceptance regions are not intersected, i.e. , , and the union of all regions of acceptance of

∅ = Γ Γ

j i

j i ≠

, and the union of all regions of acceptance of hypotheses coincides with the observation space, i.e. . These conditions break down at consideration of above-formulated constrained Bayesian task

f

hypotheses testing.

j i ≠

n S i i

R = Γ

=

1

SLIDE 9

Namely, for any value of ( ) in restriction (2) (that is the same, for any value of ( )), in the observation space exist: the regions of unambiguous acceptance of the tested hypotheses, the regions of the suspicion on the validity of several (more

α

1 < < α

λ

+∞ < < λ

n

R

regions of the suspicion on the validity of several (more than one) tested hypotheses (corresponding to sub- regions of the intersection of the regions of acceptance of corresponding hypotheses (4)) and the region

f

impossibility of acceptance of the tested hypotheses (corresponding to the region of the space , which do not belong to any of the regions of acceptance of hypotheses (4)).

n

R

SLIDE 10

Accordingly, for any concrete observation result , on the basis of which the decision is made, in the interval there are such values that for the observation result belongs to only one

f the regions of acceptance of hypotheses (4) and the

corresponding hypothesis is accepted, respectively. At , the observation result appears in a sub-region of intersection of two or several regions of acceptance of

x

) 1 ; (

x

) ( ) (

* *

x x α α ≤

)] ( ); ( [

* *

x x α α α ∈

) (

* x

α α <

x

intersection of two or several regions of acceptance of hypotheses (4), and it is impossible to make a unique decision. In that case, the appropriate hypotheses are suspected on the validity. At , the observation result appears in the region of the space which does not belong to any of the regions of acceptance of hypotheses (4). In this case, it is impossible to make the decision on the basis

f the set observation result

. ) (

* x

α α >

x

n

R

x

SLIDE 11

Thus, the situation is similar to the sequential analysis when, on the basis of present observation results, it could be impossible to make a decision (with the given probabilities of errors) about the validity of one of the hypotheses from the considered set. Therefore, in the considered tasks, if there is the situation of impossibility considered tasks, if there is the situation of impossibility

f making an unambiguous decision for the given

significance level, we shall continue the observations until such an opportunity appears. Thus, on the basis of above-considered constrained Bayesian task, let us determine the method of sequential analysis for multiple testing problems. For clarity let us call this method the sequential analysis method of Bayesian type.

SLIDE 12

4. The method of sequential analysis of

Bayesian type

Let us suppose that there is an opportunity of

btaining repeated observations. Let

be the sampling space of all possible samples of indepen- dent -dimensional observation vectors . Let us split into disjoint sub-regions ,

n m

R

m

n

) ,..., ( 1

n T

x x = x

n

R

1 + S

n

R

Let us split into disjoint sub-regions , ,..., , such that .

n m

R

1 + S

n m

R

1 ,

n m

R

2 ,

n S m

R ,

n S m

R

1 , +

1

1 , + =

=

S i n i m n m

R R

SLIDE 13

Let us determine the following decision rule. If the matrix of observation results belongs to the sub-region , , then hypothesis is accepted, and, if belongs to the sub-region , the decision is not made, and the observations go

n until one of the tested hypotheses is accepted.

) ,..., (

1 m

x x x =

n i m

R ,

S i ,..., 1 =

i

H

) ,..., (

1 m

x x x =

n S m

R

1 , +

SLIDE 14

Regions , are determined in the following way: , is such a part of acceptance region

f hypothesis

which does not belong to any

ther region

; is such a part

f sampling space

which belongs simultaneously to more than one region , or it does not belong

1 ,..., 1 ,

,

+ = S i Rn

i m

S i Rn

i m

,..., 1 ,

,

=

m i

Γ

i

H S i i j

m j

,..., 1 , 1 ,..., 1 , + − = Γ

n S m

R

1 , + n m

R

S i

m i

,..., 1 , = Γ

to any

f

these regions. Here the index ( ) points to the fact that the regions are determined on the basis of sequential observation results. On the basis of the properties described in Item 3, hypotheses acceptance regions , could be determined as follows.

i

m

,... 2 , 1 = m

m

1 ,..., 1 ,

,

+ = S i Rn

i m

SLIDE 15

Let us designate the population of sub-regions of intersections of acceptance regions

f hypotheses

in constrained Bayesian task of hypotheses testing with the regions of acceptance of other hypotheses , , by . By , we designate the population of regions

m i

Γ

i

H ) ,..., 1 ( S i =

j

H

i j S j ≠ = ; ,..., 1

m i

I

S

i m i n m n m

R E

1 = Γ

− =

f space

which do not belong to any of hypotheses acceptance regions. Then the hypotheses acceptance regions in the method of sequential analysis of Bayesian type are determined in the following way: ; . (6) Here regions , are defined on the basis of hypotheses acceptance regions (4) from Item 2.

i

i m m

R E

1 = Γ

− =

n m

R

S i I R

m i m i n i m

,..., 1 , /

,

= Γ =

( )

n

m S i m i n S m

E I R

1 1 , = + =

,

m i

Γ

,

m i

I ,

n m

E

S i ,..., 1 =

SLIDE 16

5. Investigation of the method of sequential

analysis of Bayesian type

For clarity, by and , we shall designate the probabilities of errors of the first and the second kinds for sequential method of Bayesian type, and, by and , the same quantities for constrained Bayesian

1

α

1

β

α

β

and , the same quantities for constrained Bayesian task.

β

SLIDE 17

Theorem 5.1. If the probability distribution , , is such that an increase in the sample size entails a decrease in the entropy concerning

) | (

i

H p x

S i ,..., 1 = entails a decrease in the entropy concerning distribution parameters about which the hypotheses are formulated, then infinitely increasing number of repeated observations, i.e. in the sequential analysis method

f

Bayesian type, entails infinite decreasing probabilities of errors of the first and the second kinds, i.e. and .

m

θ

∞ → m

1 →

α

1 →

β

SLIDE 18

Lemma 5.1. In the conditions of Theorem 5.1, at increasing divergence between tested hypotheses and , , Lagrange coefficient in solution (4) decreases, and, in the limit, at , takes place for the given .

) , (

j i H

H J

i

H

j

H j i S j i ≠ = ; ,..., 1 ,

λ

{ }

∞ → ) , ( min

, j i j i

H H J

→ λ

α

SLIDE 19

Lemma 5.2. In the conditions of Theorem 5.1, at infinitely decreasing divergence between tested hypotheses and , , i.e. at Lagrange coefficient in solution (4) tends to a certain value from the interval: depending on the value of .

) , (

j i H

H J

i

H

j

H

j i S j i ≠ = ; ,..., 1 ,

{ }

) , ( max

,

→

j i j i

H H J

λ

{ } { }

     

∑ ∑

≠ = ≠ = S j i i j i j S j i i j i j

H p H p H p H p

, 1 , 1

) ( / ) ( max , ) ( / ) ( min

α

SLIDE 20

Hereinafter we shall suppose that probability distributions , , are such that increasing information causes a decrease in the entropy relative to parameter which the hypotheses are formulated about.

) | (

i

H p x

S i ,..., 1 =

θ

parameter which the hypotheses are formulated about. Theorem 5.2. For any given sample size and as small errors of the first and the second kinds and as one likes, there always exists such a positive value that, if the divergence between tested hypotheses is more than that value, i.e. , and hold true.

θ

m

α′

β′

*

J

{ }

* ,

) , ( min J H H J

j i j i

>

α α ′ < ) (

1 J

β β ′ < ) (

1 J

SLIDE 21

Theorem 5.3. For any value of in constrained Bayesian task there always exists such an integer that if the number of repeated observations , in the method of sequential analysis of Bayesian type, is more than this value, i.e. , there will be accepted one of the tested hypotheses with the probability equal to unity.

α

*

m

*

m m >

SLIDE 22

6. Experimental research

Suppose that and that it is desired to test versus . The computation results

f

the sequentially processed sample generated by with 17

bservations were used (Table 1, where the arithmetic

) 1 , 1 ( N

) 1 , ( ~ θ ξ N

1 : − = θ H 1 : = θ

A

H

bservations were used (Table 1, where the arithmetic

mean of the observations is denoted by ). From here it is seen that the Wald and Berger’s tests yield absolutely the same results, though the reported error probabilities in the Berger’s and the Wald’s test differ to each other because Berger computed the error probabilities for the given value of the statistics.

m k

x x ,...,

m k

x ,

SLIDE 23

Out of 17 observations, correct decisions were taken 7 times on the basis of 3, 3, 5, 1, 1, 3 and 1 observations in times on the basis of 3, 3, 5, 1, 1, 3 and 1 observations in both tests (the Wald and Berger’s). The average value of

bservations for making the decision is equal to 2.43. In

the sequential test of Bayesian type for the same sample correct decisions were taken 10 times on the basis of 1, 2, 2, 1, 3, 2, 1, 1, 3 and 1 observations. The average value of

bservations for making the decision is equal to 1.7.

SLIDE 24

SLIDE 25

The reported error probabilities in the sequential test of Bayesian type and the Wald’s test decrease depending on the number of observations used for making the decision (see Table 2). By the Type II error probability it strongly surpasses the Wald’s test. While these characteristics for surpasses the Wald’s test. While these characteristics for the Berger’s test have no monotonous dependence on the number of observations (for the reason mentioned above). They basically are defined by the value of the likelihood ratio. For example, the value of the Type I error probability for 5 observations ( ) surpasses the analogous value for 3 observations and both

f them surpass the same value for 1 observation

.

11 7,..., x

x

16 15 14

, , x x x

17

x

SLIDE 26