557: MATHEMATICAL STATISTICS II RESULTS FROM CLASSICAL HYPOTHESIS TESTING

Σχετικά έγγραφα
557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING

ST5224: Advanced Statistical Theory II

Other Test Constructions: Likelihood Ratio & Bayes Tests

Statistical Inference I Locally most powerful tests

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

Theorem 8 Let φ be the most powerful size α test of H

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

STAT200C: Hypothesis Testing

2 Composition. Invertible Mappings

EE512: Error Control Coding

Uniform Convergence of Fourier Series Michael Taylor

Ordinal Arithmetic: Addition, Multiplication, Exponentiation and Limit

Every set of first-order formulas is equivalent to an independent set

Matrices and Determinants

Homework 3 Solutions

Areas and Lengths in Polar Coordinates

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

Example Sheet 3 Solutions

Solutions to Exercise Sheet 5

Areas and Lengths in Polar Coordinates

Exercise 2: The form of the generalized likelihood ratio

Section 8.3 Trigonometric Equations

Finite Field Problems: Solutions

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

A Note on Intuitionistic Fuzzy. Equivalence Relation

C.S. 430 Assignment 6, Sample Solutions

Second Order RLC Filters

Fractional Colorings and Zykov Products of graphs

Jesse Maassen and Mark Lundstrom Purdue University November 25, 2013

Inverse trigonometric functions & General Solution of Trigonometric Equations

Μηχανική Μάθηση Hypothesis Testing

An Introduction to Signal Detection and Estimation - Second Edition Chapter II: Selected Solutions

Fourier Series. MATH 211, Calculus II. J. Robert Buchanan. Spring Department of Mathematics

Solution Series 9. i=1 x i and i=1 x i.

Problem Set 3: Solutions

Concrete Mathematics Exercises from 30 September 2016

Math221: HW# 1 solutions

Second Order Partial Differential Equations

The Simply Typed Lambda Calculus

6.3 Forecasting ARMA processes

Reminders: linear functions

Srednicki Chapter 55

Lecture 21: Properties and robustness of LSE

Tridiagonal matrices. Gérard MEURANT. October, 2008

Chapter 6: Systems of Linear Differential. be continuous functions on the interval

Nowhere-zero flows Let be a digraph, Abelian group. A Γ-circulation in is a mapping : such that, where, and : tail in X, head in

ANSWERSHEET (TOPIC = DIFFERENTIAL CALCULUS) COLLECTION #2. h 0 h h 0 h h 0 ( ) g k = g 0 + g 1 + g g 2009 =?

Coefficient Inequalities for a New Subclass of K-uniformly Convex Functions

Απόκριση σε Μοναδιαία Ωστική Δύναμη (Unit Impulse) Απόκριση σε Δυνάμεις Αυθαίρετα Μεταβαλλόμενες με το Χρόνο. Απόστολος Σ.

Approximation of distance between locations on earth given by latitude and longitude

k A = [k, k]( )[a 1, a 2 ] = [ka 1,ka 2 ] 4For the division of two intervals of confidence in R +

forms This gives Remark 1. How to remember the above formulas: Substituting these into the equation we obtain with

6. MAXIMUM LIKELIHOOD ESTIMATION

The challenges of non-stable predicates

SCITECH Volume 13, Issue 2 RESEARCH ORGANISATION Published online: March 29, 2018

Lecture 34 Bootstrap confidence intervals

Partial Differential Equations in Biology The boundary element method. March 26, 2013

F19MC2 Solutions 9 Complex Analysis

4.6 Autoregressive Moving Average Model ARMA(1,1)

D Alembert s Solution to the Wave Equation

5. Choice under Uncertainty

Parametrized Surfaces

HOMEWORK 4 = G. In order to plot the stress versus the stretch we define a normalized stretch:

ECE598: Information-theoretic methods in high-dimensional statistics Spring 2016

Exercises 10. Find a fundamental matrix of the given system of equations. Also find the fundamental matrix Φ(t) satisfying Φ(0) = I. 1.

Last Lecture. Biostatistics Statistical Inference Lecture 19 Likelihood Ratio Test. Example of Hypothesis Testing.

Math 446 Homework 3 Solutions. (1). (i): Reverse triangle inequality for metrics: Let (X, d) be a metric space and let x, y, z X.

ORDINAL ARITHMETIC JULIAN J. SCHLÖDER

Probability and Random Processes (Part II)

1 String with massive end-points

Practice Exam 2. Conceptual Questions. 1. State a Basic identity and then verify it. (a) Identity: Solution: One identity is csc(θ) = 1

Main source: "Discrete-time systems and computer control" by Α. ΣΚΟΔΡΑΣ ΨΗΦΙΑΚΟΣ ΕΛΕΓΧΟΣ ΔΙΑΛΕΞΗ 4 ΔΙΑΦΑΝΕΙΑ 1

PARTIAL NOTES for 6.1 Trigonometric Identities

Lecture 2. Soundness and completeness of propositional logic

2. Let H 1 and H 2 be Hilbert spaces and let T : H 1 H 2 be a bounded linear operator. Prove that [T (H 1 )] = N (T ). (6p)

derivation of the Laplacian from rectangular to spherical coordinates

Homework 8 Model Solution Section

Some new generalized topologies via hereditary classes. Key Words:hereditary generalized topological space, A κ(h,µ)-sets, κµ -topology.

Section 7.6 Double and Half Angle Formulas

MINIMAL CLOSED SETS AND MAXIMAL CLOSED SETS

Notes on the Open Economy

Aquinas College. Edexcel Mathematical formulae and statistics tables DO NOT WRITE ON THIS BOOKLET

SOME PROPERTIES OF FUZZY REAL NUMBERS

Quadratic Expressions

1. Introduction and Preliminaries.

b. Use the parametrization from (a) to compute the area of S a as S a ds. Be sure to substitute for ds!

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

CRASH COURSE IN PRECALCULUS

6.1. Dirac Equation. Hamiltonian. Dirac Eq.

ω ω ω ω ω ω+2 ω ω+2 + ω ω ω ω+2 + ω ω+1 ω ω+2 2 ω ω ω ω ω ω ω ω+1 ω ω2 ω ω2 + ω ω ω2 + ω ω ω ω2 + ω ω+1 ω ω2 + ω ω+1 + ω ω ω ω2 + ω

Math 6 SL Probability Distributions Practice Test Mark Scheme

A Two-Sided Laplace Inversion Algorithm with Computable Error Bounds and Its Applications in Financial Engineering

HOMEWORK#1. t E(x) = 1 λ = (b) Find the median lifetime of a randomly selected light bulb. Answer:

F A S C I C U L I M A T H E M A T I C I

( y) Partial Differential Equations

2. THEORY OF EQUATIONS. PREVIOUS EAMCET Bits.

Online Appendix to Measuring the Bullwhip Effect: Discrepancy and Alignment between Information and Material Flows

Sequent Calculi for the Modal µ-calculus over S5. Luca Alberucci, University of Berne. Logic Colloquium Berne, July 4th 2008

CE 530 Molecular Simulation

Transcript:

Most Powerful Tests 557: MATHEMATICAL STATISTICS II RESULTS FROM CLASSICAL HYPOTHESIS TESTING To construct and assess the quality of a statistical test, we consider the power function β(θ). Consider a family tests C for testing H 0 and H 1 with corresponding subsets Θ 0 and Θ 1. The uniformly most powerful (UMP) test T is the test whose power function β(θ) dominates the power function, β (θ), of any other test T C at all θ Θ 1, β(θ) β (θ) θ Θ 1. A test with power function β(θ) is unbiased if for all θ 0 Θ 0, θ 1 Θ 1, β(θ 1 ) β(θ 0 ). A simple hypothesis is one which species the distribution of the data completely. Consider a parametric model f X (x; θ) with parameter space Θ = {θ 0, θ 1 }, and the test of Then both H 0 and H 1 are simple hypotheses. H 1 : θ = θ 1 Theorem (The Neyman-Pearson Lemma) Consider a parametric model f X (x; θ) with parameter space Θ = {θ 0, θ 1 }. A test of H 1 : θ = θ 1 is required. Consider a test T with rejection region R that satises f X (x; θ 1 ) > cf X (x; θ 0 ) = x R f X (x; θ 1 ) < cf X (x; θ 0 ) = x R for some k 0, and Pr[X R θ = θ 0 ] = α. Then T is UMP in the class, C α, of tests at level α. Further, if such a test exists with k > 0, then all tests at level α also have size α (that is, α is the least upper bound of the power function β(θ)), and have rejection region identical to that of T, except perhaps if x A and Pr[X A; θ 0 ] = Pr[X A; θ 1 ] = 0. Proof As Pr[X R; θ 0 ] = α, the test T has size and level α. Consider the test function ϕ R (x) for this test, and ϕ R (x) be the test function for any other α level test, T. Denote by β(θ) and β (θ) be the power functions for these two tests. Now g(x) = (ϕ R (x) ϕ R (x))(f X (x; θ 1 ) cf X (x; θ 0 )) 0 as x R R = ϕ R (x) = ϕ R (x) = 1 g(x) = 0 x R R = ϕ R (x) = 1, ϕ R (x) = 0, f X (x; θ 1 ) > cf X (x; θ 0 ) g(x) > 0 x R R = ϕ R (x) = 0, ϕ R (x) = 1, f X (x; θ 1 ) < cf X (x; θ 0 ) g(x) > 0 x R R = ϕ R (x) = ϕ R (x) = 0 g(x) = 0. 1

Thus (ϕ R (x) ϕ R (x))(f X (x; θ 1 ) cf X (x; θ 0 )) dx 0 X but this inequality can be written in terms of the power functions as (β(θ 1 ) β (θ 1 )) k(β(θ 0 ) β (θ 0 )) 0 (1) As β(θ) and β (θ) are bounded above by α, and β(θ 0 ) = α as T is a size α, we have that β(θ 0 ) β (θ 0 ) = α β (θ 0 ) 0 β(θ 1 ) β (θ 1 ) 0 Thus β(θ 1 ) β (θ 1 ), and hence T function β is arbitrarily chosen. is UMP, as θ 1 is the only point in Θ 1, and the test with power Now consider any UMP test T C α. By the result above, T this case, if c > 0, we have from equation (1) that is UMP at level α, so β(θ 1 ) = β (θ 1 ). In β(θ 0 ) β (θ 0 ) = α β (θ 0 ) 0. But, by assumption, T is a level α test, so we also have α β (θ 0 ) 0, and hence β (θ 0 ) = α, that is, T is also a size α test. Therefore (ϕ R (x) ϕ R (x))(f X (x; θ 1 ) cf X (x; θ 0 )) dx = 0 (2) X where the integrand in equation (2) is a non-negative function. Let A be the collection of sets of probability (that is, density) zero under both f X (x; θ 0 ) and f X (x; θ 1 ), then (ϕ R (x) ϕ R (x))(f X (x; θ 1 ) cf X (x; θ 0 )) dx = 0 A A A irrespective of the nature of R, so the functions ϕ R (x) and ϕ R (x) may not be equal for x in such a set A. Apart from that specic case, the integral in equation (2) can only be zero if at least one of the two factors is identically zero for all x. The second factor cannot be identically zero for all x, as the densities must integrate to one. Thus, for all x X \ A, ϕ R (x) = ϕ R (x), and hence R satises the same conditions as R. Notes: To nd the value c that appears in the Theorem, we need to compute Pr [X R; θ 0 ] for a xed level/size α. It is possible that, for given alternative hypotheses, no UMP test exists. Also, for discrete data, it may not be possible to solve the equation Pr [X R; θ 0 ] = α for every value of α, and hence only specic values of α may be attained. The test can be reformulated in terms of the statistic λ(x) and region R λ {t R + : t > c}: λ(x) = f X(x; θ 1 ) f X (x; θ 0 ) x R λ(x) R λ If T (X) is a sufcient statistic for θ, then by the factorization theorem so that f X (x; θ 1 ) f X (x; θ 0 ) = g(t (x); θ 1)h(x) g(t (x); θ 0 )h(x) = g(t (x); θ 1) g(t (x); θ 0 ) λ(x) R λ T (x) R T say. Thus any test based on T (x) with critical region R T is a UMP α level test, and α = Pr[T (X) R T ; θ 0 ] 2

Composite Null Hypotheses Often the null and alternative hypotheses do not specify the distribution of the data completely. For example, the specication H 1 : θ θ 0 could be of interest. If, in general, a UMP test of size α is required, then its power must equal the power of the most powerful test of for all θ 1 Θ 1. H 1 : θ = θ 1 Models with a Monotone Likelihood Ratio For one class of models, nding UMP tests for composite hypotheses is possible in general. A parametric family F of probability models indexed by parameter θ Θ has a monotone likelihood ratio if for θ 2 > θ 1, and for x in the union of the supports of the two densities f X (x; θ 1 ) and f X (x; θ 2 ), is a monotone function of x. λ(x) = f X(x; θ 2 ) f X (x; θ 1 ) Theorem (Karlin-Rubin Theorem) Suppose that a test of the hypotheses H 0 : θ θ 0 H 1 : θ > θ 0 is required. Suppose that T (X) is a sufcient statistic for θ, and that f T for θ Θ has a monotone (non-decreasing) likelihood ratio, that is for θ 2 θ 1 and t 2 t 1 f T (t 2 ; θ 1 ) f T (t 1 ; θ 2 ) f T (t 1 ; θ 1 ). Then for any t 0, the test T with critical region R T dened by T (x) > t 0 = T (x) R T T (x) t 0 = T (x) R T is a UMP α level test, where α = Pr[T > t 0 ; θ 0 ]. Proof Let β(θ) be the power function of T. Now, for t 2 t 1, f T (t 2 ; θ 1 ) f T (t 1 ; θ 2 ) f T (t 1 ; θ 1 ) f T (t 1 ; θ 1 ) f T (t 1 ; θ 2 )f T (t 2 ; θ 1 ) (3) Integrating both sides with respect to t 1 on (, t 2 ), we obtain F T (t 2 ; θ 1 ) F T (t 2 ; θ 2 )f T (t 2 ; θ 1 ) 3 f T (t 2 ; θ 1 ) F T (t 2 ; θ 2 ) F T (t 2 ; θ 1 ).

Alternatively, integrating both sides of equation (3) with respect to t 2 on (t 1, ), we similarly obtain f T (t 1 ; θ 2 ) f T (t 1 ; θ 1 ) 1 F T (t 1 ; θ 2 ) 1 F T (t 1 ; θ 1 ) But setting t 1 = t 2 = t in these two inequalities yields which, on rearrangement yields 1 F T (t; θ 2 ) 1 F T (t; θ 1 ) F T (t; θ 2 ) F T (t; θ 1 ) 1 F T (t; θ 2 ) F T (t; θ 2 ) 1 F T (t; θ 1 ) F T (t; θ 1 ) F T (t; θ 2 ) F T (t; θ 1 ) (4) as F T (t; θ) is non-decreasing in t, and the function g(x) = (1 x)/x is non-increasing for 0 < x < 1. Finally, β(θ 2 ) β(θ 1 ) = Pr[T > t 0 θ 2 ] Pr[T > t 0 θ 1 ] = (1 F T (t; θ 2 )) (1 F T (t; θ 1 )) = F T (t; θ 1 ) F T (t; θ 2 ) 0 so β(θ) is non-decreasing in θ. Hence sup β(θ) = β(θ 0 ) = Pr[T > t 0 θ 0 ] = α θ θ 0 so T is an α level test. Now, let θ > θ 0, and consider the simple hypotheses Let k be dened by H0 : θ = θ 0 H1 : θ = θ. k = inf t T 0 f T (t; θ ) f T (t; θ 0 ) where T 0 = {t : t > t 0, and f T (t; θ ) > 0 or f T (t; θ 0 ) > 0}. Then T > t 0 f T (t; θ ) f T (t; θ 0 ) > k so that, by the Neyman-Pearson Lemma, T is UMP for testing H0 versus H 1 ; for any other test T of H0 at level α with power function β that satises β (θ 0 ) α, we have that β(θ ) β (θ ). But for any α level test T of H 0, we have β (θ 0 ) α. Thus taking T T, we can conclude that β(θ ) β (θ ). This inequality holds for all θ Θ 1, so T must be UMP at level α. Note: The theorem also covers the case where we are interested in hypotheses H 0 : θ θ 0 H 1 : θ < θ 0 and we have a non-increasing monotone likelihood ratio, that is for θ 2 θ 1 and t 2 t 1 f T (t 2 ; θ 1 ) f T (t 1 ; θ 2 ) f T (t 1 ; θ 1 ). 4

THE LIKELIHOOD RATIO TEST The Likelihood Ratio Test (LRT) statistic for testing H 0 against H 1 is based on the statistic H 0 : θ Θ 0 H 1 : θ Θ 1 λ X (x) = 0 sup f X (x; θ) say, where Θ Θ 0 Θ 1 ; H 0 is rejected if λ X (x) is small enough, that is, λ X (x) k for some k to be dened. Note that Θ is not necessarily the entire parameter space, just the union of Θ 0 and Θ 1. Theorem If T (X) is a sufcient statistic for θ, then λ X (x) = λ T (T (x)) = sup f T (T (x); θ) 0 sup f T (T (x); θ) x X Proof As T (X) is sufcient, for any θ 0, θ 1, f X (x ; θ 0 ) f X (x ; θ 1 ) = g(t (x); θ 0)h(x) g(t (x); θ 1 )h(x) = g(t (x); θ 0) g(t (x); θ 1 ) = f T (T (x); θ 0 ) f T (T (x); θ 1 ) by the Neyman factorization theorem, where the last equality follows as the normalizing constants in numerator and denominator are identical. Hence, at the suprema, the LRT statistics are equal. Union and Intersection Tests The construction of union and intersection tests is necessary to formulate the assessment of size and power for tests involving composite hypotheses. Suppose rst that we require a test T for the null hypothesis expressed as H 0 : θ Θ 0 where Θ 0γ, γ G are a collection of subsets of Θ. Suppose that Tγ is a test for the hypotheses H 0γ : θ Θ 0γ H 1γ : θ Θ 0γ with test statistic T γ (X) and critical region R γ. Then the rejection region for T R G Θ 0γ R γ = T rejects H 0 if x {x : T γ (x) R γ } that is, if any one of the Tγ rejects H 0γ. This test is termed a Union-Intersection Test (UIT). Suppose now that we require a test T for the null hypothesis expressed as H 0 : θ Θ 0 Θ 0γ is 5

Then, by the same logic as above, the rejection region for T is R G R γ = T rejects H 0 if x {x : T γ (x) R γ } that is, if all of the Tγ reject H 0γ. This test is termed an Intersection-Union Test (IUT). Note that if α γ is the size of the test of H 0γ, then the IUT is a level α test, where as, for each γ and for any θ Θ 0, α = sup α γ α α γ = Pr[X R γ ; θ] Pr[X R; θ] Theorem Consider testing H 0 : θ Θ 0 Θ 0γ using the global likelihood ratio statistic H 1 : θ Θ 0 λ(x) = 0 sup f X (x; θ) equipped with the usual critical region R {x : λ(x) < c}, and the collection of likelihood ratio statistics λ γ (x) 0γ λ γ (x) = Dene statistic T (x) = inf λ γ(x), and consider the critical region Then (a) T (x) λ(x) for all x. R G {x : λ γ (x) < c, some γ G} {x : T (x) < c}, (b) If β T and β λ are the power functions for the tests based on T (X) and λ(x) respectively, then β T (θ) β λ (θ) for all θ Θ (c) If the test based on λ(x) is an α level test, then the test based on T (X) is also an α level test. Proof For (a), as Θ 0 Θ 0γ, we have and thus for (b), for any θ, Hence which proves (c). λ γ (x) λ(x) for each γ T (x) = inf λ γ(x) λ(x) β T (θ) = Pr[T (X) < c; θ] Pr[λ(X) < c; θ] = β λ (θ). sup β T (θ) sup β λ (θ) α 0 0 6

P-VALUES Consider a test of hypothesis H 0 dened by region Θ 0 of the parameter space. A p-value, p(x), is a test statistic such that 0 p(x) 1 for each x. A p-value is valid if, for every θ Θ 0 and 0 α 1 Pr[p(X) α ; θ] α. That is, a valid p-value is a test statistic that produces a test at level α of the form p(x) α = x R p(x) > α = x R The most common construction of a valid p-value is given by the following theorem. Theorem Suppose that T (X) is a test statistic constructed so that a large value of T (X) supports H 1. Then the statistic p(x) given for each x X by say, is a valid p-value. p(x) = sup θ Θ 0 Pr[T (X) T (x); θ] = sup θ Θ 0 p θ (X) (5) Proof For θ Θ 0, we have p θ (x) = Pr[T (X) T (x); θ] = Pr[ T (X) T (x); θ] = F θ ( T (x)) F S (s) say, dening F S F θ as the cdf of S = T (X); clearly 0 p(x) 1 as 0 p θ (x) 1 for all θ. This recalls a result from distribution theory; if X F X, the U = F X (X) Uniform(0, 1). Suppressing the dependence on θ for convenience, dene random variable Y by Y = F θ ( T (X)) F S (S) (= p θ (X)) and let A y {s : F S (s) y}. If A y is a half-closed interval (, s y ], then F Y (y) = Pr[Y y] = Pr[F S (S) y] = Pr[S A y ] = F S (s y ) y by denition of A y, as s y A y. If A y is a half-open interval (, s y ) F Y (y) = Pr[Y y] = Pr[F S (S) y] = Pr[S A y ] = by continuity of probability. Putting the components together, for 0 α 1, Pr[p θ (X) α; θ] Pr[Y α] α But by the denition in equation (5), p(x) p θ (x), so and the result follows. Pr[p(X) α; θ] Pr[p θ (X) α; θ] α lim s s y F S (s) y 7