Theorem 8 Let φ be the most powerful size α test of H

Σχετικά έγγραφα
Other Test Constructions: Likelihood Ratio & Bayes Tests

ST5224: Advanced Statistical Theory II

Statistical Inference I Locally most powerful tests

STAT200C: Hypothesis Testing

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

2 Composition. Invertible Mappings

Exercise 2: The form of the generalized likelihood ratio

557: MATHEMATICAL STATISTICS II RESULTS FROM CLASSICAL HYPOTHESIS TESTING

Solution Series 9. i=1 x i and i=1 x i.

6. MAXIMUM LIKELIHOOD ESTIMATION

557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING

Ordinal Arithmetic: Addition, Multiplication, Exponentiation and Limit

4.6 Autoregressive Moving Average Model ARMA(1,1)

Example Sheet 3 Solutions

Every set of first-order formulas is equivalent to an independent set

Last Lecture. Biostatistics Statistical Inference Lecture 19 Likelihood Ratio Test. Example of Hypothesis Testing.

C.S. 430 Assignment 6, Sample Solutions

3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

Problem Set 3: Solutions

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

Μηχανική Μάθηση Hypothesis Testing

EE512: Error Control Coding

Lecture 34 Bootstrap confidence intervals

Inverse trigonometric functions & General Solution of Trigonometric Equations

Section 8.3 Trigonometric Equations

Finite Field Problems: Solutions

An Introduction to Signal Detection and Estimation - Second Edition Chapter II: Selected Solutions

Solutions to Exercise Sheet 5

Concrete Mathematics Exercises from 30 September 2016

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

Lecture 2. Soundness and completeness of propositional logic

Fourier Series. MATH 211, Calculus II. J. Robert Buchanan. Spring Department of Mathematics

The challenges of non-stable predicates

Homework for 1/27 Due 2/5

The Simply Typed Lambda Calculus

Lecture 12: Pseudo likelihood approach

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

Nowhere-zero flows Let be a digraph, Abelian group. A Γ-circulation in is a mapping : such that, where, and : tail in X, head in

5.4 The Poisson Distribution.

Fractional Colorings and Zykov Products of graphs

Uniform Convergence of Fourier Series Michael Taylor

Tridiagonal matrices. Gérard MEURANT. October, 2008

ORDINAL ARITHMETIC JULIAN J. SCHLÖDER

Matrices and Determinants

6.3 Forecasting ARMA processes

Areas and Lengths in Polar Coordinates

Section 9.2 Polar Equations and Graphs

( y) Partial Differential Equations

Math221: HW# 1 solutions

Areas and Lengths in Polar Coordinates

Bayesian statistics. DS GA 1002 Probability and Statistics for Data Science.

A Note on Intuitionistic Fuzzy. Equivalence Relation

Homework 3 Solutions

More Notes on Testing. Large Sample Properties of the Likelihood Ratio Statistic. Let X i be iid with density f(x, θ). We are interested in testing

Aquinas College. Edexcel Mathematical formulae and statistics tables DO NOT WRITE ON THIS BOOKLET

Lecture 21: Properties and robustness of LSE

Math 446 Homework 3 Solutions. (1). (i): Reverse triangle inequality for metrics: Let (X, d) be a metric space and let x, y, z X.

HOMEWORK 4 = G. In order to plot the stress versus the stretch we define a normalized stretch:

Math 6 SL Probability Distributions Practice Test Mark Scheme

k A = [k, k]( )[a 1, a 2 ] = [ka 1,ka 2 ] 4For the division of two intervals of confidence in R +

b. Use the parametrization from (a) to compute the area of S a as S a ds. Be sure to substitute for ds!

F19MC2 Solutions 9 Complex Analysis

Chapter 6: Systems of Linear Differential. be continuous functions on the interval

ΕΙΣΑΓΩΓΗ ΣΤΗ ΣΤΑΤΙΣΤΙΚΗ ΑΝΑΛΥΣΗ

Section 7.6 Double and Half Angle Formulas

2. Let H 1 and H 2 be Hilbert spaces and let T : H 1 H 2 be a bounded linear operator. Prove that [T (H 1 )] = N (T ). (6p)

6.1. Dirac Equation. Hamiltonian. Dirac Eq.

Reminders: linear functions

Jesse Maassen and Mark Lundstrom Purdue University November 25, 2013

Estimation for ARMA Processes with Stable Noise. Matt Calder & Richard A. Davis Colorado State University

Bounding Nonsplitting Enumeration Degrees

Practice Exam 2. Conceptual Questions. 1. State a Basic identity and then verify it. (a) Identity: Solution: One identity is csc(θ) = 1

Lecture 7: Overdispersion in Poisson regression

SCITECH Volume 13, Issue 2 RESEARCH ORGANISATION Published online: March 29, 2018

An Inventory of Continuous Distributions

Approximation of distance between locations on earth given by latitude and longitude

5. Choice under Uncertainty

Congruence Classes of Invertible Matrices of Order 3 over F 2

The Probabilistic Method - Probabilistic Techniques. Lecture 7: The Janson Inequality

Statistics 104: Quantitative Methods for Economics Formula and Theorem Review

P AND P. P : actual probability. P : risk neutral probability. Realtionship: mutual absolute continuity P P. For example:

Depth versus Rigidity in the Design of International Trade Agreements. Leslie Johns

Lecture 2: Dirac notation and a review of linear algebra Read Sakurai chapter 1, Baym chatper 3

D Alembert s Solution to the Wave Equation

ANSWERSHEET (TOPIC = DIFFERENTIAL CALCULUS) COLLECTION #2. h 0 h h 0 h h 0 ( ) g k = g 0 + g 1 + g g 2009 =?

A Conception of Inductive Logic: Example

Homework 8 Model Solution Section

Chapter 3: Ordinal Numbers

Local Approximation with Kernels

Srednicki Chapter 55

APPENDICES APPENDIX A. STATISTICAL TABLES AND CHARTS 651 APPENDIX B. BIBLIOGRAPHY 677 APPENDIX C. ANSWERS TO SELECTED EXERCISES 679

derivation of the Laplacian from rectangular to spherical coordinates

ECE598: Information-theoretic methods in high-dimensional statistics Spring 2016

Exercises 10. Find a fundamental matrix of the given system of equations. Also find the fundamental matrix Φ(t) satisfying Φ(0) = I. 1.

Lecture 26: Circular domains

ENGR 691/692 Section 66 (Fall 06): Machine Learning Assigned: August 30 Homework 1: Bayesian Decision Theory (solutions) Due: September 13

DESIGN OF MACHINERY SOLUTION MANUAL h in h 4 0.

MATH423 String Theory Solutions 4. = 0 τ = f(s). (1) dτ ds = dxµ dτ f (s) (2) dτ 2 [f (s)] 2 + dxµ. dτ f (s) (3)

ΕΛΛΗΝΙΚΗ ΔΗΜΟΚΡΑΤΙΑ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΡΗΤΗΣ. Ψηφιακή Οικονομία. Διάλεξη 10η: Basics of Game Theory part 2 Mαρίνα Μπιτσάκη Τμήμα Επιστήμης Υπολογιστών

Transcript:

Testing composite hypotheses Θ = Θ 0 Θ c 0 H 0 : θ Θ 0 H 1 : θ Θ c 0 Definition 16 A test φ is a uniformly most powerful (UMP) level α test for H 0 vs. H 1 if φ has level α and for any other level α test φ E θ [φ(x)] E θ [φ (X)] θ Θ c 0. Theorem 8 Let φ be the most powerful size α test of H 0 : θ = θ 0 vs. H 1 : θ = θ 1, where θ 0 Θ 0 and θ 1 Θ c 0. (φ is assumed to be of the form in a) of Neyman-Pearson lemma.) Suppose that 1) φ does not depend on θ 1, and 2) E θ [φ(x)] α θ Θ 0. Then φ is the UMP level α test of H 0 vs. H 1. 153

Proof 2) that φ is level α. Let φ be any other level α test of H 0 vs. H 1. Since φ does not depend on θ 1, it follows from the Neyman- Pearson lemma that E θ [φ(x)] E θ [φ (X)] θ Θ c 0 and any test φ such that E θ0 [φ (X)] α. But φ is of this form since it is level α for testing H 0 vs. H 1. The result follows immediately. Note In most cases where Theorem 8 is applied, θ 0 is chosen to be on the boundary between Θ 0 and Θ c 0. Example 33 Let X 1,..., X n be i.i.d. N(θ, 1) observations. Want to test H 0 : θ θ 0 vs. H 1 : θ > θ 0. Let θ 1 > θ 0. In Example 30 we found the most powerful size α test of H 0 : θ = θ 0 vs. H 1 : θ = θ 1 is φ(x) = { 1, x θ0 + z α / n 0, x < θ 0 + z α / n. 154

This test does not depend on θ 1. Does it have level α when used to test H 0 vs. H 1? Let θ < θ 0. P θ ( X θ 0 + z α n ) = P θ ( X θ 1/ n ) n(θ 0 θ) + z α = P (Z n(θ 0 θ) + z α ) < α since θ 0 > θ. This explains why θ 0 was used in H 0. In many cases the most powerful level α test of H 0 : θ = θ 0 vs. H 1 : θ = θ 1 (θ 1 > θ 0 ) will depend on θ 1. In such cases a UMP test for H 0 : θ θ 0 vs. H 1 : θ > θ 0 does not exist. However, there is a large class of distributions for which UMP tests do exist. 155

Let X 1,..., X n be i.i.d. observations with pdf or pmf p(x θ), θ Θ R. Define f(x θ) = p(x 1 θ) p(x n θ). Definition 17 The family of distributions {f(x θ) : θ Θ} is said to have the monotone likelihood ratio property in the statistic T (X) if for θ 1 < θ 2 the ratio f(x θ 2 )/f(x θ 1 ) is a nondecreasing function of T (x) on {x : f(x θ 1 ) > 0 or f(x θ 2 ) > 0}. Example 34 Let X 1,..., X n be i.i.d. U(0, θ) observations, where θ > 0. The joint pdf is f(x θ) = θ n I (0,x(n) ) (x (1) )I (0,θ) (x (n) ). 156

Let 0 < θ 1 < θ 2, and x be such that 0 < x (n) < θ 2. (Otherwise both densities are 0.) f(x θ 2 ) f(x θ 1 ) = { (θ1 /θ 2 ) n, 0 < x (n) < θ 1, θ 1 x (n) < θ 2. Hence, the family of U(0, θ) densities has MLR in T (X) = X (n). Theorem 9 family The one-parameter exponential f(x θ) = exp [c(θ)t (x) + d(θ) + S(x)] I A (x) has MLR in T when c is nondecreasing in θ. Proof Let θ 1 < θ 2 and x A. (Why do we assume x A?) 157

f(x θ 2 ) f(x θ 1 ) = exp {T (x)[c(θ 2) c(θ 1 )]+ d(θ 2 ) d(θ 1 )}. Since c(θ 1 ) c(θ 2 ), this is a nondecreasing function of T (x). Normal, Poisson, binomial, and exponential distributions all have the MLR property. Theorem 10 Let X 1,..., X n be i.i.d. observations with joint pdf f(x θ), θ Θ, and let the family {f(x θ) : θ Θ} have the MLR property in T. Let θ 0 Θ. Then for testing the composite hypotheses H 0 : θ θ 0 vs. H 1 : θ > θ 0 at level α, there exists a test which is UMP among all level α tests. The UMP test is φ(x) = 1, if T (x) > k γ, if T (x) = k 0, if T (x) < k 158

where k and γ are determined by α = E θ0 [φ(x)] = P θ0 [T (X) > k] + γp θ0 [T (X) = k]. Proof Roussas, p. 279; Lehmann, pp. 68-69; Casella and Berger, p. 391. Note The UMP test for H 0 : θ θ 0 vs. H 1 : θ < θ 0 in the situation of Theorem 10 is given by reversing the inequalities in the definition of φ. Example 35 Let X 1,..., X n be i.i.d. U(0, θ). From Example 34 {f(x θ) : θ > 0} has MLR in X (n). By Theorem 10 the UMP test of is H 0 : θ θ 0 vs. H 1 : θ > θ 0 φ(x) = { 1, if x(n) c 0, if x (n) < c, where E θ0 [φ(x)] = α = P θ0 (X (n) c). 159

P θ0 (X (n) c) = 1 P θ0 (X 1 < c,..., X n < c) = 1 [P (X 1 < c)] n = 1 ( c θ 0 ) n The last quantity is α iff c = θ 0 (1 α) 1/n. For θ > θ 0 (1 α) 1/n, the power curve is β(θ) = P θ (X (n) c) = 1 [ θ0 θ (1 α)1/n ] n = 1 ( θ0 θ ) n (1 α). For θ θ 0 (1 α) 1/n, β(θ) = 0. Note: As θ, β(θ) 1. 160

Example 36 Let X 1,..., X n be i.i.d. N(0, θ). Θ = {θ : θ > 0}. Find UMP test of H 0 : θ θ 0 vs. H 1 : θ < θ 0. Defining η = 1/θ and η 0 = 1/θ 0, these hypotheses are equivalent to H 0 : η η 0 vs. H 1 : η > η 0. Check for the MLR property. f(x η) = exp η 2 n i=1 x 2 i n 2 log(2πη 1 ) Since η/2 is an increasing function of η, Theorems 9 and 10 tell us that the UMP level α test of H 0 vs. H 1 has the form φ(x) = { 1, if ni=1 x 2 i c 0, if n i=1 x 2 i < c. 161

The test on the last page can also be written { 1, if ni=1 x 2 φ(x) = i c 0, if n i=1 x 2 i > c. The constant c is such that P η0 ( n i=1 X 2 i c ) = α. When η = η 0, we know that η 0 X i N(0, 1), and so η 0 X 2 i χ 2 1 and η 0 ni=1 X 2 i χ 2 n. It follows that c = χ 2 n,α/η 0, where χ 2 n,p is the 100pth percentile of the χ 2 n distribution. So, we have found the most powerful level α test of H 0 vs. H 1, and hence of H 0 vs. H 1. β(θ) = P θ [ χ 2 n ( θ0 θ ) ] χ 2 n,α Limiting cases: θ = 0 and θ =. 162

Example 37: UMP tests do not always exist Let the probability model be as in Example 36. Want to test H 0 : θ = θ 0 vs. H 1 : θ θ 0. Θ 0 = {θ 0 } Θ c 0 = (0, ) {θ 0} c MP test of H 0 : θ = θ 0 vs. H 1 : θ = θ 1 for θ 1 > θ 0 is { 1, if ni=1 x 2 φ 1 (x) = i c 0, if n i=1 x 2 i < c. MP test of H 0 : θ = θ 0 vs. H 1 : θ = θ 2 for θ 2 < θ 0 is { 1, if ni=1 x 2 φ 2 (x) = i c 1 0, if n i=1 x 2 i > c 1. Assume φ is UMP for testing H 0 vs. H 1. Then it is most powerful for H 0 : θ = θ 0 vs. H 1 : θ = θ 1 and hence agrees with φ 1 by N-P lemma. Also, it must agree with φ 2 by the same logic. But φ 1 φ 2, which yields a contradiction. Hence, there is no UMP test. 163

When a UMP test doesn t exist, one can look at a smaller class of tests and try to find the most powerful test within the smaller class. Examples of such tests: (i) Class of unbiased tests A test is said to be unbiased if β(θ) size of test for all θ Θ c 0. (ii) Class of invariant tests See pp. 213-222 of Lehmann, Testing Statistical Hypotheses. 164

Likelihood ratio tests Neyman-Pearson says that the most powerful level α test of H 0 : θ = θ 0 vs. H 1 : θ = θ 1 rejects H 0 for large values of f(x θ 1 ) f(x θ 0 ) = L(θ 1 x) L(θ 0 x). The test statistic is a ratio of likelihoods. Extend this notion to testing composite hypotheses. Let X 1,..., X n be observations with joint pdf f(x θ), θ Θ = Θ 0 Θ c 0, and consider sup θ Θ c 0 L(θ x) sup θ Θ0 L(θ x). The numerator is the likelihood function at the most plausible value of θ in Θ c 0. Likewise, the denominator is L at the most plausible value of θ in Θ 0. 165

It seems reasonable to test H 0 : θ Θ 0 vs. H 1 : θ Θ c 0 by rejecting H 0 if and only if the likelihood ratio on the previous page is sufficiently large. In practice, the test statistic most often used is λ(x), where λ(x) = sup θ Θ L(θ x) sup θ Θ0 L(θ x). Define and L(Θ 0 ) = sup θ Θ 0 L(θ x) L(Θ c 0 ) = sup θ Θ c L(θ x). 166

λ(x) = max ( L(Θ 0 ), L(Θ c 0 )) L(Θ 0 ) ( ) L(Θ c = max 0 ) L(Θ 0 ), 1. The likelihood ratio test is { 1, λ(x) > c φ L (x) = 0, λ(x) < c, where c is chosen so that E θ [φ L (X)] α for all θ Θ 0. Note that λ(x) 1 for all x. Recall that under general conditions MLEs are consistent. If H 0 is true, the MLE ˆθ will be close to H 0. If ˆθ Θ 0, then λ(x) = 1, and we won t reject H 0. When ˆθ Θ c 0 and far enough from Θ 0, then we will reject H 0. 167

To calculate λ(x) we need ˆθ, the MLE of θ, and the restricted MLE ˆθ R, where L(ˆθ R x) = sup θ Θ 0 L(θ x). Usually we try to find a statistic T (X) such that λ(x) is a monotone function of T (x). The likelihood ratio test can then be formulated in terms of T (x). Likelihood ratio tests are especially useful in two situations: (i) Two-sided tests (ii) Tests in the presence of nuisance parameters 168

Example 38: Two-sided test X 1,..., X n is a random sample from N(0, θ). H 0 : θ = θ 0 vs. H 1 : θ θ 0 The unrestricted MLE of θ is ˆθ = n i=1 Xi 2/n, and, trivially, the restricted MLE of θ is θ 0. [ L(ˆθ x) L(θ 0 x) = (2πˆθ) n/2 exp ni=1 x 2 i /(2ˆθ) ] (2πθ 0 ) n/2 exp [ n i=1 x 2 i /(2θ 0) ] Now, = ( θ0ˆθ ) n/2 exp [ n 2 + nˆθ 2θ 0 log(λ(x)) = n 2 log(θ 0/ˆθ) n 2 (1 ˆθ/θ 0 ) ] = n 2 [ log(ˆθ/θ 0 ) + (1 ˆθ/θ 0 ) ]. 169

Define g(y) = [log y + (1 y)]. We have g (y) = 1 y + 1, and hence g (y) < 0 for y < 1 and g (y) > 0 for y > 1. So, log λ(x) > c iff ˆθ/θ 0 < c 1 or ˆθ/θ 0 > c 2, where c 1 and c 2 are such that g(c 1 ) = g(c 2 ). We also know that n(ˆθ/θ 0 ) χ 2 n when θ = θ 0. The precise likelihood ratio test is thus φ L (x) = { 1, ˆθ θ 0 c 1 or ˆθ θ 0 c 2 0, θ 0 c 1 < ˆθ < θ 0 c 2, where g(c 1 ) = g(c 2 ) and P (χ 2 n nc 1) + P (χ 2 n nc 2) = α. 170