Berry-Esseen Theorem. Po-Ning Chen, Professor. Institute of Communications Engineering. National Chiao Tung University. Hsin Chu, Taiwan 30010, R.O.C.

Σχετικά έγγραφα
2 Composition. Invertible Mappings

Ordinal Arithmetic: Addition, Multiplication, Exponentiation and Limit

C.S. 430 Assignment 6, Sample Solutions

Solutions to Exercise Sheet 5

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

A Two-Sided Laplace Inversion Algorithm with Computable Error Bounds and Its Applications in Financial Engineering

Example Sheet 3 Solutions

Solution Series 9. i=1 x i and i=1 x i.

Chapter 6: Systems of Linear Differential. be continuous functions on the interval

Uniform Convergence of Fourier Series Michael Taylor

ST5224: Advanced Statistical Theory II

Lecture 34 Bootstrap confidence intervals

5.4 The Poisson Distribution.

Other Test Constructions: Likelihood Ratio & Bayes Tests

w o = R 1 p. (1) R = p =. = 1

F19MC2 Solutions 9 Complex Analysis

Probability and Random Processes (Part II)

Every set of first-order formulas is equivalent to an independent set

Statistical Inference I Locally most powerful tests

3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

Homework 8 Model Solution Section

Math221: HW# 1 solutions

Inverse trigonometric functions & General Solution of Trigonometric Equations

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

Pg The perimeter is P = 3x The area of a triangle is. where b is the base, h is the height. In our case b = x, then the area is

EE512: Error Control Coding

Chapter 6: Systems of Linear Differential. be continuous functions on the interval

Lecture 21: Properties and robustness of LSE

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

Math 446 Homework 3 Solutions. (1). (i): Reverse triangle inequality for metrics: Let (X, d) be a metric space and let x, y, z X.

k A = [k, k]( )[a 1, a 2 ] = [ka 1,ka 2 ] 4For the division of two intervals of confidence in R +

Second Order Partial Differential Equations

Section 8.3 Trigonometric Equations

Online Appendix to Measuring the Bullwhip Effect: Discrepancy and Alignment between Information and Material Flows

Homework 3 Solutions

A Note on Intuitionistic Fuzzy. Equivalence Relation

derivation of the Laplacian from rectangular to spherical coordinates

Nowhere-zero flows Let be a digraph, Abelian group. A Γ-circulation in is a mapping : such that, where, and : tail in X, head in

Areas and Lengths in Polar Coordinates

Areas and Lengths in Polar Coordinates

Second Order RLC Filters

4.6 Autoregressive Moving Average Model ARMA(1,1)

Matrices and Determinants

CHAPTER 101 FOURIER SERIES FOR PERIODIC FUNCTIONS OF PERIOD

Fourier Series. MATH 211, Calculus II. J. Robert Buchanan. Spring Department of Mathematics

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

Problem Set 3: Solutions

Lecture 15 - Root System Axiomatics

Parametrized Surfaces

Statistics 104: Quantitative Methods for Economics Formula and Theorem Review

Homework for 1/27 Due 2/5

The Probabilistic Method - Probabilistic Techniques. Lecture 7: The Janson Inequality

= λ 1 1 e. = λ 1 =12. has the properties e 1. e 3,V(Y

ANSWERSHEET (TOPIC = DIFFERENTIAL CALCULUS) COLLECTION #2. h 0 h h 0 h h 0 ( ) g k = g 0 + g 1 + g g 2009 =?

Fractional Colorings and Zykov Products of graphs

Arithmetical applications of lagrangian interpolation. Tanguy Rivoal. Institut Fourier CNRS and Université de Grenoble 1

Approximation of distance between locations on earth given by latitude and longitude

SCITECH Volume 13, Issue 2 RESEARCH ORGANISATION Published online: March 29, 2018

2. Let H 1 and H 2 be Hilbert spaces and let T : H 1 H 2 be a bounded linear operator. Prove that [T (H 1 )] = N (T ). (6p)

Tridiagonal matrices. Gérard MEURANT. October, 2008

Reminders: linear functions

ORDINAL ARITHMETIC JULIAN J. SCHLÖDER

D Alembert s Solution to the Wave Equation

Practice Exam 2. Conceptual Questions. 1. State a Basic identity and then verify it. (a) Identity: Solution: One identity is csc(θ) = 1

HOMEWORK 4 = G. In order to plot the stress versus the stretch we define a normalized stretch:

5. Choice under Uncertainty

CRASH COURSE IN PRECALCULUS

Bayesian statistics. DS GA 1002 Probability and Statistics for Data Science.

ECE598: Information-theoretic methods in high-dimensional statistics Spring 2016

6.3 Forecasting ARMA processes

6. MAXIMUM LIKELIHOOD ESTIMATION

SCHOOL OF MATHEMATICAL SCIENCES G11LMA Linear Mathematics Examination Solutions

Oscillatory integrals

Mock Exam 7. 1 Hong Kong Educational Publishing Company. Section A 1. Reference: HKDSE Math M Q2 (a) (1 + kx) n 1M + 1A = (1) =

Mean-Variance Analysis

Trigonometric Formula Sheet

557: MATHEMATICAL STATISTICS II RESULTS FROM CLASSICAL HYPOTHESIS TESTING

Srednicki Chapter 55

New bounds for spherical two-distance sets and equiangular lines

DERIVATION OF MILES EQUATION FOR AN APPLIED FORCE Revision C

Appendix to Technology Transfer and Spillovers in International Joint Ventures

Lecture 2: Dirac notation and a review of linear algebra Read Sakurai chapter 1, Baym chatper 3

Section 7.6 Double and Half Angle Formulas

Theorem 8 Let φ be the most powerful size α test of H

Απόκριση σε Μοναδιαία Ωστική Δύναμη (Unit Impulse) Απόκριση σε Δυνάμεις Αυθαίρετα Μεταβαλλόμενες με το Χρόνο. Απόστολος Σ.

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 19/5/2007

Μηχανική Μάθηση Hypothesis Testing

Main source: "Discrete-time systems and computer control" by Α. ΣΚΟΔΡΑΣ ΨΗΦΙΑΚΟΣ ΕΛΕΓΧΟΣ ΔΙΑΛΕΞΗ 4 ΔΙΑΦΑΝΕΙΑ 1

Coefficient Inequalities for a New Subclass of K-uniformly Convex Functions

Limit theorems under sublinear expectations and probabilities

Lecture 13 - Root Space Decomposition II

ENGR 691/692 Section 66 (Fall 06): Machine Learning Assigned: August 30 Homework 1: Bayesian Decision Theory (solutions) Due: September 13

Appendix S1 1. ( z) α βc. dβ β δ β

Congruence Classes of Invertible Matrices of Order 3 over F 2

General 2 2 PT -Symmetric Matrices and Jordan Blocks 1

9.09. # 1. Area inside the oval limaçon r = cos θ. To graph, start with θ = 0 so r = 6. Compute dr

The semiclassical Garding inequality

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 6/5/2006

Lecture 2. Soundness and completeness of propositional logic

Transcript:

Berry-Esseen heorem Po-Ning Chen, Professor Institute of Communications Engineering National Chiao ung University Hsin Chu, aiwan 30010, R.O.C.

Historical aspects BE- he central limit theorem CL concerns the situation that the limit distribution of the normalized sum is normal. As an example, for i.i.d. zero-mean sequence X 1,X,..., X 1 + + X n ne[x 1 ] N, where N has standard normal distribution. Question: What is the rate of convergence of normalized sum distribution to standard normal distribution?

Historical aspects BE-3 he first convergence rate estimates in the CL were obtained by A. M. Lyapounov in 1900-1901. In the beginning of 1940s, the classic Berry-Esseen estimate came to the light: where sup F n x Φx C β 3 x R σ 3 n, F n is the cdf of X 1 + + X n / ne[x1 ], Φ is the standard normal cdf, [ β 3 = E X E[X] 3], [ σ = E X E[X] ],and C is a universal constant, independent of n. In fact, it was due to Lyapounov that the finiteness of + δth absolute moment, where 0 <δ<1, guarantees sup F n x Φx = On δ/ asn. x R

Berry-Esseen theorem BE-4 Classic Berry-Esseen theorem For independent zero-mean random variables {X i } n i=1, [ ] 1 Pr X 1 + + X n a Φa s n C r n, sup a R where s n sum of the marginal variances, r n sum of the marginal absolute 3rd central moments, C absolute constant, and Φ the unit Gaussian cumulative distribution function cdf. Notably, only the first three moments are involved in the Berry-Esseen inequality. s 3 n

Berry-Esseen theorem BE-5 sup a R [ ] 1 Pr X 1 + + X n a Φa s n C r n. s 3 n Feller s book, 1979 C = 6 for an independent sample sum. C = 3 for an independent and identically distributed sample sum. Beek 197 C =0.7975 for an independent sample sum. Shiganov 1986 C =0.7915 for an independent sample sum. C =0.7655 for an independent and identically distributed sample sum.

Berry-Esseen theorem BE-6 Again, the remarkable aspect of the Berry-Esseen theorem is that the upper bound depends only on the variance and the 3rd central moment. Hence, it can provide a good probability estimate based on the first three moments.

BE-7 echnique behind the proof: Pass the difference through a bandlimited filter. Lemma Fix a symmetric bandlimited filtering function with characteristic function ω ζ v x = 1 cosx πx v xe jζx dx = For any cdf H on the real line R, where η sup x R and hu πu = sin x/ πx sup x 1 x R η 6 π π h Hx Φx, u v t dt = π t 1 ζ, if ζ ; 0, otherwise. π u +1 cosu u η, [Ht x Φt x] v xdx, u 0 sinx x dx.

BE-8 Summary: η is the desired difference between any cdf H and standard normal cdf Φ. sup [Hx Φx] v x x R 1 η 6 π π π π t dt η v πη/ = 1 η 3η v ydy πη/ 1 = η 3 v ydy πη/ 1 = sup Hx Φx x R 3 v ydy πη/ Observation he maximum filter output bounds the maximum absolute value of filter input.

BE-9 Proof: he right-continuity of the cdf H and the continuity of the Gaussian unit cdf Φ together indicate the right-continuity of Hx Φx, which in turn implies the existence of x 0 Rsatisfying either η = Hx 0 Φx 0 or η = lim x x0 Hx Φx > Hx 0 Φx 0. We then distinguish between three cases: Case A η = Hx 0 Φx 0 ; Case B η =Φx 0 Hx 0 ; Case C η = lim Hx Φx > Hx 0 Φx 0. x x0

BE-10 Case A η = Hx 0 Φx 0. In this case, we note that for s>0, Hx 0 + s Φx 0 + s Hx 0 [ Φx 0 + s ] π 1 = η s π, where 1 follows from sup x R Φ x =1/ π. Observe that implies π π H x 0 + η x Φ x 0 + η x η 1 π π = 1 η + x π, η x for x <η π/.

BE-11 ogether with the fact that Hx Φx η for all x R,weobtain π sup x x 0 + η x R [ ] π π = H x 0 + η x Φ x 0 + η x v xdx [ ] π π = [ x <η H x 0 + η x Φ x 0 + η x v π/] xdx [ ] π π + [ x η H x 0 + η x Φ x 0 + η x v π/] xdx [ 1 [ x <η π/] η + x ] v π xdx + [ x η π/] η v xdx 1 = η v xdx + [ x η π/] η v xdx, [ x <η π/] where the last equality holds because of the symmetry of the filtering function v.

BE-1 he quantity of [ x η π/] v xdx can be derived as follows: [ x η π/] v xdx = η v xdx π/ Continuing from 3, = = sup x [1 1 x R η [ η η π/ v u 4 ηπ π h 4 ηπ π h 4 ηπ π h = 1 η 6 π π h du π π π π η η η η ].. ]

BE-13 Case B η =Φx 0 Hx 0. Similar to Case A, we first derive for s>0, [ Φx 0 s Hx 0 s Φx 0 s ] Hx 0 =η π s π, and then obtain π Φ x 0 η x H x 0 π η x η 1 π π = 1 η x π, η + x for x <η π/.

BE-14 ogether with the fact that Φx Hx η for all x R,weobtain π sup x x 0 η x R [ 1 [ x <η π/] η x ] v π xdx + [ x η π/] η v xdx 1 = [ x <η π/] η v xdx + [ x η π/] η v xdx = 1 η 6 π π h π η.

BE-15 Case C η = lim x x0 Hx Φx > Hx 0 Φx 0 0. In this case, we observe that for any 0 <δ<η, there exists x 0 such that Hx 0 Φx 0 η δ η. We can then follow the procedure of the previous two cases to obtain: sup x 1 6 x R η π π h π η. he proof is completed by noting that η can be made arbitrarily close to η.

BE-16 Lemma For any cumulative distribution function H with zero-mean and unit variance, its characteristic function ϕ H ζ satisfiesthat η 1 ϕ π H ζ e 1/ζ dζ ζ + 1 π π h πη, where η and h are defined in the previous lemma. his lemma transforms the bound in the previous lemma into frequency domain, namely the characteristic function domain.

BE-17 Proof: Observe that t = [Ht x Φt x] v xdx is nothing but a convolution of v andh Φ. By Fourier inversion theorem, d t dt = 1 t=x π = 1 π e jζx [ϕ H ζ e 1/ζ] ω ζdζ e jζx [ϕ H ζ e 1/ζ] ω ζdζ, where the second step follows from bandlimit assumption. Notably, ϕ H ζande ζ / are the Fourier transforms with respect to measures dhx anddφx, not the Fourier transforms of Hx andφx. Actually, Hx andφx have no Fourier transforms.

BE-18 Integrating with respect to x, weobtain x = 1 [ϕ e jζx H ζ e 1/ζ] ω π jζ ζdζ, 3 where no integration constant appears since both sides go to zero as x. Notably, where c is the integration constant. f x =g x fx =gx+c,

BE-19 Accordingly, 1 [ϕ sup x = sup x R x R π e jζx H ζ e 1/ζ] ω jζ ζdζ 1 [ϕ sup H ζ e 1/ζ] x R π e jζx ω jζ ζ dζ 1 = sup ϕ x R π H ζ e 1/ζ ω ζ dζ ζ 1 sup ϕ x R π H ζ e 1/ζ dζ ζ = 1 ϕ π H ζ e 1/ζ dζ ζ.

BE-0 ogether with we finally have η 1 π sup x 1 x R η 6 π π h π ϕ H ζ e 1/ζ dζ ζ + 1 π π h π η, η.

BE-1 heorem Let Y n = n i=1 X i be sum of i.i.d. random variables. Assume n 3. Denote the mean and variance of X 1 by ˆµ and ˆσ, respectively. Define [ ˆρ E X 1 ˆµ 3]. Also denote the cdf of Y n E[Y n ]/ˆσ nbyh n. hen sup H n y Φy 5 y R ˆρ ˆσ 3 n.

BE- Proof: π η ζ ˆϕn ˆσ e ζ / n where ˆϕ is the characteristic function of X 1 ˆµ. Lemma For any complex numbers α and β, α n β n n α β γ n 1, dζ ζ + 1 π h π where γ max{ α, β }. Proof: α n β n = α n n β r n r r n α r β r rn = n α β r n 1. η, 4

BE-3 Observe that the integrand satisfies ζ ˆϕn ˆσ e ζ / n n ζ ˆϕ ˆσ e ζ /n n γn 1, 5 ζ n ˆϕ ˆσ 1 ζ + n n 1 ζ e ζ /n n γ n 1 6 where the quantity γ in 5 requires that ζ ˆϕ ˆσ γ and e ζ /n γ. n We upperbound the first and second terms in the parentheses of 6 respectively by ζ ˆϕ ˆσ 1+ ζ n n ˆρ and 6ˆσ 3 n 3/ ζ 3 1 ζ n e ζ /n 1. 8n ζ4 Continuing the derivation of 6, ζ ˆϕn ˆσ e ζ / ˆρ n n + 1 γ n 1. 7 6ˆσ 3 n 3/ ζ 3 8n ζ4 It remains to choose γ that bounds both ˆϕζ/ˆσ n and exp{ ζ /n} from above.

BE-4 For complex number z and reals b and c, z b c z b + c. Accordingly, ζ ˆϕ ˆσ 1+ ζ n n ˆρ 6ˆσ 3 n 3/ ζ 3 ζ ˆϕ ˆσ n 1 ζ n + ˆρ. 6ˆσ 3 n 3/ ζ 3 Next, ζ ˆϕ ˆσ 1 ζ n n + ˆρ, if ζ 1. 8 6ˆσ 3 n 3/ ζ3 n For those ζ [,] which is exactly the range of integration operation in 4, we can guarantee the validity of the condition in 8 by defining and obtain for n 3. ζ n n = ˆσ6 ˆρ ˆσ3 n ˆρ n 3 n 1 n 3, n 1 1 n 3 1, n 1

BE-5 Hence, for ζ, ζ ˆϕ ˆσ 1+ ζ n n + ˆρ 6ˆσ 3 n 3/ ζ3 } exp { ζ n + ˆρ 6ˆσ 3 n { 3/ ζ3 exp 1 } ˆρ n ζ + 6ˆσ 3 n { 3/ζ 1 = exp n ˆρ } ζ 6ˆσ 3 n { 3/ = exp 3 } 6n 1 ζ. We can then choose γ exp { 3 } 6n 1 ζ Note that the above selected γ is an upper bound of exp { ζ /n } for n 3/.1..

BE-6 By taking the chosen γ into 7, the integration part in 4 becomes ζ ˆϕn ˆσ e ζ / dζ n ζ ˆρ n + 1 { } 3 exp ζ dζ 6ˆσ 3 n 3/ζ 8n ζ 3 6 ˆρ 6ˆσ 3 n ζ + 1 { } 3 8n ζ 3 exp ζ dζ 6 ˆρ = ˆσ 3 6π n 3 + 9 3/ 3 ˆσ 3 ˆρ n ˆρ ˆσ 3 6π n 3 + 9 3/ 3 1 n = 1 n 3 6π n 1 3 + 9 3/ 3 1, 9 n where the last inequality follows from Lyapounov s inequality, i.e., ˆσ = E 1/ [ X d+1 ˆµ ] E 1/3 [ X d+1 ˆµ 3] =ˆρ 1/3.

BE-7 aking 9 into 4, we finally obtain π η 1 n 3 n 1 + 1 π h π 6π 3 3/ + 9 3 η, 1 n or equivalently, πu 6hu for u πη/. πn 3 4n 1 6π 3/ + 3 9 1, 3 n 10

BE-8 Observe that πu 6hu =πu t 1 6 v u dt is continuous, and equals 0 at u = 0, and goes to as u,which guarantees the existence of positive u satisfying 10. Inequality 10 thus implies u max a 0 : πa 6ha πn 3 4n 1 6π 3/ + 3 9 1 3 n. Since πn 3 π = 3 π 4n 1 4 n 1, we can, for n 3, further upper-bound u by: π u max a 0 : πa 6ha 6π 9 1 3/ + 3 3 3 = 3.6369...

he proof is completed by η = u π 3.6369 π n 1 ˆρ = 6.5738 πn 3 ˆσ 3 n = 6.5738 1+ 3 π n 3 ˆρ ˆσ 3 n 6.5738 1+ 3 π 6 3 ˆρ ˆσ 3 n ˆρ = 4.19115 ˆσ 3 n ˆρ 5 ˆσ 3 n. BE-9