Supplementary materials for Statistical Estimation and Testing via the Sorted l 1 Norm

Σχετικά έγγραφα
Πανεπιστήµιο Κρήτης - Τµήµα Επιστήµης Υπολογιστών. ΗΥ-570: Στατιστική Επεξεργασία Σήµατος. ιδάσκων : Α. Μουχτάρης. εύτερη Σειρά Ασκήσεων.

Multi-dimensional Central Limit Theorem

Multi-dimensional Central Limit Theorem

One and two particle density matrices for single determinant HF wavefunctions. (1) = φ 2. )β(1) ( ) ) + β(1)β * β. (1)ρ RHF

Variance of Trait in an Inbred Population. Variance of Trait in an Inbred Population

α & β spatial orbitals in

Ordinal Arithmetic: Addition, Multiplication, Exponentiation and Limit

1 Complete Set of Grassmann States

Every set of first-order formulas is equivalent to an independent set

Supporting information for: Functional Mixed Effects Model for Small Area Estimation

Statistical Inference I Locally most powerful tests

8.324 Relativistic Quantum Field Theory II

5 Haar, R. Haar,. Antonads 994, Dogaru & Carn Kerkyacharan & Pcard 996. : Haar. Haar, y r x f rt xβ r + ε r x β r + mr k β r k ψ kx + ε r x, r,.. x [,

Matrices and Determinants

Uniform Convergence of Fourier Series Michael Taylor

Generalized Fibonacci-Like Polynomial and its. Determinantal Identities

Other Test Constructions: Likelihood Ratio & Bayes Tests

A Class of Orthohomological Triangles

8.1 The Nature of Heteroskedasticity 8.2 Using the Least Squares Estimator 8.3 The Generalized Least Squares Estimator 8.

Example Sheet 3 Solutions

k A = [k, k]( )[a 1, a 2 ] = [ka 1,ka 2 ] 4For the division of two intervals of confidence in R +

Fractional Colorings and Zykov Products of graphs

ST5224: Advanced Statistical Theory II

Some generalization of Cauchy s and Wilson s functional equations on abelian groups

2 Composition. Invertible Mappings

Duals of the QCQP and SDP Sparse SVM. Antoni B. Chan, Nuno Vasconcelos, and Gert R. G. Lanckriet

Nowhere-zero flows Let be a digraph, Abelian group. A Γ-circulation in is a mapping : such that, where, and : tail in X, head in

Math221: HW# 1 solutions

Numerical Analysis FMN011

Concomitants of Dual Generalized Order Statistics from Bivariate Burr III Distribution

Finite Field Problems: Solutions

Solution Series 9. i=1 x i and i=1 x i.

3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

C.S. 430 Assignment 6, Sample Solutions

SCITECH Volume 13, Issue 2 RESEARCH ORGANISATION Published online: March 29, 2018

b. Use the parametrization from (a) to compute the area of S a as S a ds. Be sure to substitute for ds!

LECTURE 4 : ARMA PROCESSES

EE512: Error Control Coding

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

8.323 Relativistic Quantum Field Theory I

The one-dimensional periodic Schrödinger equation

THE SECOND WEIGHTED MOMENT OF ζ. S. Bettin & J.B. Conrey

Generalized Linear Model [GLM]

Reminders: linear functions

Young Kwark. Warrington College of Business, University of Florida, 1454 Union Road, Gainesville, FL U.S.A.

4.6 Autoregressive Moving Average Model ARMA(1,1)

ANSWERSHEET (TOPIC = DIFFERENTIAL CALCULUS) COLLECTION #2. h 0 h h 0 h h 0 ( ) g k = g 0 + g 1 + g g 2009 =?

The challenges of non-stable predicates

Solutions for Mathematical Physics 1 (Dated: April 19, 2015)

6.3 Forecasting ARMA processes

Tridiagonal matrices. Gérard MEURANT. October, 2008

Derivation for Input of Factor Graph Representation

Lecture 21: Properties and robustness of LSE

ORDINAL ARITHMETIC JULIAN J. SCHLÖDER

Areas and Lengths in Polar Coordinates

The semiclassical Garding inequality

arxiv: v2 [math.ap] 6 Dec 2015

Areas and Lengths in Polar Coordinates

Inverse trigonometric functions & General Solution of Trigonometric Equations

Sequent Calculi for the Modal µ-calculus over S5. Luca Alberucci, University of Berne. Logic Colloquium Berne, July 4th 2008

F19MC2 Solutions 9 Complex Analysis

( y) Partial Differential Equations

Partial Differential Equations in Biology The boundary element method. March 26, 2013

Homework 3 Solutions

A Note on Intuitionistic Fuzzy. Equivalence Relation

Congruence Classes of Invertible Matrices of Order 3 over F 2

A Two Sample Test for Mean Vectors with Unequal Covariance Matrices

2. Let H 1 and H 2 be Hilbert spaces and let T : H 1 H 2 be a bounded linear operator. Prove that [T (H 1 )] = N (T ). (6p)

ECE Spring Prof. David R. Jackson ECE Dept. Notes 2

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

Lecture 2: Dirac notation and a review of linear algebra Read Sakurai chapter 1, Baym chatper 3

Proposal of Terminal Self Location Estimation Method to Consider Wireless Sensor Network Environment

w o = R 1 p. (1) R = p =. = 1

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 19/5/2007

Symplecticity of the Störmer-Verlet algorithm for coupling between the shallow water equations and horizontal vehicle motion

SOME PROPERTIES OF FUZZY REAL NUMBERS

SCHOOL OF MATHEMATICAL SCIENCES G11LMA Linear Mathematics Examination Solutions

Neutralino contributions to Dark Matter, LHC and future Linear Collider searches

Section 8.3 Trigonometric Equations

Homework 8 Model Solution Section

The ε-pseudospectrum of a Matrix

2 Lagrangian and Green functions in d dimensions

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 6/5/2006

Constant Elasticity of Substitution in Applied General Equilibrium

5.4 The Poisson Distribution.

ΤΜΗΜΑ ΗΛΕΚΤΡΟΛΟΓΩΝ ΜΗΧΑΝΙΚΩΝ ΚΑΙ ΜΗΧΑΝΙΚΩΝ ΥΠΟΛΟΓΙΣΤΩΝ

Solutions to Exercise Sheet 5

Main source: "Discrete-time systems and computer control" by Α. ΣΚΟΔΡΑΣ ΨΗΦΙΑΚΟΣ ΕΛΕΓΧΟΣ ΔΙΑΛΕΞΗ 4 ΔΙΑΦΑΝΕΙΑ 1

Strain gauge and rosettes

derivation of the Laplacian from rectangular to spherical coordinates

Local Approximation with Kernels

ΗΥ537: Έλεγχος Πόρων και Επίδοση σε Ευρυζωνικά Δίκτυα,

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

CRASH COURSE IN PRECALCULUS

Approximation of distance between locations on earth given by latitude and longitude

Bounding Nonsplitting Enumeration Degrees

Trigonometric Formula Sheet

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

2. THEORY OF EQUATIONS. PREVIOUS EAMCET Bits.

Transcript:

Sulementary materals for Statstcal Estmaton and Testng va the Sorted l Norm Małgorzata Bogdan * Ewout van den Berg Weje Su Emmanuel J. Candès October 03 Abstract In ths note we gve a roof showng that even though the number of false dscoveres and the total number of dscoveres are not contnuous functons of the arameters, the formulas we obtan for the false dscovery roorton (FD and the ower, namely, (B.3 and (B.4 n the aer Statstcal Estmaton and Testng va the Sorted l Norm are mathematcally vald. We recall that these formulas are derved from [, Theorem.5]. Consder the lnear model y Xβ + z, where X R n s the desgn matrx, β R s the unknown arameter of nterest and z R n s a vector of..d. standard Gaussan ndeendent of X and β. The lasso estmate ˆβ wth enalty λ > 0 s the soluton to mn b R y Xb l + λ b l. ( As n the man aer, we let ϕ V (x, y (x 0(y 0 and ϕ R (x, y (x 0 so that the number V of false dscoveres s equal to V ϕ V ( ˆβ, β, and the number R of dscoveres s equal to R ϕ R ( ˆβ, β. We can now state the man result n ths note, whch generalzes [, Theorem.5]. Theorem. Suose that X s an n Gaussan desgn matrx wth..d. N (0, n entres, β s are non-degenerate..d. random varables wth bounded second moment ndeendent of X. Below, Θ s a varable wth the same dstrbuton as β. As and n/ δ > 0, the lasso soluton ˆβ obeys FD ϕ V ( ˆβ, β max{ ϕ R( ˆβ, β, } where τ > 0 and α > α mn are the unque solutons to τ +, (η δ E ατ (Θ + τz Θ λ ( δ ( Θ + τz > ατ ατ. (Θ 0Φ( α ( Θ + τz > ατ, Here η s the soft-thresholdng oerator defned as η t (x sgn(x( x t + and Z s a standard Gaussan ndeendent of Θ. *Deartments of Mathematcs and Comuter Scence, Wrocław Unversty of Technology and Jan Długosz Unversty, oland IBM T.J. Watson Research Center, Yorktown Heghts, NY 0598, U.S.A. Deartment of Statstcs, Stanford Unversty, Stanford, CA 94305 Deartments of Statstcs and of Mathematcs, Stanford Unversty, Stanford, CA 94305

Before resentng the roof, we gve three lemmas. Lemma. Suose Σ ( Σ, Σ, Σ, Σ, s a ostve defnte matrx wth all egenvalues larger than or equal to, where Σ, s a scalar. Then we have Σ, Σ, Σ, Σ,. roof. The condton Σ I, where I s the dentty matrx wth the same sze as Σ, s equvalent to 0 Σ I. So by the Schur comlement roerty, the (, entry of Σ satsfes whch gves Σ, Σ, Σ, Σ, as desred. 0 < Σ (, (Σ, Σ, Σ, Σ,, Lemma. Suose Y s a -dmensonal vector dstrbuted as N (µ, Σ, where Σ σ I. For any ɛ (0,, there exsts a constant c c(ɛ > 0 such that for any h > 0, (at least ɛ comonents of Y are n ( h, h mn(, ch ɛ σ ɛ. roof. By Lemma, the varance of Y Y, where the subscrt denotes all the comonents excet the th, s larger than or equal to σ h. So we have (Y ( h, h Y mn(, πσ almost surely snce the normal densty functon φ s bounded by h π. Denote by ξ..d. Bernoull varables ndeendent of Y wth (ξ mn(, πσ. And denote by ζ (Y ( h, h. Snce (ζ ζ,..., ζ (ξ ζ,..., ζ, a.s., we have Usng smlar arguments we reach the concluson (ζ + ζ +... + ζ ɛ (ξ + ζ +... + ζ ɛ. (ξ + ξ +... + ξ k + ζ k+ +... + ζ ɛ (ξ + ξ +... + ξ k + ξ k+ + ζ k+ +... + ζ ɛ, whch holds for k,,...,. Therefore, (at least ɛ comonents of Y are n ( h, h (ζ + ζ +... + ζ ɛ (ξ + ζ +... + ζ ɛ (ξ + ξ + ζ 3 +... + ζ ɛ... (ξ + ξ +... + ξ ɛ. Hence, ( ξ ɛ ɛ ɛ ɛ ɛ ( n ( n ( +ɛ/ π ɛ/ h ɛ σ ɛ.

Lemma 3. In the same settng as Theorem, denote by A {,..., } the actve set of the lasso soluton ˆβ of (. Then there exsts a constant ρ > 0 such that ( A < ρ 0. roof. Defne ϕ(x, y mn( x,, whch s seudo-lschtz. It follows from [, Theorem.5] that n robablty. Note that mn( ˆβ, E mn( η ατ (Θ + τz, > 0 A mn( ˆβ,. So for any ρ < E mn( η ατ (Θ + τz, we have ( A < ρ 0. roof of Theorem. To go around the dscontnuty of ϕ V, we defne a seres of seudo-lschtz contnuous functons ϕ V,h (x, y ( Q(x/hQ(y/h, where Q(x max( x, 0 for h > 0. Therefore, by [, Theorem.5], n robablty. Snce so that for any ɛ > 0, ϕ V,h ( ˆβ, β Eϕ V,h (η ατ (Θ + τz, Θ ( ϕ V,h (x, y ϕ V (x, y (0 < x < h + (0 < y < h, (3 ( ϕ V ( ˆβ, β By the weak Law of Large Numbers we have So f we addtonally have for any ɛ > 0, we would obtan ϕ V,h ( ˆβ (, β > ɛ ( h 0 h 0 ( su (0 < ˆβ < h > ɛ ( + (0 < β < h > ɛ 0. (0 < β < h > ɛ (0 < ˆβ < h > ɛ 0 (4 ϕ V ( ˆβ, β h 0 ϕ V,h ( ˆβ, β h 0 Eϕ V,h (η ατ (Θ + τz, Θ Eϕ V (η ατ (Θ + τz, Θ, where the last equalty comes from alyng the domnated convergence theorem to ϕ V,h ϕ V. We rove (4 to comlete the roof of the theorem. Denote the actve set of the Lasso soluton by A {,..., }. Then artal KKT condtons gve X T A(y X A ˆβA λ A, where λ A s a vector wth each comonent beng λ or λ deendng the sgns of ˆβ A. Note that X T A X A s nvertble wth robablty one because A n. So we may wrte the soluton as ˆβ A (X T AX A (X T Ay λ A (X T AX A (X T AXβ λ A + (X T AX A X T Az.. (5 3

Now for any subset D {,..., } wth D n, and λ of length D wth each comonent beng ±λ, defne D (X T DX D (X T DXβ λ + (X T DX D X T Dz. If D A and λ λ A, D concdes wth ˆβ A. By Lemma 3, there s a constant ρ > 0 such that ( A < ρ o(. For any ɛ > 0, by the unon bound we have ( By [], we have Now we estmate (0 < ˆβ < h > ɛ ρ D mn(,n λ D ( D ( D ( < h > ɛ, σ max (X < δ / + + ɛ ( + σ max (X δ / + + ɛ + ( A < ρ. (6 (σ max (X δ / + + ɛ o(. (7 ( D ( D ( < h > ɛ, σ max (X < δ / + + ɛ. Condtonally on X, D s dstrbuted as Gaussan wth mean (X T D X D (X T D Xβ λ and covarance (X T D X D. On the event σ max (X < δ / + + ɛ, all the egenvalues of (X T D X D are larger than σ (δ / + + ɛ. So by Lemma, ( D ( ˆβ λ D ( < h > ɛ,σ max (X < δ / + + ɛ X (at least ɛ D comonents of D are n ( h, h, σ max (X < δ / + + ɛ X mn(, ch ɛ σ ɛ D mn(, ch ɛ σ ɛ ρ (ch ɛ σ ɛ ρ. Together wth (6 and (7 ths gves ( (0 < ˆβ < h > ɛ (ch ɛ σ ɛ ρ + o( + o( ρ D mn(,n λ D (ch ɛ σ ɛ ρ + o( (4c ρ h ρɛ σ ρɛ + o(, (8 whch roves (4 by choosng h suffcently small such that 4c ρ h ρɛ σ ρɛ <. Consequently (5 s also roved. Ths gves V Eϕ V (η ατ (Θ + τz, Θ. (9 Smlarly, defnng ϕ R,h (x, y Q(x/h, we can also establsh R Eϕ R (η ατ (Θ + τz, Θ. (0 4

Combnng (9 and (0 gves FD V Eϕ V (η ατ (Θ + τz, Θ max(r, Eϕ R (η ατ (Θ + τz, Θ (Θ 0Φ( α ( Θ + τz > ατ. As a byroduct of the roof of Theorem, we have Corollary. Wth the same settng as Theorem, the emrcal ower of the lasso soluton ˆβ of ( obeys roof of Corollary. We have ower #{ : ˆβ 0, β 0} β l0 R V ower #{ : β 0} (R V / #{ : β 0}/ ( Θ + τz > ατ Θ 0. Eϕ R(η ατ (Θ + τz, Θ Eϕ V (η ατ (Θ + τz, Θ (Θ 0 (Θ 0, Θ + τz > ατ (Θ 0 ( Θ + τz > ατ Θ 0. References [] M. Bayat and A. Montanar. The LASSO rsk for Gaussan matrces. IEEE Transactons on Informaton Theory, 58(4:997 07, 0. [] S. Geman. A t theorem for the norm of random matrces. The Annals of robablty, 8(:5 6, 980. 5