Confirmatory Factor Analysis

Σχετικά έγγραφα
3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

Μηχανική Μάθηση Hypothesis Testing

The Simply Typed Lambda Calculus

Other Test Constructions: Likelihood Ratio & Bayes Tests

Section 7.6 Double and Half Angle Formulas

Queensland University of Technology Transport Data Analysis and Modeling Methodologies

Section 8.3 Trigonometric Equations

EE512: Error Control Coding

HOMEWORK 4 = G. In order to plot the stress versus the stretch we define a normalized stretch:

Homework 3 Solutions

2 Composition. Invertible Mappings

Instruction Execution Times

derivation of the Laplacian from rectangular to spherical coordinates

5.4 The Poisson Distribution.

Statistical Inference I Locally most powerful tests

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 19/5/2007

ST5224: Advanced Statistical Theory II

Introduction to Structural Equations with Latent Variables

Bayesian statistics. DS GA 1002 Probability and Statistics for Data Science.

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

Concrete Mathematics Exercises from 30 September 2016

Supplementary Appendix

Finite Field Problems: Solutions

4.6 Autoregressive Moving Average Model ARMA(1,1)

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

PARTIAL NOTES for 6.1 Trigonometric Identities

Section 9.2 Polar Equations and Graphs

Areas and Lengths in Polar Coordinates

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

Writing for A class. Describe yourself Topic 1: Write your name, your nationality, your hobby, your pet. Write where you live.

Matrices and Determinants

6.3 Forecasting ARMA processes

Right Rear Door. Let's now finish the door hinge saga with the right rear door

BECAUSE WE REALLY WANT TO KNOW WHAT YOU THINK ABOUT SCHOOL AND YOUR GARDEN. Fairly true If I decide to learn something hard, I can.

Solutions to Exercise Sheet 5

9.09. # 1. Area inside the oval limaçon r = cos θ. To graph, start with θ = 0 so r = 6. Compute dr

Approximation of distance between locations on earth given by latitude and longitude

Εργαστήριο Ανάπτυξης Εφαρμογών Βάσεων Δεδομένων. Εξάμηνο 7 ο

the total number of electrons passing through the lamp.

Areas and Lengths in Polar Coordinates

SCHOOL OF MATHEMATICAL SCIENCES G11LMA Linear Mathematics Examination Solutions

ΤΕΧΝΟΛΟΓΙΚΟ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΥΠΡΟΥ ΤΜΗΜΑ ΝΟΣΗΛΕΥΤΙΚΗΣ

Repeated measures Επαναληπτικές μετρήσεις

Srednicki Chapter 55

LESSON 14 (ΜΑΘΗΜΑ ΔΕΚΑΤΕΣΣΕΡΑ) REF : 202/057/34-ADV. 18 February 2014

Estimation for ARMA Processes with Stable Noise. Matt Calder & Richard A. Davis Colorado State University

6.1. Dirac Equation. Hamiltonian. Dirac Eq.

Jesse Maassen and Mark Lundstrom Purdue University November 25, 2013

Math 6 SL Probability Distributions Practice Test Mark Scheme

ΕΙΣΑΓΩΓΗ ΣΤΗ ΣΤΑΤΙΣΤΙΚΗ ΑΝΑΛΥΣΗ

Partial Differential Equations in Biology The boundary element method. March 26, 2013

CHAPTER 48 APPLICATIONS OF MATRICES AND DETERMINANTS

ANSWERSHEET (TOPIC = DIFFERENTIAL CALCULUS) COLLECTION #2. h 0 h h 0 h h 0 ( ) g k = g 0 + g 1 + g g 2009 =?

Math221: HW# 1 solutions

DESIGN OF MACHINERY SOLUTION MANUAL h in h 4 0.

C.S. 430 Assignment 6, Sample Solutions

«ΨΥΧΙΚΗ ΥΓΕΙΑ ΚΑΙ ΣΕΞΟΥΑΛΙΚΗ» ΠΑΝΕΥΡΩΠΑΪΚΗ ΕΡΕΥΝΑ ΤΗΣ GAMIAN- EUROPE

Numerical Analysis FMN011

Δθαξκνζκέλα καζεκαηηθά δίθηπα: ε πεξίπησζε ηνπ ζπζηεκηθνύ θηλδύλνπ ζε κηθξνεπίπεδν.

Code Breaker. TEACHER s NOTES

Solution Series 9. i=1 x i and i=1 x i.

Mean bond enthalpy Standard enthalpy of formation Bond N H N N N N H O O O

[1] P Q. Fig. 3.1

Lecture 34 Bootstrap confidence intervals

Συστήματα Διαχείρισης Βάσεων Δεδομένων

SUPERPOSITION, MEASUREMENT, NORMALIZATION, EXPECTATION VALUES. Reading: QM course packet Ch 5 up to 5.6

CE 530 Molecular Simulation

2. THEORY OF EQUATIONS. PREVIOUS EAMCET Bits.

ΕΚΤΙΜΗΣΗ ΤΟΥ ΚΟΣΤΟΥΣ ΤΩΝ ΟΔΙΚΩΝ ΑΤΥΧΗΜΑΤΩΝ ΚΑΙ ΔΙΕΡΕΥΝΗΣΗ ΤΩΝ ΠΑΡΑΓΟΝΤΩΝ ΕΠΙΡΡΟΗΣ ΤΟΥ

ΑΓΓΛΙΚΑ Ι. Ενότητα 7α: Impact of the Internet on Economic Education. Ζωή Κανταρίδου Τμήμα Εφαρμοσμένης Πληροφορικής

department listing department name αχχουντσ ϕανε βαλικτ δδσϕηασδδη σδηφγ ασκϕηλκ τεχηνιχαλ αλαν ϕουν διξ τεχηνιχαλ ϕοην µαριανι

CRASH COURSE IN PRECALCULUS

ίκτυο προστασίας για τα Ελληνικά αγροτικά και οικόσιτα ζώα on.net e-foundatio // itute: toring Insti SAVE-Monit

χ 2 test ανεξαρτησίας

«ΑΓΡΟΤΟΥΡΙΣΜΟΣ ΚΑΙ ΤΟΠΙΚΗ ΑΝΑΠΤΥΞΗ: Ο ΡΟΛΟΣ ΤΩΝ ΝΕΩΝ ΤΕΧΝΟΛΟΓΙΩΝ ΣΤΗΝ ΠΡΟΩΘΗΣΗ ΤΩΝ ΓΥΝΑΙΚΕΙΩΝ ΣΥΝΕΤΑΙΡΙΣΜΩΝ»

Πώς μπορεί κανείς να έχει έναν διερμηνέα κατά την επίσκεψή του στον Οικογενειακό του Γιατρό στο Ίσλινγκτον Getting an interpreter when you visit your

An Introduction to Signal Detection and Estimation - Second Edition Chapter II: Selected Solutions

Advanced Subsidiary Unit 1: Understanding and Written Response

TMA4115 Matematikk 3

Aquinas College. Edexcel Mathematical formulae and statistics tables DO NOT WRITE ON THIS BOOKLET

Example Sheet 3 Solutions

Τ.Ε.Ι. ΔΥΤΙΚΗΣ ΜΑΚΕΔΟΝΙΑΣ ΠΑΡΑΡΤΗΜΑ ΚΑΣΤΟΡΙΑΣ ΤΜΗΜΑ ΔΗΜΟΣΙΩΝ ΣΧΕΣΕΩΝ & ΕΠΙΚΟΙΝΩΝΙΑΣ

Εγχειρίδια Μαθηµατικών και Χταποδάκι στα Κάρβουνα

HISTOGRAMS AND PERCENTILES What is the 25 th percentile of a histogram? What is the 50 th percentile for the cigarette histogram?

HW 3 Solutions 1. a) I use the auto.arima R function to search over models using AIC and decide on an ARMA(3,1)

Figure A.2: MPC and MPCP Age Profiles (estimating ρ, ρ = 2, φ = 0.03)..

Homework for 1/27 Due 2/5

ΠΑΝΔΠΗΣΖΜΗΟ ΠΑΣΡΩΝ ΣΜΖΜΑ ΖΛΔΚΣΡΟΛΟΓΩΝ ΜΖΥΑΝΗΚΩΝ ΚΑΗ ΣΔΥΝΟΛΟΓΗΑ ΤΠΟΛΟΓΗΣΩΝ ΣΟΜΔΑ ΤΣΖΜΑΣΩΝ ΖΛΔΚΣΡΗΚΖ ΔΝΔΡΓΔΗΑ

ΙΩΑΝΝΗ ΑΘ. ΠΑΠΑΪΩΑΝΝΟΥ

Υπολογιστική Φυσική Στοιχειωδών Σωματιδίων

«ΑΝΑΠΣΤΞΖ ΓΠ ΚΑΗ ΥΩΡΗΚΖ ΑΝΑΛΤΖ ΜΔΣΔΩΡΟΛΟΓΗΚΩΝ ΓΔΓΟΜΔΝΩΝ ΣΟΝ ΔΛΛΑΓΗΚΟ ΥΩΡΟ»

The challenges of non-stable predicates

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 6/5/2006

( ) 2 and compare to M.

b. Use the parametrization from (a) to compute the area of S a as S a ds. Be sure to substitute for ds!

Overview. Transition Semantics. Configurations and the transition relation. Executions and computation


Living and Nonliving Created by: Maria Okraska

Potential Dividers. 46 minutes. 46 marks. Page 1 of 11

Partial Trace and Partial Transpose

Transcript:

Confirmatory Factor Analysis Session 9, Lecture 7 // Adding Latent Variables to the Model So far we ve only included observed variables in our path models Etension to latent variables: need to add in a measurement piece how do we define the latent variable (think factor analysis) more complicated to look at, but same general principles apply

Why we need the measurement piece We can t directly measure most of the interesting things in behavioral science. Beyond that, human behavior plays a significant role in most (if not all!) of the pressing public health issues in the US imagine if we could get every American to Eercise and eat a healthy diet stop smoking, and using drugs or alcohol Use condoms This is why we need to figure out how to fit latent variables into a statistical framework Notation for Latent Variable Model η = latent endogenous variable (eta) y = observed indicator of η (these both makes ee sounds) = latent eogenous variable (pronounced ksi, or zi) = observed indicator of (these both make sounds) ζ = latent error (zeta) (zeta rhymes with eta) β = coefficient on latent endogenous variable (beta) γ = coefficient on latent eogenous variable (gamma) = measurement error on (delta) ε = measurement error on y (epsilon) Λ y,, y = coefficient relating y to η (lambda) Λ, = coefficient relating to

η = γ + ζ η = β η + γ + ζ ε ζ γ η γ β y y ε y ε η 7 9 8 y y ε ε ζ y ε

To write equations for models:. Start an equation for everything that has an arrow pointing at it. Each is the sum of (everything pointing at it multiplied by the weight). η = γ + ζ η = β η + γ + ζ = + = + = + y y y = η + ε = η + ε = η + ε y y y = η + ε 7 = η + ε 8 = η + ε 9 Confirmatory vs. Eploratory Factor Analysis CFA: We can test or confirm or implement via model constraints. Model Construction CFA: drawn in advance, including number of latent variables (factors) and which latent variables influence which indicators. EFA: not in advance, typically all latent variables influence all indicators Measurement Errors on Indicators () CFA: may be correlated EFA: can t be correlated

Eploratory Factor Analysis Two factor model: + = ~ ~ ~ + = Λ No curved arrows between deltas + = Eplanation of Matri Notation + + = + =

Confirmatory Factor Analysis Two factor model: ~ ~ ~ = Λ + = + Don t Like Matrices? CFA EFA = + = + = + = + = + = + cov(, ) = ϕ = + + = + + = + + = + + = + + = + + = cov(, )

Necessary Constraints Latent variables (LVs) need some constraints Recall EFA where we constrained factors: F ~ N(,) Otherwise, model is not identifiable. Here we have two options: Fi variance of latent variables (LV) to be (or another constant) Fi one path between LV and indicator Fi variances: Fi path: 7

Fi variances: = + = + = + = + = + = + cov(, ) = ϕ var( ) = var( ) = Fi path: = = + = + = + = + = + = + cov(, ) = ϕ var( ) = ϕ var( ) = ϕ More Important Notation Φ (capital of ): covariance matri of eogenous latent variables ϕ ϕ var( ) = ϕ Φ = = var( ) ϕ ϕ cov(, ) = ϕ Θ (capital of ): covariance matri of errors Θ var( ) = cov(, ) = NOTE: usually, ij = if i j 8

Identifiability Rules for CFA () T-rule (revisited) necessary, but not sufficient t things to estimate n observed variables t n( n + ) indicator rule Sufficient, but not necessary At least two factors At least two indicators per factor Eactly one non-zero element per row of Λ (translation: each is pointed at by one LV) Non-correlated errors (Θ is diagonal) (translation: no double-header arrows between the s) Factors are correlated (Φ has no zero elements)* (translation: there are double-headed arrows between all of the eogenous latent variables ()) * Alternative less strict criteria: each factor is correlated with at least one other factor. (see Bollen, p7) 9

= + Θ = var( ) = Φ = var( ) = No zeros in phi =All latent vars are correlated with each other. What if we take out that arrow?

= + Θ = var( ) = Φ = var( ) = 7 7 8 8

+ = 8 7 8 7 8 7 = Θ Φ = 7 8 7 8

+ = 8 7 8 7 8 7 = Θ Φ = ϕ indicator rule sufficient, not necessary at least one factor at least three indicators per factor one non-zero element per row of Λ (translation: each is pointed at by only one LV) non-correlated errors (Θ is diagonal) (translation: no double-headed arrows between the s) NO restrictions on Φ (translation: factors don t have to be correlated)

~ ~ ~ = Λ + = + Θ = var( ) = Φ = = ϕ var( ) ϕ What about the T-rule? Sample Moments: (8*9)/= Parameters being estimated: 8 eogenous variances= (fied) error variances = 8 direct effects = 8 double-headed arrows =

Food for Thought If something fulfills the -indicator rule, will it automatically fulfill the -indicator rule? If not, draw a counter-eample If something fulfills the -indicator rule, will it automatically fulfill the -indicator rule? If not, draw a counter eample If something fulfills either the - or - indicator rule, will it automatically fulfull the t-rule? If not, draw a counter-eample Estimation Procedures (Bollen, Ch. and Ch.7) Maimum Likelihood Generalized Least Squares Where these are in AMOS: View/Set Analysis Properties Estimation

CMIN (a global test of fit) Global Tests ~ Goodness of Fit AMOS: CMIN Get value comparing the covariance predicted by the model to the observed covariance (v. ugly formula found in AMOS help appendi B) The values can be used to compare nested models (e.g., your models compared to the saturated model ) More on CMIN CMIN : Depends on estimation procedure Asymptotically χ Maimum likelihood: -Log-likelihood Generalized LS: like Pearson χ CMIN for ML and from LS will converge as sample size approaches infinity. Assumes perfect fit : small deviations from model can lead to rejections (especially as N gets large) 7

Other fit statistics: Information Criteria Akaike: AIC = -LL + *s Schwarz: BIC = -LL + s*log(n*p) consistent AIC: CAIC = -LL + s*(log(n) + ) s is # of free parameters (parameters being estimated) p is number of parameters in independence model (no arrows, no latent vars, just observed variables) the smaller the better Other fit statistics: RMR (root mean square residual) average squared amount that sample variances and covariances differ from estimates is perfect fit the smaller the better, but no preset cutoff GFI (goodness of fit inde) is perfect fit ranges from to According to Garson, you want to see a GFI>.9 For more, check out http://www.chass.ncsu.edu/garson/pa7/structur.htm 8

Comparing Models AMOS: Model Fit Manage Models Can constrain parameters Careful with errors--can get negative variance estimate warnings. Nested likelihood ratios Comparison of info criteria (AIC, BIC) Parameter Evaluation Estimate/SE = Z-statistic Standard interpretation: if Z >, then significant Consider both statistical and scientific value of including a variable in the model Notes for parameter testing in CFA: Not usually interesting in determining if loadings are equal to zero Might be interested in testing whether or not covariance between factors is zero. 9

CFA Eample: Epidemiologic Catchment Area (ECA) Data Hypotheses: Lethargy and depression are two aspects of mental distress. Gender and age influence depression Current job influences both depression and lethargy Factors: depression lethargy X s: CFA Eample: ECA data have you been managing to keep yourself busy and occupied? have you been getting out of the house as much as usual? have you felt on the whole that you were doing things well? have you felt constantly under strain? have you found everything getting too much for you? have you been feeling unhappy and depressed?

Is this model identified? - indicator rule -How many sample moments to I have? -How many parameters am I estimating?

Everything seems fine, right? Or not.. Is this an identifiability issue?

CMIN= -LL (for ML) Proof: AIC = -LL + *s -LL=.98-(*) =.98

Second-Order Factor Analysis Factors: depression lethargy X s: have you been managing to keep yourself busy and occupied? have you been getting out of the house as much as usual? have you felt on the whole that you were doing things well? have you felt constantly under strain? have you found everything getting too much for you? have you been feeling unhappy and depressed? Maybe you look back at this, and think, maybe those two factors are correlated, because there is a grand factor to which they are both related. You would want STRONG prior theory Like mine

In order to get this to run, I had to keep constraining more and more parameters to be equal to. And still, even if it ran, bad things kept happening Totally crazy 9 trillion! (variance would be 77 octrillions) But, something even MORE important: The names of the two mediating latent vars can be (sort of) inferred from its indicators. BUT, what about the name for the eogenous latent variable It s hard to figure out what things really are when you have latent variables (with no indicators of their own) pointing to other latent variables.