EW OOLS FOR BAYESIA IFERECE: HE VARIAIOAL AROXIAIO olaos. Galatsanos ECE Dept. Unv. of atras, Greece. Dmtrs zas CS Dept. Unv. of Ioannna, Greece. Outlne Introducton Bayesan Inference Bascs A Estmaton Conugate Dstrbutons Graphcal odels E Algorthm An Alternatve Vew of the E he Varatonal E framewor Examples Lnear Regresson Blnd Image Deconvoluton Image Restoraton. Constraned. Bounded Gaussan xture odelng Conclusons Introducton Introducton homas Bayes (7 76, left, frst dscovered Bayes theorem n 764. However, Bayes n hs theorem used unform prors. erre Smon Laplace (749 87, rght, unaware of Bayes wor, dscovered the same theorem n more general form n a memor he wrote at the age of 5 and showed ts wde applcablty. (a (b (a: # of papers /year n IEEE Journals E Algorthm (b: # of papers /year n IEEE Journals Varatonal ethodology Applcatons of Varatonal Approxmaton. xture odelng of pdfs & Clusterng. ICA & CA Analyss 3. Learnng RFs 4. Dynamc Systems odelng 5. Image Recovery 6. Vsual racng 7. Dgtal Communcatons 8. Acoustcs and Speech rocessng 9. Learnng from Data Bases Sample of US atents that Use US atent 6879944 Varatonal relevance vector machne, Issued on Aprl, 5 US atent 655696 Varatonal nference engne for probablstc graphcal models, Issued on Aprl 9, 3 US atent 8567 Removng camera shae from a sngle photograph US atent 699447 ethod and apparatus for denosng and deverberaton usng varatonal nference and Strong Speech odels, Issue date: Jan 4, 6. US atent 65946 ethod for learnng swtchng lnear dynamc system models from data, Issue date: Jul 8, 3 US atent 693374 ethod of speech recognton usng varatonal nference wth swtchng state space models, Issue date: Aug 6, 5. US atent 4/6548 Varatonal nference and learnng for segmental swtchng state space models of hdden speech dynamcs, US atent 4/5493 A Systems and methods for tractable varatonal approxmaton for nference n decson graph Bayesan networs, Issue Feb 7, 7 ublcaton number US 7/8398 Data securty and ntruson detecton, ublcaton number: US 5/7657 A Dagnostc marers of mood dsorders and methods of use thereof ublcaton number: US 7/3339 A opulaton sequencng ublcaton number: US 6/846 A layer ranng wth partal nformaton ublcaton number: US 7/6578 A eam matchng ublcaton number: US 7/9854 A Detectng humans va ther pose
Bayesan Inference Bascs Estmaton > arameter Observatons: x arameters: Lelhood Functon: p x; axmum Lelhood Estmaton ( ( parameter ( ˆ arg max p x; L Bayesan Inference Bascs Inference > Random Varable Fnd osteror p ( x osteror ORE nformaton than pont estmate E( x SE estmate Var x accuracy of estmates ( Bayesan Inference Bascs Hdden Varables z: Descrbe data generaton mechansm (graphcal model lns between observatons and parameters Easy to compute p( x z Introduce prors p( z; Bayesan Inference Bascs Fnd Lelhood, argnalze Hdden Varables ( ; (, ; ( ; ( ; p x p x z dz p x z p z dz Fnd osteror p x z p z x; ( ( ; p( z; p( x; In most cases of nterest Cannot argnalze Bayesan Inference Bascs an effort n Bayesan Inference technques bypass or approxmate margnalzaton ntegral. Random samplng methods onte Carlo Determnstc approxmatons Laplace Varatonal A Estmaton Defned as mode of posteror : ˆ A argmax p x Based on Bayes theorem p( x p( p( x p x Can be found as: ( ˆ A argmax px ( p( (
A Estmaton o need for p( x no margnalzaton A Estmator uch easer to fnd ode of posteror o nformaton about shape of posteror Uses Bayes heorem, however posteror not found Aoor an s Bayesan Inference Conugate rors Fnd pror whch allows closed form margnalzaton of hdden varables ( (, ( ( p x p x z dz p x z p z dz Conugate rors Example #: μ hdden varable ( μ; σ ( ; μσ,, ( μ; μ, σ ( ; μ, σ ( ; μ, σ, σ ( μσ ; ( μ; μ, σ μ px x p x px px p d Conugate rors px ( μσ ; f ( μ exp ( μ μx σ p ( σ ( μμ ;, σ g( μ exp ( μ μμ px μ; σ, and pμμ ;, σ conugate ( ;, w.r.t. to μ both p x μ; σ and p μ μ σ have the same form (Gaussan. ( ( Conugate rors argnalzng μ possble (Gaussan Integral: ( ; ( (,, ; ;, p x μ σ σ p x μ σ p μμ σ dμ x; μ, + σ σ osteror: σx + σ μ p( μ x; σ, μ, σ μ;, + σ σ + σ σ Conugate rors Example #: a hdden varable ( ( p x a x;, a, b b ca exp p( abc ;, Gamma( abc ;, Γ ( ca ( b
Conugate rors p( x a and p ( abc ;, w.r.t. to a both have the same form (Gamma. / ax p( x a f ( a a exp p abc g a a ac b ( ;, ( exp( p( x a ( and p abc ;, conugate Conugate rors argnalze a possble / Γ ( b + / b x p( x; b, c c c+ Γ( b π b/ Can wrte as Student s t wth ν, b λ b/ c / Γ ( ν /+ / λ λx px ( ; λν, Stx ( ; λν, + Γ( ν/ πν ν osteror x p( a x; b, c Gamma a; b+, c+ ν // Lelhood Conugate ror Dstrbuton Conugate rors osteror Dstrbuton ( σ ab Gamma σ + n /, + n ( / μ n Σ ν V Wshart ( Σ ν + n, V+ ( X ( μ X μ ( X μσ, ( μμ, Σ μ ( Σ + nσ ( Σ μ + nσ x,( Σ + nσ ( x μ σ, (, Gamma(, X μσ Wshart (, ultn ( X π Dr( π a n + Dr( π a X Graphcal odels Represent dependences between rv.s n a statstcal model Graph nodes represent rv.s and edges dependences Drected and undrected graphs. Undrected arov Random Felds Rest of presentaton: drected, no cycles graphs Graphcal odels x rv assocated node s, π ( s parents of s s ( p xs x π ( s condtonal pdf Graphcal odels Example: pabcd,,, ; pa; pba; pca ; pdbc, ; ( ( ( ( 3 ( 4 Jont pdf over all varables p( x p( xs x π ( s s
E Algorthm x- observatons, z hdden varables, -parameters. Defne: old Q, E ln p x, z; old ln p( x, z; p( z x; dz ( ( old pzx ( ; E Algorthm old. Intal selecton. E step: Evaluate p(z x; old new 3. step: Evaluate new arg max Q, old ( 4. Chec for convergence parameters of log lelhood, f not satsfed, go to. An Alternatve Vew of the E Algorthm Can wrte: ln p( x; F( q, + KL( q p pxz (, ; Fq (, qz ( ln dz qz ( pz ( x; KL( q p q( zln dz qz ( q( z any pdf, KL( q p Kullbac Lebler Dvergence An Alternatve Vew of the E Algorthm *** KL( q p ln px ( ; Fq (, ln px ( ; Fq (, when KLq ( p or qz ( pz ( x; An Alternatve Vew of the E Algorthm *** OLD Substtute qz ( pz ( x ; Can wrte: OLD Fq (, pz ( x; ln pxz (, ; dz Same as before: OLD OLD p( z x; ln p( x z; dz OLD Q(, ant [ ] p ( z x ( OLD Q, E ln p ( x, z; ; OLD he Varatonal E framewor *** Assume p(z x; unnown F(q, functonal n terms of q(z Varatonal E Varatonal E step: Varatonal step: OLD ( EW ( q EW ( z max F q, EW arg max F q,
he Varatonal E framewor *** Key ssue: maxmze F(q, w.r.t. q(z? Assume parametrc form for q(z q(z approxmates unnown posteror p(z x ln p( x; F( q, + KL( q p max F(q, >mn KL(q p ean Feld Approxmaton*** Assumpton: q(z factorzes qz ( q( z ean Feld approxmaton, statstcal physcs ean Feld Approxmaton*** hen optmal factor q( z s: wth exp ln pxz (, ; exp ln pxz (, ; q ( z ln p( xz, ; ln pxz (, ; qdz dz Conugate Exponental models ror dstrbutons belong to the exponental famly p( X Y exp φ( Y u( X + f( X + g( Y Graphcal model wth conugate prors at each level hdden z, parents π ( z p ( z π ( z conugate to p( π( z π( π( z ( ( ( p z ππ ( ( z p z π( z p π( z ππ ( ( z dπ( z Conugate Exponental models p(x z,p(z z,p(z exponental dstrbutons p(z conugate to p(z z p( x z p( x z p( z z dz (x z conugate to p(z z z p( z p( z z p( z dz Cannot evaluate margnal z ( ( ( p x p x z p z z p( z dzdz x Conugate Exponental models ractable Varatonal computatons qz ( exp ln pxz (, ; ln qz ( ln px ( z; + ln pz ( z; q z ( ln qz ( ln pz ( z; + ln pz ( ; q( z
Lnear Regresson Examples Lnear Regresson Observatons at t n fnd y(x tn y( xn + ε n, n,, Sgnal y(x modeled by Lnear Regresson y( xw ; wmφm( x m Observaton model t y ( x ; w + ε, Φ X desgn matrx n n n n t Φw + ε φ φ φ ( φ x φ x Φ (,,, (,, ( m m m p β p Lnear Regresson Gaussan addtve nose ( ε ( ε, I Lelhood of observatons (; tw, β ( t Φw, β I Lnear Regresson: Least Squares (w parameters Graphcal odel Lnear Regresson: arameter Estmaton nmze mean square error E [ ] LS ( w t Φw tn y( xn; w n Soluton w LS ( ΦΦ Φt β t Φw axmum Lelhood Estmaton wl p t w t Φw, I arg max ( ;, β arg max ( β w w
Lnear Regresson: roblems wth Least Squares (w parameters axmum Lelhood Lmtatons: observatons parameters to estmate, >> Otherwse ΦΦ Ill condtoned L estmates large varance Assume constrants on the parameters w Bayesan: Use pror dstrbuton p(w Bayesan Lnear Regresson: odel (w Gaussan d dstrbuted Gaussan weght pror p w α wm, α ( ; ( m Why Gaussan? Conugate to lelhood Bayesan Lnear Regresson: Inference osteror gven by Bayes s law p( t w; β p( w; α p( w t; αβ, p( t; αβ, Can be found n closed form p( w t; αβ, ( w μ, Σ μ β ΣΦ t Σ ( βφ Φ+ αi Bayesan Lnear Regresson: arameter Estmaton axmum Lelhood αβ, { β I α ΦΦ t ( β I α ΦΦ t} ( αl, βl argmn p( t; α, β p( t w; β p( w; α dw argmn log + + + αβ, ot straght forward constraned optmzaton because α>, β>. Resort to E algorthm! Bayesan Lnear Regresson: E Algorthm Observatons t, parameters α, β; hdden varables w. Key for the applcaton of E: p(w t explctly nown E step: p(w t; α, β obtaned, nference of hdden varables step: L estmates of parameters Bayesan Lnear Regresson: E algorthm ( t Q ( twαβ ln p( twαβ ( t ( t p( wt ; α, β, ;,, ;, ln p( t w; αβ, p( w; αβ, ( t ( t p( wt ; α, β ( α ( t Φw α α ( w Q ( tw, ; αβ, β ln β t Φw + lnα w ( t β ln β + ln ( tr ( ( t ( t μ tr Σ ( t β ( t ( t Q ( tw, ; αβ, ln β t Φμ + [ Φ Σ Φ] α + ln α + [ ]
Lnear Regresson: E algorthm, E step E step: Evaluate p(w t;α (t,β (t (μ (t,σ (t Σ ( β Φ Φ+ α I μ ( t ( t ( t β Σ Φ t ( t ( t ( t Bayesan Lnear Regresson: arameter Estmaton: step step ( t ( t ( t ( α +, β + argmax( αβ, Q ( tw, ; α, β ( t Q, ;, ( ( t [ ( t μ tr Σ ] ( ( t [ ( t t Φμ tr ΦΣ Φ] ( twαβ + α α ( t Q ( tw, ; αβ, + β β ( t α + μ ( t ( t + tr[ Σ ] ( t β + ( t ( t t Φμ + tr[ ΦΣ Φ] Sparse Bayesan Lnear Regresson Lmted model: w d How to select bass functons? Sparse Lnear odel Consder many bass functons Estmatons use only few bass functons Advantages Small varance (good generalzaton Fast Evaluaton of estmaton Sparse Bayesan Lnear Regresson: ror Dstrbuton ew ror: w not dentcally dstrbuted ( w α ( m, m m p w α parameters to estmate, observatons Use Conugate Hyperprors p( α; a, b Gamma( α a, b m p( β; c, d Gamma( β c, d m Sparse Bayesan Lnear Regresson: ror Dstrbuton Graphcal odel: w, α hdden varables, a,b,c,d, parameters Sparse Bayesan Lnear Regresson: ror Dstrbuton rue weght pror s Student s t p( wa ;, b p( w α p( α; a, b dα m m m w, α Gamma α a, b dα ( ( m m m m St( w λν, m
Sparse Bayesan Lnear Regresson: osteror: p( t w, β p( w α p( α p( β p( w, α, β t p( t Cannot compute p( t p( t w, β p( w α p( α p( β dwdαdβ Varatonal ean Feld Approxmaton p( w, α, β t q( w, α, β q( w q( α q( β Sparse Bayesan Lnear Regresson: ln q( w ln p( t, w, α, β q α q ( ( β ln pt ( w, β p( w α q α q ( ( β ln pt ( w, β + ln p( w α q( α q ( β β ( t Φw ( tφw α mw m m α q q β ( ( β [ tt tφw + wφφw ] α m m w m w ( β Φ Φ + A w β w Φt+ const w Σ w w Σ μ+ const q( w ( w μσ, Σ β Φ Φ+ A μ β ΣΦ t ( Sparse Bayesan Lnear Regresson: ln q( α ln p( twα,,, β q( w q( β ln p( w α p( α w + a b a w + b q( w ln αm αm m ( lnαm αm m m m m ln m m m α α m m lnαm b mαm const m m a + q( α Gamma( α a, b a a+ / m m m bm b+ wm Sparse Bayesan Lnear Regresson: ln q( β ln p( twα,,, β q( w q( α ln p( t w p( β q( w q( α ln β β t Φw + ( c ln β dβ c + lnβ + d β+ const t Φw c ln β d β q( β Gamma( β c, d c c+ / d d + t Φw Sparse Bayesan Lnear Regresson: Fndng the requred expectatons q( β Gamma( β c, d, β c / d q( α Gamma( α a, b, α a / b m m m m m q w ( w ( w μσ,, m μm + Σmm { } t t t t t Φw t + tr ΦΣΦ + μ Φ Φμ t Φμ Sparse Bayesan Lnear Regresson: arameter Estmaton arameters a,b,c,d? Use fxed values that defne unnformatve prors Estmate parameters n Varatonal step For fxed a,b,c,d terate only between VE step for q(α, q(β and q(w
Lnear Regresson Example Lnear Regresson Example.8.6.4 orgnal observatons L(SE7.4e- Bayesan(SE4.9e-.5 Sgnal estmate Bass functons. Varatonal(SE3.7e- RVs(5.5.8.6.5.4. -. -.5 - -8-6 -4-4 6 8 - - -8-6 -4-4 6 8 Blnd Image Deconvoluton Examples Blnd Image Deconvoluton Unnown: Blur SF h rue Image f g h f + n g Hf + n Fh + n Convoluton Observed Image g Blnd Image Deconvoluton Restored Image Blnd Image Deconvoluton Unnown quanttes (f,h twce than nown (g ropertes that model should mpose: SF: smooth, lmted support Image: smooth, preserve edges ose: robustness Blnd Image Deconvoluton: ose odel Robust nose model Student s t pdf ( n β (, p n β β β p( β Gamma( β a, b pn ( pn ( β p( β dβ Student's t
Blnd Image Deconvoluton: SF odel SF: Sparse Lnear model Bass functons are Gaussan ernels hx ( wφ ( x h Φw φ ( x K( x, x Φ, K( x, x Sparseness weght pror α α p( w α ( w, α, p( α Gamma( α; a, b p( w p( w α p( α dα Student's t Blnd Image Deconvoluton: Image odel Drectonal mage dfferences ε ( x, y f( x, y f( x+, y ε ( xy, f( xy, f( xy, + ε γ γ γ ( ( p( γ Gamma ( γ a, b p( f γ f, ( Q ΓQ p( ε, γ ( Jont pdf Blnd Image Deconvoluton: Graphcal odel p( gf,, wαβγ,,, ; p( g f, w, β p( f γ p( w α p( β p( γ p( α Blnd Image Deconvoluton: Varatonal Bound ean feld approxmaton q( fwαβγ,,,, q( f q( w q( α q( β q( γ axmzaton of Varatonal Bound results n: log q( f log p( g, f, w, αβγ,, ; q( w q( α q( β q( γ log q( w log p( g, f, w, αβγ,, ; log q( α log p( gf,, wαβγ,,, ; q( w q( f q( β q( γ log q( β log p( gf,, wαβγ,,, ; q( w q( α q( f q( γ log q( γ log p( gfwαβγ,,,,, ; q ( w q ( α q ( f q ( β q( f q( α q( β q( γ Blnd Image Deconvoluton: Fnd approxmate posteror q(f ean Feld Optmzaton log q( f ( ΦWf g B( ΦWf g f Q ΓQf ( w q( β q( γ q f ( Φ W BWΦ + Q ΓQ f+ f Φ W Bg ( w q( β q( γ q f ( Φ W BW Φ + Q Γ Qf + f Φ W B g Completng Square results n q( f ( f μ f, Σ f μ Σ fφ < WB> g f Σ Φ < WBW> Φ + Q < Γ > Q f ( Blnd Image Deconvoluton: Approxmate posteror q(w Smlar calculatons log q( w ( FΦwg B( FΦwg w Aw q( f q( β q( α w ( Φ FBFΦ+ A w+ w Φ FBg+ const q( w ( w μ, Σ w ( w w w < > μ Σ Φ FB g Σ Φ < FBF> Φ+< A > w
Blnd Image Deconvoluton: Approxmate posteror q(α Based on the mean feld approxmaton log q( α log p( w α + log p( α q( w logα α w + ( a α logα b α α q( w ( a log α α ( b w α const α + + + Whch mples α α q( α Gamma( α a, b α α a a + / + < > b α α b w Blnd Image Deconvoluton: Approxmate posterors q(β and q(γ Smlarly we get β β q( β Gamma( β a, b K γ γ q( γ Gamma( γ a, b β β a a + / β β b b + < nn > n FΦwg γ γ a a + / b γ γ b + ( < > ( Q ff Q Blnd Image Deconvoluton: Statstcs of Approxmate osterors Blnd Image Deconvoluton: Statstcs of Approxmate osterors ae dagonal and crculant approxmatons hs results n Blnd Image Deconvoluton: Overall V E Algorthm Fx parameters { a α, b α, a β, b. β a γ, b γ } to yeld unnformatve hyperprors (no Varatonal step. Blnd Image Deconvoluton: Example SF: Square 7x7, 4db nose Iterate between estmates of statstcs of q(f, q(w, q(α, q(β and q(γ (only Varatonal E step.
Blnd Image Deconvoluton: Examples true V StStSt Example: Image Restoraton (Constraned * G. Chantas,. Galatsanos, A. Las and. Saunders, "Varatonal Bayesan Image Restoraton * Based on a roduct of Dstrbutons Image ror", IEEE rans. on Image rocessng, to appear. (avalable on lne Image Restoraton Image Restoraton: roblem Defnton Known SF h: ose Observed Degraded Image g Imagng odel ( pxels g h* f + n Hf + n Convoluton * rue Image f + g: ( : Degraded (Observatons h: (,H:( : ont Spread Functon (nown f: ( : Orgnal Image (unnown n~, β I : ose (unnown ( Image Restoraton: f arameter Lelhood of Observatons p( g; f, β ( Hf, β I ( ˆ f H H H g { p ( β } fˆ arg max g; f, arg mn g Hf Graphcal odel f roblematc too many parameters g β Image Restoraton: f arameter Example f SR log n Double crcled observed r.v. SR~8dB SR~dB
Image Restoraton: Bayesan Inference (f: hdden r.v., Image Restoraton: Bayesan Inference (f: hdden r.v. Error from Local Lnear redctor fˆ ( f ( ( f( + f( + ε ( Assume Gaussan..d. redcton Errors ( ε ( (, α p ( ε ( ε ( p p Smultaneously Autoregressve (SAR ror Qf ε, Q Operator that descrbes ε ff -ˆ p( f; a α exp α f Q Qf Graphcal odel a f g Double crcled observed rv β Image Restoraton: Bayesan Inference (f: hdden r.v. Image Restoraton: Bayesan Inference (f hdden r.v. Observaton Lelhood (computed analytcally ( β ( β ( p g;, a p g f; p f; a df osteror of Hdden (computed analytcally ( f g;, β ( μf g, Σf g p a Bayesan Inference va E Small α/β Amplfes ose Large α/β Smoothes Edges Image Restoraton: Spatally Varyng Bayesan odel Image Restoraton: Spatally Varyng, Bayesan Inference Spatally Varyng ( ε ( (, Use many ε,, n pror Qf p a α hdden varables Bayesan Inference Conugate pdf p ( α Gamma( a ; αβ, roduct ror p( f a p f a, p f a exp f QAQf ( ( ( Z a ( (, ( ( A dag a a a ror: Enforces many propertes smultaneously
Image Restoraton: Spatally Varyng Bayesan Inference Graphcal odel α,,β α,β α,β α α α Image Restoraton: Spatally Varyng, Bayesan Inference Dffculty: Compute normalzaton of pror Z ( a K det QAQ Q,A (Νx, 5-6 f β g Image Restoraton: Spatally Varyng, Bayesan Inference Image Restoraton: Spatallyvaryng, Bayesan Inference Change Observatons Doman QgQHf+Qn, y Hε +n,,..., Hdden Varables: l [ ε, ε ε ], a [ a, a a ] [ ε (, ε ( ε ( ], a [ a (, a ( a ( ] ε ε l l l l l l l β v λ α ε β v λ α ε... β v λ α ε ε κ Q f,,.. y y y Image Restoraton: Spatally Varyng Bayesan Inference ror on ε O dffculty normalzng p( ε a p( ε ( a(, ( ( ε ( (; λ ε (;, λ ( ν ν ( (; ν (;, p a a p a Gamma a Cannot margnalze hdden varables Resort to Varatonal ethodology Image Restoraton: Spatally Varyng, Bayesan Inference osterors ean Feld Approxmaton ( ε a ( ε ( a q, q q,,..., axmze Var. Bound q( ε ( μ, Σ, μ β ΣHg, Σ HH+ QAQ ( β λ ew problem: Dfferent f for each μ
Image Restoraton: Constraned Defne Constraned osteror q( ε ( QmQRQ, Consstent wth ( ( ( ( ε Qf, m E f, R E fm fm axmze the Varatonal Bound w.r.t. m and R Image Restoraton: Constraned t+ ( t+ ( t VE-step: q, arg max F q α,, q ( α, [ Rm] ( ( α ( t+ ( t+ ( t+ F q ( α ( ( ( ( ( V-step: arg max,,,, [ β, λ,..., λ, ν,..., ν ] Image Restoraton: Constraned VE Step Fq (, q( ε; q( a log p( y, ε, a ; dd ε a q( ε; q( a log q( ε; q( a dε da, F F ( F ( q( ε; q( a log p( y ε; p( ε a; dεda q( ε; log q( ε; dε. Image Restoraton: Constraned q( ε ; q( a log p( y ε; p( ε a; dεda - β( Hε -y Q Q ( Hε -y λε Aε q( ε ; q ( a β λ trace β + λ q ( ε; log q ( ε; d ε logdet R Image Restoraton: Constraned ˆ trace β + λ logdet F HHR ( QAQR R R R ˆ β λ ˆ HH+ QAQ R R βhh+ λqaq F ( m βrh g m Image Restoraton: Constraned q( a exp log p( y, ε, a q( ε ν + ν ( ( a exp a( λ( ( m ( + C (, a( ( ν ν ( ( (;, ( ( q a Gamma a + + λ m + C (, m Q m, and C Q RQ
Image Restoraton: Constraned V step ( t+ ( t+ df ( q ( a, log (,, ; ( (, d p y ε a t+ t+ q( ε ;, q ( a d d β ( Hm - g trace{ H HR } + t t t ( + ( + ( + ( ( (, a ( ( t m + C q ( a ( ( t λ + ( t ( t + ( t ( log a ( a ( ( t ( ψ ν + + ( t ( + + q q a a ( t ν ν log ν + ψ + log + d ψ ( x log Γ( x dx Image Restoraton: Constraned. Intalze, m statonary model.. Repeat untl convergence: VE step: Update, m and R, calculate m and C. Calculate expected value w.r.t. q(a (, needed for V step and the next VE step. V step: Update β, λ κ, ν κ,,... 3. Use m as restored mage estmate. Image Restoraton: Constraned Example: Image Restoraton (Bounded * Babacan, S.D.; olna, R.; Katsaggelos, A.K.; arameter Estmaton n V Image Restoraton * Usng Varatonal Dstrbuton Approxmaton, IEEE rans. on Image rocessng, Volume 7, Issue 3, arch 8 age(s:36 339 Image Restoraton: Bounded Imagng odel ( pxels g h* f + n Hf + n Observatons Lelhood ( β p( g/ f; β~ Hf, I Image Restoraton: Bounded V Based Image ror p( f a α exp( αv( f V( f ([ Qf] + ([ Qf], [ ] [ ] Qf f(, f(,, Qf f(, f(, Conugate Hyperpror p ( α; ab, Gamma( α; ab,
Image Restoraton: Bounded Graphcal odel a, b α f Image Restoraton: Bounded Dffculty n VE step log q( α log p( gf,, α; β, a, b q( f β Hf g α, ([ Qf] + ([ Qf] Due to q( f cannot Compute Expectaton β g Image Restoraton: Bounded Bypass dffculty: axmze a Lower Bound of Varatonal Bound w+ u Use Upper Bound: f( w w g( u, w, u> u Image Restoraton: Bounded Defne Functon w ([ ] ([ ] u + + p( f α ( f, α, u α exp a Qf Qf u p( gf,, α ; p( g/ f ( f, α, u p( α ( gfu,,, α ; Bound gets tght : f ( w g( u, w, u w Lower Bound of Varatonal Bound F( q q, ln d d F ( q, ( gfu,,, α ; b, ( f α f α, u q ( f, α Image Restoraton: Bounded VE Step q (, f q ( α argmax F (( q f, q( α, u, ( t+ ( t+ b ( t ( t q( f, q( α V Step Bound ghtenng u ( t+ b ( t+ ( t+ ( t arg max F ( q ( f, q ( α, u, u ( t b ( t ( t ( t + arg max F ( q + (, q + ( α, +, f u Image Restoraton: Bounded ghtest Bound f( w g( u, w, u w ([ ] ([ ] ( t+ + ( t+ q ( f ( t+ ( t+ ( t+ ( t+ Qμ + Qμ + QΣ Q + QΣ Q ( t+ ( t+ ( t+ u Qf Q f,, wth q ( f f; μ, Σ ( u Captures Local Spatal Actvty
Gaussan xture odels Example: Gaussan xture odels.4 p( x; π ( x; μ, Σ.35.3 odel any pdf.5. Soft clusterng.5. arameters.5 { π, μ, Σ},, axmum Lelhood Estmaton Dffcult arg max log π ( x; μ, Σ L 3 3 4 5 6 7 Gaussan xture odels: Data Generaton echansm Introduce bnary hdden varable z. Select component z, z (,,,,,, z z p ( z; π π multnomal. Generate sample from selected component x p ( x ( x; μ, Σ Gaussan xture odels: Data Generaton echansm Jont pdf p( xz, ; p( x z p( z ( x; μ, Σ argnal pdf p( x p ; ( xz, ; z ( π z z π ( ;, x μ Σ Gaussan xture odels: osteror osteror (responsblty can be computed analytcally p( x z p( z π ( x μ, Σ z ( x ( ( p( z x; p x π l l x μ l, Σl π (, π ( l l x μ l, Σl x μ Σ z Gaussan xture odels: arameter Estmaton axmum lelhood L argmax log p( X; arg max log π ( x; μ, Σ Use E Smplfes optmzaton roved convergence Satsfes postvty constrants π >, π
Gaussan xture odels: arameter Estmaton E z z n p( XZ, ; ( xn; μ, Σ ( π n ( t Q(, log p( XZ, ; ( t p( zx ; ( t ( t zn π + zn xn; μ, Σ n n log log ( n Gaussan xture odels: arameter Estmaton E E step z z n ( t ( t ( t ( t ( t ( t π ( ( ( xn; μ, Σ π x t n; μ, Σ n ( t ( t ( t ( t ( t ( t z n π xn; μ, Σ π x n; μ, Σ step ( ( ( t ( t π + z n n μ ( t z ( t+ n n ( t z n n x n Σ z ( xn μ ( xn μ ( ( ( n n t ( + z ( t n n Gaussan xture odels: Lmtatons How many components? Ill condtoned covarance matrces 6 Varatonal Bayesan Gaussan xture odels 4 4 H. Attas, "A Varatonal Bayesan Framewor for Graphcal odels", roc. IS, pp. 9 6, I ress,. 6 6 4 4 6 Varatonal Bayesan Gaussan xture odels: ror Dstrbuton reat parameters as hdden varables h { Z, πμ,, } Introduce conugate prors p ( π α Dr( π,, Γ( α α α α π Γ( α Varatonal Bayesan Gaussan xture odels: ean Feld Approxmaton Exact Bayesan Inference Intractable Varatonal ean Feld Approxmaton q( h qz( Z qπ( π qμ( μ, q ( μ, q ( μ q ( μ μ p( μ ( μ ; μ, β ( ν d / ν, ν, d ν d/ / n/ d( d 4 p( V W( V exptr V π V Γ (( ν + /
Varatonal Bayesan Gaussan xture odels: Approxmate osterors Varatonal Bayesan Gaussan xture odels: Approxmate osterors Z n z n q ( Z r n d log rn logπ + log ( yn μ ( y n μ β β ( β / ( β q ( μ ( μ ; m, μ m μ + μ + β + β μ z nyn n q ( π Dr( π { λ } π λ + α z n n q ( W( ; η, U η + ν ( ( / ( β U Σ + β μ μ μ μ + + n( n ( n n Σ z y μ y μ Varatonal Bayesan Gaussan xture odels: Dscusson Select arameters that defne unnformatve prors (e.g. α / Advantages Dsallow sngular covarance matrce Bayesan model selecton Drchlet dstrbuton for mxng coeffcents π dsallows prunng of unnecessary components Varatonal Bayesan Gaussan xture odels: Removng the pror from the mxng weghts A. Corduneanu and C. Bshop, Varatonal Bayesan odel Selecton for xture Dstrbutons, roc. AI and Statstcs Conference, January Varatonal Bayesan G: Removng the pror from the mxng weghts reat π as parameter Include step to update π r n n π r n n r n rn r n r π exp log tr x x x μ + μ x + μμ n n n n n Advantage Elmnates rrelevant components Varatonal Bayesan G: Example.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5
Incremental Varatonal Bayesan Gaussan xture odels Incremental Varatonal Bayesan Gaussan xture odels Solutons depend on: maxmum ntal number of components ntalzaton of component parameters specfcaton of the scale matrx V of p( Wshart(v, V Constantnopoulos C. and Las, A., "Unsupervsed Learnng of Gaussan xtures Based on Varatonal Component Splttng", IEEE rans. on eural etwors, vol. 8, no. 3, pp. 745-755, 7. Incremental Varatonal Bayesan Gaussan xture odels Dvde components as fxed and free Restrct competton among free components only α + s s ( Γ α s π + p( π π π s Γ( α s s + π + s μ X Z π ~ Js π s Incremental Varatonal Bayesan Gaussan xture odels We start by tranng a G wth two components At each step: Select a component Set, V dλi, where λ the max egenvalue of Splt the component n two subcomponents Apply VB learnng consderng the two components as free Incremental Varatonal Bayesan Gaussan xture odels If the data n the regon of component suggest the exstence of more than two components then the two components wll be retaned Otherwse one of the two components wll be removed from the model Incremental Varatonal Bayesan Gaussan xture odels
Conclusons Varatonal Approxmaton ros:. Very Flexble ool. ce heoretcal ropertes 3. Gves ractable Algorthms 4. Appled to any roblems * Varatonal Approxmaton Cons:. ghtness of Bound. Sometmes Dffcult Calculatons