1000-9825/2002/13(09)1813-10 2002 Journal of Software Vol13, No9 1, 1, 2, 2, 2 2, 1 (, 710049); 2 (,100080) E-mal: {hchen,nnzheng}@aarxjtueducn; {-llang,yl,yxu,hshum}@mcrosoftcom http://wwwaarxjtueducn; http://researchmcrosoftcom/asa/group/research/vsualzeasp :,, 3 :,,,, : ; ; ; : TP391 : A,,,,,, [1~3] [2,3], [4,5],,,,,,, :, [1],,,,, : 2002-03-05; : 2002-06-18 : (60024301) : (1975),,,, ; (1952),,,,,,, ; (1974),,, ; (1973),,,, ;,,,, ;,,,,,
1814 Journal of Software 2002,13(9) 1 2,, 1 [4,5],nkwell [6] : CharToon [4] :,,, Brennan [7] Murakam PCASSO [8] Web-PCASSO [9],,,,, [10], [3],, [3],, : [3],,,,Sherstnkey M- [11],Wong [12],, [13,14] Phoneme-vseme [13],,,,, 2, 1, 3 : (1),,, (2),,, (3),,
: 1815 Cartoon edtor nput mage Cartoon generator Cartoon face Exaggerated Cartoon Exaggerated expresson template Cartoon anmator Voce Cartoon anmaton seuence,,,,,,,, Fg1 System archtecture 1 3 [10], : 2, :,,, 2, : ; Sketch generatng Stroke renderng {{ } { S } } { } } { S } { S } Mean- Shape Template fttng Runtme ASM model Non-Parametrc renderng Tranng Feature ponts T T S S Har contour extracton S Cartoon Fg2,,,ASM,,,,, Cartoon generaton The approach ncludes two steps: sketch generaton and stroke renderng 2 :
1816 Journal of Software 2002,13(9) 31 311, 2, : ( );,, S 3 : : S() () p ( S( ) ) p ( S( ) N ( )) = :, :,,,, (MeanShape), ASM [15], p ( S( ) N ( )), T,,,,,, 2,, : (1) (a) ; (b) ) = G( ; (c) T ; (d) T S ; (e) S = G 1 ( S ) (2), 312 [16],,, p (S() N ()),, p ( S( ) N ( )),,, l l ASM, (), K N Ω ( ) = {( S ( ), N S( ) { S ( ), = 1,, K}, : ( )), = 1,, K},,, K p ( S( ) N ( )) = α δ ( S( ) S ( )) (1) = 1 1 α = exp( d( N ( ), N ( ))), (2) Z N
: 1817 d(,),, Z, T : K T ( ) = α S ( ) (3) = 1, K, K,,, K 25%,, 7 7,, l 3 3, [17], 3 3 313,,, S = { L : = 1,2,, k}, k ; L, L =, θ } : { c c c = 0, c = 1 ; θ = {( x j, y j ); j = 1,2,, n}, n /,,,, : k p( S) = p( c ) p( θ ), (4) = 1,, p c ), p θ ), ( ASM [15],, ASM,,,, P θ ),, P c ) 314, ASM,,,,, 3(b), 32 3, : : ; :,, Catmull-Rom,,, 3, 3, ( ( (
1818 Journal of Software 2002,13(9), (a) Orgnal mage (a) (b) Generated Sketch (b) Fg3 Generated Cartoon 3 (c) Generated Cartoon (c) 4,,,,, 4 Fg4 User-nterface of Cartoon edtor 4, 7 : : (1) : (2) : (3) :, (4) :,, (5) :,,, 5 5,, phoneme-vseme [13,14], ( MFCC ),,
: 1819 (a) Orgnal mages (a) (b) Realstc-lookng Cartoon faces (c) mpressve Cartoon expressons and exaggerated Cartoon wth a partcular stroke style faces generated by applyng pre-desgned templates (b) (c) Fg5 51 Cartoon faces generated by ths system 5 MFCC(mel-freuency cepstrum coeffcents)delta coeffcents [18] 18 α,, v π µ, [19] n ( n =15 ),, v j : T j = arg mn( v µ ) R k k 1 k ( v µ ), k R k = 1,, n (5),,, α σ 6 j 52 7, 40 α,,, j
1820 Journal of Software 2002,13(9) u 18 2 1 ( α = j αj ) p( α µ ) exp (6) 2 j = 1 2πσ 2σ j j Proto-Lp Classfcaton Lp tranng data Synchronzaton Correspondng audo data - - Modelng Lkelhood p( ap(a µ µ ) ),,,, Fg6 Model tranng of lp-syncng algorthm 6 aa New audo Lkelhood p(a µ ) p( a µ ) Bayesan estmaton p(µ a) p( a) µ Syntheszed output n n vv= = µ pµ (µ p (µ a) a) = 1 = 1,,, Fg7 Synthess process of lp-syncng algorthm 7, p( α µ ) p( µ ) p( µ α) = n, (7) p( α µ ) p( µ ) p( µ ) = π = 1
: 1821 v = n = 1 µ ( µ α) (8) p,, 6 5 5(b) 256 256, 40, V14Ghz PC 2 5,,, 5(c) 3, 8 Fg8 Generated Cartoon anmaton seuence 8,,,,,,,,,, References: [1] Hsu, SC, Lee, HH, Wseman, NE Skeletal strokes n: UST 93 Proceedngs of the ACM SGGRAPH and SGCH Symposum on User nterface Software and Technology Secrets of the Face 1993 197~206 [2] Ostromoukhov, V Dgtal facal engravng n: Proceedngs of the ACM SGGRAPH 1999 1999 417~424 [3] Durand, F Decouplng strokes and hgh-level attrbutes for nteractve tradtonal drawng n: Proceedngs of the Eurographcs Renderng Workshop 2001 2001 71~82 [4] Ruttkay, Z, Noot, H Anmated chartoon faces n: Proceedngs of the 1st nternatonal Symposum on Non-Photorealstc Anmaton and Renderng 2000 2000 91~100 [5] Flash http://wwwmacromedacom/software/
1822 Journal of Software 2002,13(9) [6] Ltwnowcz, PC nkwell: a 25-d anmaton system Computer Graphcs, 1991,25(4):113~122 [7] Brennan, SE Carcature generator [MS Thess] Cambrdge, MA: MT Press, 1982 [8] Tomnaga, M, Fukuoka, S, Murakam, K, et al Facal carcaturng wth moton carcaturng n PCASSO system n: Proceedngs of the EEE/ASME nternatonal Conference on Advanced ntellgent Mechatroncs 97 1997 30~37 [9] http://wwwkosh-labsccschukyo-uacjp/~fuj/pca2 [10] Edwards, B The new drawng on the rght sde of the bran Harper Collns, 1999 [11] Sherstnsky, A, Pcard, RW M-lattce: a novel non-lnear dynamcal system and ts applcaton to halftonng n: EEE nternatonal Conference on Acoustcs, Speech, and Sgnal Processng (CASSP), Vol 2 1994 /565~/568 [12] Wong, EC Artstc renderng of protrat photographs [MS Thess] Cornell Unversty, 1999 [13] Bregler, C, Covell, M, Slaney, M Vdeo rewrte: drvng vsual speech wth audo n: Proceedngs of the ACM SGGRAPH 97 1997 353~360 [14] Morshma, S, Azawa, K, Harashma, H An ntellgent facal mage codng drven by speech and phoneme n: Proceedngs of the EEE CASSP 1989 1795-1798 [15] Cootes, TF, Taylor, CJ Statstcal models of appearance for computer vson Techncal Report, Manchester: Unversty of Manchester, 2000 [16] Efros, AA, Leung, TK Texture synthess by non-parametrc samplng n: Proceedngs of the 7th nternatonal Conference on Computer Vson 1999 1033~1038 [17] Xu, Yng-ng, Guo, Ba-nng, Shum, H Chaos mosac: fast and memory effcent texture synthess Techncal Report, MSR-TR-2000-32, Mcrosoft Research, 2000 [18] Rabner, L, Juang, BH Fundamentals of Speech Recognton Prentce Hall, 1993 [19] Bouman, CA Cluster: an unsupervsed algorthm for modelng Gaussan mxtures Software Manual, http://wwwecepurdueedu/ ~bouman A Personalzed mage-based Cartoon System CHEN Hong 1, ZHENG Nan-nng 1, LANG Ln 2, L Yan 2, XU Yng-ng 2, Shum Heung-Yeung 2 1 (nsttute of A and Robot, X an Jaotong Unversty, X an 710049, Chna); 2 (Mcrosoft Research Asa, Bejng 100080, Chna) E-mal: {hchen,nnzheng}@aarxjtueducn; {-llang,yl,yxu,hshum}@mcrosoftcom http://wwwaarxjtueducn; http://researchmcrosoftcom/asa/group/research/vsualzeasp Abstract: n ths paper, a Cartoon system s presented, whch can generate a personalzed Cartoon face from an nput mage The system s desgned for general users After a realstc-lookng Cartoon face automatcally generated by ths system, attractve Cartoon expressons and anmaton can be created wth lttle user-nteractons The system conssts of three man components, an mage-based automatc Cartoon generator, a template-based Cartoon edtor, and a speech-drven Cartoon anmator An example-based approach s taken by Cartoon generator to generate the Cartoon face whle capturng an artst style An nhomogeneous non-parametrc samplng s employed to learn the statstcal relatonshp between the orgnal mage and the drawng For ease edtng, pre-desgned expresson templates are suppled by Cartoon edtor A real tme lp-syncng algorthm s also developed to make Cartoon anmaton easly The Cartoon and anmaton generated by ths system have personal style Ths s the man advantage of ths system over others Key words: non-photorealstc renderng; example-based learnng; non-parametrc samplng; lp-syncng Receved March 5, 2002; accepted June 18, 2002 Supported by the Natonal Natural Scence Foundaton of Chna under Grant No60024301