MachineDancing: MikuMikuDance (MMD) *1 MMD MMD. Kinect. MachineDancing. 3 MachineDancing. 1 MachineDancing :

Σχετικά έγγραφα
ΕΥΡΕΣΗ ΤΟΥ ΔΙΑΝΥΣΜΑΤΟΣ ΘΕΣΗΣ ΚΙΝΟΥΜΕΝΟΥ ΡΟΜΠΟΤ ΜΕ ΜΟΝΟΦΘΑΛΜΟ ΣΥΣΤΗΜΑ ΟΡΑΣΗΣ

3: A convolution-pooling layer in PS-CNN 1: Partially Shared Deep Neural Network 2.2 Partially Shared Convolutional Neural Network 2: A hidden layer o

CSJ. Speaker clustering based on non-negative matrix factorization using i-vector-based speaker similarity

Buried Markov Model Pairwise

Area Location and Recognition of Video Text Based on Depth Learning Method

[1] DNA ATM [2] c 2013 Information Processing Society of Japan. Gait motion descriptors. Osaka University 2. Drexel University a)

Gaze Estimation from Low Resolution Images Insensitive to Segmentation Error

MIDI [8] MIDI. [9] Hsu [1], [2] [10] Salamon [11] [5] Song [6] Sony, Minato, Tokyo , Japan a) b)

Optimization, PSO) DE [1, 2, 3, 4] PSO [5, 6, 7, 8, 9, 10, 11] (P)

EM Baum-Welch. Step by Step the Baum-Welch Algorithm and its Application 2. HMM Baum-Welch. Baum-Welch. Baum-Welch Baum-Welch.

ΣΤΟΙΧΕΙΑ ΠΡΟΤΕΙΝΟΜΕΝΟΥ ΕΞΩΤΕΡΙΚΟΥ ΕΜΠΕΙΡΟΓΝΩΜΟΝΟΣ Προσωπικά Στοιχεία:

Fourier transform, STFT 5. Continuous wavelet transform, CWT STFT STFT STFT STFT [1] CWT CWT CWT STFT [2 5] CWT STFT STFT CWT CWT. Griffin [8] CWT CWT

Detection and Recognition of Traffic Signal Using Machine Learning

Architecture for Visualization Using Teacher Information based on SOM

ER-Tree (Extended R*-Tree)

HOSVD. Higher Order Data Classification Method with Autocorrelation Matrix Correcting on HOSVD. Junichi MORIGAKI and Kaoru KATAYAMA

An Automatic Modulation Classifier using a Frequency Discriminator for Intelligent Software Defined Radio

(Υπογραϕή) (Υπογραϕή) (Υπογραϕή)

Query by Phrase (QBP) (Music Information Retrieval, MIR) QBH QBP / [1, 2] [3, 4] Query-by-Humming (QBH) QBP MIDI [5, 6] [8 10] [7]

Automatic extraction of bibliography with machine learning

Toward a SPARQL Query Execution Mechanism using Dynamic Mapping Adaptation -A Preliminary Report- Takuya Adachi 1 Naoki Fukuta 2.

2016 IEEE/ACM International Conference on Mobile Software Engineering and Systems

Yoshifumi Moriyama 1,a) Ichiro Iimura 2,b) Tomotsugu Ohno 1,c) Shigeru Nakayama 3,d)

Vol.4-DCC-8 No.8 Vol.4-MUS-5 No.8 4// 3 3 Hanning (T ) 3 Hanning 3T (y(t)w(t)) dt =.5 T y (t)dt. () STRAIGHT F 3 TANDEM-STRAIGHT[] 3 F F 3 [] F []. :

Mapping Textures on 3D Geometric Model Using Reflectance Image

: Monte Carlo EM 313, Louis (1982) EM, EM Newton-Raphson, /. EM, 2 Monte Carlo EM Newton-Raphson, Monte Carlo EM, Monte Carlo EM, /. 3, Monte Carlo EM

1 n-gram n-gram n-gram [11], [15] n-best [16] n-gram. n-gram. 1,a) Graham Neubig 1,b) Sakriani Sakti 1,c) 1,d) 1,e)

IPSJ SIG Technical Report Vol.2014-CE-127 No /12/6 CS Activity 1,a) CS Computer Science Activity Activity Actvity Activity Dining Eight-He

: Volterra. Volterra 3. (i.i.d.) Volterra cross-correlation. 2.1 Volterra f[ ] y(t) = f[t, u( )] y(t) = h 0 +

476,,. : 4. 7, MML. 4 6,.,. : ; Wishart ; MML Wishart ; CEM 2 ; ;,. 2. EM 2.1 Y = Y 1,, Y d T d, y = y 1,, y d T Y. k : p(y θ) = k α m p(y θ m ), (2.1

1,a) 1,b) 2 3 Sakriani Sakti 1 Graham Neubig 1 1. A Study on HMM-Based Speech Synthesis Using Rich Context Models

The Algorithm to Extract Characteristic Chord Progression Extended the Sequential Pattern Mining

[5] F 16.1% MFCC NMF D-CASE 17 [5] NMF NMF 3. [5] 1 NMF Deep Neural Network(DNN) FUSION 3.1 NMF NMF [12] S W H 1 Fig. 1 Our aoustic event detect

Comparison of Discriminant Analysis in Ear Recognition

Stabilization of stock price prediction by cross entropy optimization

Random Forests Leo. Hitoshi Habe 1

Identifying Scenes with the Same Person in Video Content on the Basis of Scene Continuity and Face Similarity Measurement

ΓΙΑΝΝΟΥΛΑ Σ. ΦΛΩΡΟΥ Ι ΑΚΤΟΡΑΣ ΤΟΥ ΤΜΗΜΑΤΟΣ ΕΦΑΡΜΟΣΜΕΝΗΣ ΠΛΗΡΟΦΟΡΙΚΗΣ ΤΟΥ ΠΑΝΕΠΙΣΤΗΜΙΟΥ ΜΑΚΕ ΟΝΙΑΣ ΒΙΟΓΡΑΦΙΚΟ ΣΗΜΕΙΩΜΑ

Kernel Methods and their Application for Image Understanding

Big Data/Business Intelligence

Στοιχεία εισηγητή Ημερομηνία: 10/10/2017

1530 ( ) 2014,54(12),, E (, 1, X ) [4],,, α, T α, β,, T β, c, P(T β 1 T α,α, β,c) 1 1,,X X F, X E F X E X F X F E X E 1 [1-2] , 2 : X X 1 X 2 ;


R k = r k x r k y r k z

substructure similarity search using features in graph databases

Bayesian Discriminant Feature Selection

Bundle Adjustment for 3-D Reconstruction: Implementation and Evaluation

ΔΙΠΛΩΜΑΤΙΚΕΣ ΕΡΓΑΣΙΕΣ ΠΜΣ «ΠΛΗΡΟΦΟΡΙΚΗ & ΕΠΙΚΟΙΝΩΝΙΕς» OSWINDS RESEARCH GROUP

Current Status and Future Prospects of Camera-Based Character Recognition and Document Image Analysis

1181 (real-timespeechdriven) 1 1 ( ) D FAP FAP (voiceactivationdetectionvad) D FaceGen 3- D XfaceEd MPEG-4 1 FAP 66 FAP ( ) FAP 84

Ανάλυση σχημάτων βασισμένη σε μεθόδους αναζήτησης ομοιότητας υποακολουθιών (C589)

Real istic Human Head Model ing with Hairstyle Reconstruction

No. 7 Modular Machine Tool & Automatic Manufacturing Technique. Jul TH166 TG659 A

ΔΙΠΛΩΜΑΤΙΚΕΣ ΕΡΓΑΣΙΕΣ ΠΜΣ «ΠΛΗΡΟΦΟΡΙΚΗ & ΕΠΙΚΟΙΝΩΝΙΕΣ» OSWINDS RESEARCH GROUP

Method to Distinguish between Handwritten and Machine-printed Characters Inspired by Human Vision System

38 8 Vol. 38, No ACTA AUTOMATICA SINICA August, , (Graphic processing unit, GPU). : 1) GPU Based Fast 3D-Object Modeling with Kinect

Anomaly Detection with Neighborhood Preservation Principle

Development of a basic motion analysis system using a sensor KINECT

SocialDict. A reading support tool with prediction capability and its extension to readability measurement

JOURNAL OF APPLIED SCIENCES Electronics and Information Engineering. Cyclic MUSIC DOA TN (2012)

Estimation of stability region for a class of switched linear systems with multiple equilibrium points

Ανάλυση, Περιγραφή και Ανάκτηση Μουσικών Δεδομένων: το έργο ΠΟΛΥΜΝΙΑ*

[4] 1.2 [5] Bayesian Approach min-max min-max [6] UCB(Upper Confidence Bound ) UCT [7] [1] ( ) Amazons[8] Lines of Action(LOA)[4] Winands [4] 1

Βιογραφικό Σημείωμα Γεώργιος Παπαϊωάννου

Estimation, Evaluation and Guarantee of the Reverberant Speech Recognition Performance based on Room Acoustic Parameters

{takasu, Conditional Random Field

Wireless capsule endoscopy video classification using an unsupervised learning approach

Reading Order Detection for Text Layout Excluded by Image

Applying Markov Decision Processes to Role-playing Game

Adaptive grouping difference variation wolf pack algorithm

DECO DECoration Ontology

Quick algorithm f or computing core attribute

Development of the Nursing Program for Rehabilitation of Woman Diagnosed with Breast Cancer

J. of Math. (PRC) Banach, , X = N(T ) R(T + ), Y = R(T ) N(T + ). Vol. 37 ( 2017 ) No. 5

ΟΙΚΟΝΟΜΙΚΟ ΠΑΝΕΠΙΣΤΗΜΙΟ ΑΘΗΝΩΝ

Αυτόματη Ανακατασκευή Θραυσμένων Αντικειμένων

Ένα µοντέλο Ισοδύναµης Χωρητικότητας για IEEE Ασύρµατα Δίκτυα. Εµµανουήλ Καφετζάκης

[2], [3], [8], [20] [4] [6], [18] [1], [11], [19] [13] [10] N SVD PCA N SVD Vasilescu Vasilescu N SVD [14] [17] Y Li [7] Y Li N SVD [12] 2,,,,, 596

Nov Journal of Zhengzhou University Engineering Science Vol. 36 No FCM. A doi /j. issn

BCI On Feature Extraction from Multi-Channel Brain Waves Used for Brain Computer Interface

ΕΚΘΕΣΗ ΠΡΟΟ ΟΥ Υποψήφιος ιδάκτορας: Ιωάννης Κυριαζής

Τεχνολογίες Πληροφορίας & Επικοινωνιών στην Εκπαίδευση (ΤΠΕ-Ε)

HMY 795: Αναγνώριση Προτύπων

ΤΕΙ ΘΕΣΣΑΛΙΑΣ. Αναγνώριση προσώπου με επιλογή των κατάλληλων κυρίων συνιστωσών. ΤΜΗΜΑ ΜΗΧΑΝΙΚΩΝ ΠΛΗΡΟΦΟΡΙΚΗΣ Τ.Ε ΚΑΒΒΑΔΙΑ ΑΛΕΞΑΝΔΡΟΥ.

Re-Pair n. Re-Pair. Re-Pair. Re-Pair. Re-Pair. (Re-Merge) Re-Merge. Sekine [4, 5, 8] (highly repetitive text) [2] Re-Pair. Blocked-Repair-VF [7]

Matrices and vectors. Matrix and vector. a 11 a 12 a 1n a 21 a 22 a 2n A = b 1 b 2. b m. R m n, b = = ( a ij. a m1 a m2 a mn. def

Gait Identification Using a View Transformation Model in the Frequency Domain

Psychological Evaluation on Interactions between People and Robot

(hidden Markov model: HMM) FUNDAMENTALS OF SPEECH SYNTHESIS BASED ON HMM. Keiichi Tokuda. Department of Computer Science

ΒΙΟΓΡΑΦΙΚΟ ΣΗΜΕΙΩΜΑ ΛΕΩΝΙΔΑΣ Α. ΣΠΥΡΟΥ Διδακτορικό σε Υπολογιστική Εμβιομηχανική, Τμήμα Μηχανολόγων Μηχανικών, Πανεπιστήμιο Θεσσαλίας.

Text Mining using Linguistic Information

y = f(x)+ffl x 2.2 x 2X f(x) x x p T (x) = 1 Z T exp( f(x)=t ) (2) x 1 exp Z T Z T = X x2x exp( f(x)=t ) (3) Z T T > 0 T 0 x p T (x) x f(x) (MAP = Max

GPGPU. Grover. On Large Scale Simulation of Grover s Algorithm by Using GPGPU

ΕΙΣΑΓΩΓΗ. 1.1 Εισαγωγή

Data Visualization. Πληροφορική Πολυμέσα. Γραφικές Τέχνες - Πολυμέσα Ελληνικό Ανοικτό Πανεπιστήμιο

ΒΙΟΓΡΑΦΙΚΟ ΣΗΜΕΙΩΜΑ ΣΤΥΛΙΑΝΗΣ Κ. ΣΟΦΙΑΝΟΠΟΥΛΟΥ Αναπληρώτρια Καθηγήτρια. Τµήµα Τεχνολογίας & Συστηµάτων Παραγωγής.

Three Dimensional Signed Eucl idean Distance Transf orm and Its Appl ications

ΕΚΘΕΣΗ ΠΡΟΟ ΟΥ Υποψήφιος ιδάκτορας: Ιωάννης Κυριαζής

UAV. UAV Unmanned Aerial Vehicle LED Light Emitting Diodes LQR Linear Quadratic Regulator

The State of the Art and Difficulties in Automatic Chinese Word Segmentation

Transcript:

MachineDancing: 1 1 3 MachineDancing GP 1. 33DCG 3D CG MikuMikuDance (MMD) *1 MMD MMD ( ) () 3D 1 National Institute of Advanced Industrial Science and Technology (AIST) *1 http://www.geocities.jp/higuchuu4 1 MachineDancing: Kinect ( ) MachineDancing c 2014 Information Processing Society of Japan 1

[1][13] [5][14] [11] [12] 2 [3] 1 [8] HMM [11] HMM [16] (DBN) [7] (HDP-HMM) [17] Kernel Canonical Correlation Analysis (KCCA) [4] (GP) [9] Topological Gesture Analysis (TGA) [10][15] Gaussian Process Dynamical Models (GPDM) [18] Multi-layer Joint Gait-Pose Manifolds (multi-layer JGPM) [2] GP MachineDancing 1. HMM 2. GP 1 MachineDancing HMM 2. 2 3D koron [19] 2.1 2.1.1 MFCC 16 MFCC MFCC 16 32 2.1.2 3 (x 1,x 2,x 3 ) c 2014 Information Processing Society of Japan 2

2 3 3 3 3 3 3 (θ x,θ y,θ z ) Q (q 1,q 2,q 3 ) q 0 Q = q 0 + q 1 i + q 2 j + q 3 k i 2 = j 2 = k 2 = ijk = 1 ij = ji = k jk = kj = i ki = ik = j 4 20 2 20 3 3 19 4 1 79 2.2 2.2.1 A 2 A 3 : 0.5 1 1 1.0 1 4 4 1 4 1 3 4 1.5 3 0.5 1 1.0 2.2.2 n D n v n D n d [d 1,,d s,,d S ] T d s 79 S c 2014 Information Processing Society of Japan 3

S D n v n P (D n v n ) P (v n v n 1 ) N{D n } N {v n } N P ( {D n } N,{v n } N ) N = P (D n v n )P (v n v n 1 )(1) P (v 1 v 0 ) = P (v 1 ) 0 HMM ( ) P (D n v n )P (v n v n 1 ) P (v 1 )P (D n v n ) 2.2.3 2 D 1 D 2 2 D [d 1,,d S ] 11.0 d s = f (t s ) f t s d s ( 0.5 < t s < 1.0) f 2 d s f (t s ) d s f (t s ) 2 N ( 0,σ 2 I ) (2) N ( 0,σ 2 I ) σ 2 I f (t) φ j (t) f (t s ) = J a j φ j (t s ) (3) j=1 {a j } J j=1 0 Σ p a = (a 1,,a J ) N (0,Σ p ) (4) Representer φ j f3 J D RBF φ tj (t) = k(t,t j ) = exp ( λ2 ) t t j 2 (5) 2 P (D,a) = P (D a)p (a) a = P (D t;σ,λ) ( 1 exp (2π) 1 t α K α 2 tr( D T K 1 D )) (6) t t = (t 1,,t s,,t S ) T α d K RBF (K) ij = k(t i,t j )+σ 2 δ tit j δ 2.3 2.3.1 {v k } K k=1 L = N logp (D n t n,v ;σ vn,λ vn ) EM E-step: v σ v λ v M-step: 2 E-step M-step 2.3.1.1 E-step M {D 1,,D M } c 2014 Information Processing Society of Japan 4

v v 6 D D = [ D 1,,D M] T t t = [ t 1,,t M] T σ λ σ λ LL(σ,λ,v ) = logp (D t,v ;σ,λ) σ v λ v Scaled Conjugate Gradient [7] v v 2.3.1.2 M-step E-step {v k } K k=1 ˆD ) ˆD ˆv ˆv = argmax v logp (ˆD ˆt,v;σ v,λ v 2.3.2 D v σ v λ v = P (D v,d,t,t) 1 exp ( (2π) 12 ( tr Z Z) ) T ˆK 1 (7) t α K α Z = D AK 1 D, ˆK = B A T K 1 A (8) (A) ij = exp ( λ v 2 ) 2 t i t j 2 +σ 2 v δ t i t j (9) (B) ij = exp ( λ v 2 ) 2 t i t j 2 +σ 2 v δ tit j (10) 7 v v 2.4 2.4.1 v MFCC+ MFCC 3 4 {M n } N P (M v) {M n } N K{z k } K k=1 P (M n v) z P (M n v n ) = z = z P (M n,z v n ) P (M n z)p (z v n ) (11) P (M n z) z P (z v n ) 11P (M n z) P (z v n ) z P (z v n) = 1 2.4.2 v MFCC+ MFCC HMM (1) {v n } N P ( {M n } N,{D n } N ) = P ( {M n } N,{D n } N,{v n } N ) {v n} N = {v n} N N P (M n v n )P (D n v n )P (v n v n 1 ) (12) 2.5 P ( {M n } N,{D n } N ) c 2014 Information Processing Society of Japan 5

3. https://staff.aist.go.jp/s.fukayama/ MachineDancing/index-j.html 60 MMD vmd [19] 0.5 1.5 3D CG 4 : 1.53 4 {v n } N 2 {M n } N {v n} N {v n} N = argmax {v n} N N P (M n v n )P (v n v n 1 ) (13) HMM Viterbi {v n} N D new n v n D = [ D 1,,D M] T 3 4 4 2 koron3d 4. MachineDancing MachineDancing 3D 3D koron JST CREST OngaCREST [1] K. M. Chen, S. T. Shen and S. D. Prior: Using music and motion analysis to construct 3D animations and visualisations, Digital Creativity Vol. 19, No. 2 (2008). [2] M. Ding and G. Fan: Multi-layer joint gait-pose manc 2014 Information Processing Society of Japan 6

ifold for human motion modeling, In Proc. FG 2013 pp. 1 8, (2013). [3] R. Fan, S. Xu and W. Geng: Example-based automatic music-driven conventional dance motion synthesis, IEEE Transactions on Visualization and Computer Graphics Vol. 18, No. 3 (2012). [4] T. Hirose and T. Taniguchi: Abstraction multimodal low-dimensional representation from high-dimensional posture information and visual images, Journal of Robotics and Mechatronics Vol. 25, No. 1 (2013). [5] J. W. Kim, H. Fouad, J. L. Sibert and J. K. Hahn: Perceptually motivated automatic dance motion generation for music, Computer Animation and Virtual Worlds 2009 Vol. 20 (2009), pp. 375-384. [6] M. Lee, L. Lee and J. Park: Music similarity-based approach to generating dance motion sequence, Multimedia Tools and Applications Vol. 62, No. 3 (2013), pp. 895-912. [7] M. F. Møller: A scaled conjugate gradient algorithm for fast supervised learning, Neural Networks 6, 4 (1993), pp. 525-533. [8] K. Moon and V. Pavlović: Graphical models for human motion modelling. Human Motion, Computational Imaging and Vision Vol. 36 (2008), pp. 159-183. [9] T. Mukai and S. Kuriyama: Geostatistical Motion Interpolation, In Proc. ACM SIGGRAPH, Vol. 24, No. 3 (2005), pp. 1062-1070. [10] L. Naveda and M. Leman: The spatiotemporal representation of dance and music gestures using topological gesture analysis (TGA), Music Perception Vol. 28, No. 1 (2010), pp. 93-111. [11] F. Ofli, E. Erzin, Y. Yemez and A. M. Tekalp: Learn2dance: Learning statistical music-to-dance mappings for choreography synthesis, IEEE Transactions on Multimedia Vol. 14, No. 3 (2012). [12] S. Oore and Y. Akiyama: Learning to synthesize arm motion to music by example, In Proc. WSCG 2006 (2006), pp. 201-208. [13] C. Panagiotakis, A. Holzapfel, D. Michel and A. Argyros: A. Beat synchronous dance animation based on visual analysis of human motion and audio analysis of music tempo, In Proc. ISVC 2013 (2013), pp. 118-127. [14] T. Shiratori, A. Nakazawa and K. Ikeuchi: Synthesizing dance performance using musical and motion features, In Proc. ICRA 2006 (2006), pp. 3654-3659. [15] P. Sousa, J. L. Oliveira, L. P. Reis and F. Gouyon: Humanized robot dancing: Humanoid motion retargeting based in a metrical representation of human dance styles, In Proc. EPIA 2011 (2011), pp. 392-406. [16] T. Takeda, Y. Hirata and K. Kosuge: Dance step estimation method based on HMM for dance partner robot, IEEE Transactions on Industrial Electronics Vol. 54, No. 2 (2007). [17] T. Taniguchi, K. Hamahata and N. Iwahashi: Unsupervised segmentation of human motion data using sticky HDP-HMM and MDL-based chunking method for imitation learning, Advanced Robotics Vol. 25, No. 17(2011). [18] J. M. Wang, D. J. Fleet and A. Hertzmann: Gaussian process dynamical models for human motion, IEEE Transactions on Pattern Recognition and Machine Intelligence Vol. 30, No. 2 (2008). [19],,, M. Mauch, : Songle,, Vol. 54, No. 4 (2013), pp. 1363 1372. c 2014 Information Processing Society of Japan 7