[9, 10] [2] [4] [10] Kyoto University, Yoshida Honmachi, Sakyo, Kyoto, , Japan 2. Yamaha Corporation. Waseda University a)

Transcript:

1 Kyoto University, Yoshida Honmachi, Sakyo, Kyoto 606-8501, Japan
2 Yamaha Corporation
3 Waseda University
a) akira.maezawa@gmail.com

1.
[1-3] [4] [5-8] [9, 10] [2]

© 2014 Information Processing Society of Japan

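Figure 1: Panels "Generative Model of Music Composition" and "Generative Model of Shared Temporal Interpretation". Performances 1, 2 and 3 are drawn along a time axis, and pairs of performances are marked "similar!".

2.

2.1
The composition consists of N segments, each taking one of S states: Z = {z_n}_{n=1}^{N}. State s has emission parameter θ_s; π and τ are the initial and transition probabilities of the states.

Figure 2: Graphical model over z_1, ..., z_N, φ_{d,1}, ..., φ_{d,T_d}, x_{d,1}, ..., x_{d,T_d} and p(l | n, d).

The state sequence is ergodic:

\[ p(Z \mid \pi, \tau) = \prod_{s=1}^{S} \pi_s^{z_{1,s}} \prod_{n=2}^{N} \prod_{s=1}^{S} \prod_{s'=1}^{S} \tau_{s,s'}^{z_{n-1,s}\, z_{n,s'}} \tag{1} \]

τ and π have Dirichlet priors: τ_s ~ Dir(τ_{0,s}), π ~ Dir(π_0).

2.2
Recording d has length T_d and traverses the N segments with a left-to-right state sequence Φ_d = {φ_{d,t}}_{t=1}^{T_d}, with T_d > N. Each φ_{d,t} is indexed by a pair (n, l) with n in [1 ... N] and l in [1 ... L]; in recording d, segment n lasts l frames with probability p(l | n, d).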
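The ergodic state sequence of (1) with its Dirichlet priors can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the symmetric concentration values `pi0` and `tau0`, and the use of NumPy are assumptions made here.

```python
import numpy as np

def sample_composition(n_segments, n_states, pi0=1.0, tau0=1.0, seed=0):
    """Draw pi ~ Dir(pi0), tau_s ~ Dir(tau0) and a state sequence z_1..z_N.

    pi0 and tau0 are hypothetical symmetric concentration parameters.
    """
    rng = np.random.default_rng(seed)
    pi = rng.dirichlet(np.full(n_states, pi0))                    # initial probabilities
    tau = rng.dirichlet(np.full(n_states, tau0), size=n_states)   # one row per source state
    z = np.empty(n_segments, dtype=int)
    z[0] = rng.choice(n_states, p=pi)
    for n in range(1, n_segments):
        z[n] = rng.choice(n_states, p=tau[z[n - 1]])
    return pi, tau, z

def log_prob_sequence(z, pi, tau):
    """log p(Z | pi, tau) for an integer-coded state sequence, as in (1)."""
    return np.log(pi[z[0]]) + np.sum(np.log(tau[z[:-1], z[1:]]))

pi, tau, z = sample_composition(n_segments=8, n_states=3)
print(z, log_prob_sequence(z, pi, tau))
```

Integer-coded states are used instead of the one-hot variables z_{n,s}; the product over s and s' in (1) then reduces to indexing `tau[z[n-1], z[n]]`.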

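Figure 3: Graphical model of the duration distribution p(l | n, d), over υ, ξ, M, Λ, u_1, ..., u_N, a_1, ..., a_N and μ_1, ..., μ_N.

The distribution p(φ_{d,t}), t in {1, ..., T_d}, of the left-to-right sequence (2) is built from three kinds of factors:
- the first frame: (p(l | n, d) δ(n, 1))^{φ_{d,1,n,l}}, together with a boundary factor δ(n, S) on the last frame T_d;
- entering segment n with duration l once segment n-1 has counted down to 1: p(l | n, d)^{φ_{d,t-1,n-1,1} φ_{d,t,n,l}}, for t = 2, ..., T_d, n = 2, ..., N and l = 1, ..., L;
- counting down within a segment: φ_{d,t-1,n,l} φ_{d,t,n,l-1}, for l = 2, ..., L.

2.3
At frame t of recording d, the observation x_{dt} is emitted from the state z_n of the segment n selected by φ_{dt}:

\[ p(x_{dt} \mid z, \phi, \theta) = \prod_{n,s} p(x_{dt} \mid \theta_s)^{z_{n,s}\, \phi_{d,t,n}} \tag{3} \]

p(x | θ_s) can be any distribution over x with a conjugate prior p(θ_s; θ_0). Here p(x_{dt} | θ_s) is a dim(x_{dt})-dimensional Gaussian with a Normal-Gamma prior on θ_s, where θ_s = {μ_s, λ_s} and θ_0 = {m_0, ν_0, u_0, k_0}:

\[ x_{dt} \mid \mu_s, \lambda_s \sim \mathcal{N}(\mu_s, \lambda_s^{-1}), \qquad \mu_{s,i}, \lambda_{s,i} \sim \mathcal{NG}(m_{0,i}, \nu_{0,i}, u_{0,i}, k_{0,i}) \]

3.
This section specifies the duration distribution p(l | n, d) of Section 2.2. Let l_{n,d} be the duration of segment n in recording d, and let segment n have the variables a_n and μ_n = [μ_{n,1}, ..., μ_{n,D}]. The durations l_n are generated from the product a_n μ_n:

\[ p(l_n \mid a_n, \mu_n, \lambda) = \mathcal{N}(l_n \mid a_n \mu_n, \lambda^{-1}) \tag{4} \]

\[ p(a_n \mid \kappa, \iota) = \mathcal{N}(a_n \mid \kappa, \iota^{-1}) \tag{5} \]

where κ and ι are hyperparameters.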

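Figure 4: Plots of μ, a and the product aμ (panels for μ_1, μ_2, μ_3 and for aμ_1, aμ_2).

μ_n follows a mean-reverting AR process:

\[ \mu_n = \mu_{n-1} - \alpha(\mu_n - 1) + \epsilon_n \tag{6} \]

where ε_n is Gaussian noise with precision Λ_n and α controls how strongly μ_n reverts to 1.

3.2
The precision Λ_n in (6) is chosen from M candidates Λ_1, ..., Λ_M by a discrete state u_n, whose initial and transition probabilities are υ and ξ.

Figure 5: Example with M = 3.

\[ p(u \mid \xi, \upsilon) = \prod_{m=1}^{M} \upsilon_m^{u_{1,m}} \prod_{n=2}^{N} \prod_{m=1}^{M} \prod_{m'=1}^{M} \xi_{m,m'}^{u_{n-1,m}\, u_{n,m'}} \tag{7} \]

Given u_n, μ follows a switching-state Kalman filter:

\[ p(\mu_n \mid \mu_{n-1}, \Lambda, u_n) = \mathcal{N}\!\left(\mu_n \,\middle|\, \frac{\mu_{n-1}}{1+\alpha} + \frac{\alpha}{1+\alpha},\; \Lambda_{u_n}^{-1}\right) \tag{8} \]

ξ and υ have Dirichlet priors, and Λ has a Wishart prior W(n_0, W_0).

The complete model of p(l | n, d) from Section 2, equation (9), is the product of the following factors: the duration likelihoods N(l_{n,d} | a_n μ_{n,d}, λ_d^{-1}) over n and d, the transition densities of μ_n in (8), the switching-state probabilities υ_{u_1} and ξ_{u_{n-1},u_n} of (7), the prior on a_n and λ, and the priors Dir(υ; υ_0), Dir(ξ; ξ_0) and W(Λ; n_0, W_0).

3.3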
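The mean-reverting behaviour can be checked numerically. The sketch below assumes a scalar μ_n whose conditional mean is μ_{n-1}/(1+α) + α/(1+α), i.e. βμ_{n-1} + γ with β = 1/(1+α) and γ = α/(1+α); the function name, the scalar simplification and the fixed noise standard deviation are assumptions for illustration only.

```python
import numpy as np

def simulate_mean_reverting(n_segments, alpha, noise_std, mu_init=1.0, seed=0):
    """Simulate mu_n = beta * mu_{n-1} + gamma + noise for a scalar mu.

    beta = 1 / (1 + alpha) and gamma = alpha / (1 + alpha), so the fixed
    point gamma / (1 - beta) equals 1: the process reverts to 1.
    """
    rng = np.random.default_rng(seed)
    beta = 1.0 / (1.0 + alpha)
    gamma = alpha / (1.0 + alpha)
    mu = np.empty(n_segments)
    mu[0] = mu_init
    for n in range(1, n_segments):
        mu[n] = beta * mu[n - 1] + gamma + noise_std * rng.standard_normal()
    return mu

# Without noise, a value started away from 1 decays geometrically back to 1.
trajectory = simulate_mean_reverting(200, alpha=0.5, noise_std=0.0, mu_init=2.0)
print(trajectory[:4], trajectory[-1])
```

A larger α gives a smaller β, so deviations from 1 are pulled back in fewer segments.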

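The posterior is approximated by a factorized distribution q that minimizes the KL divergence to the true posterior:

\[ q(\phi, z, \theta, \pi, \tau, \mu, a, u, \upsilon, \xi, \Lambda) = \prod_d q(\phi_{d,\cdot})\; q(z)\, q(\pi) \prod_s \bigl(q(\theta_s)\, q(\tau_s)\bigr)\; q(\mu)\, q(u)\, q(\upsilon) \prod_n \bigl(q(a_n)\, q(\xi_n)\bigr)\; q(\Lambda) \tag{10} \]

q(φ) and q(z) are computed with the forward-backward algorithm for HMMs. For observations x_t, emission likelihood O_{t,n} and transition probability T_{n',n}, the forward message p(t_n | x_{1:t}) is proportional to O_{t,n} Σ_{n'} T_{n',n} p((t-1)_{n'} | x_{1:t-1}), the backward message is p(x_{t+1:T} | t_n) = Σ_{n'} p(x_{t+2:T} | (t+1)_{n'}) O_{t+1,n'} T_{n,n'}, and the posterior of state n at time t is proportional to p(x_{t+1:T} | t_n) p(t_n | x_{1:t}).

q(z): an HMM with emission g_n and transition v, where ⟨f(x)⟩ denotes the expectation of f(x) under q:

\[ \log g_{n,s} = \sum_{d,t} \langle \phi_{d,t,n} \rangle \langle \log p(x_{d,t} \mid \theta_s) \rangle \tag{11} \]
\[ \log v_{s,s'} = \langle \log \tau_{s,s'} \rangle \tag{12} \]

π: q(π) = Dir(π_0 + ⟨z_1⟩).
τ: q(τ_s) = Dir(τ_{0,s} + Σ_{n>1}^{N} ⟨z_{n-1,s} z_n⟩).

q(φ_{d,t}): as for q(z), an HMM with emission h_{d,n} and transition w_{d,(n',l'),(n,l)} from (n', l') to (n, l):

\[ \log h_{d,t,n} = \sum_s \langle z_{n,s} \rangle \langle \log p(x_{d,t} \mid \theta_s) \rangle \tag{13} \]
\[ \log w_{d,(n',l'),(n,l)} = \begin{cases} 0 & l' > 1,\; n' = n \\ E_{l,n} & l' = 1,\; n = n' + 1 \\ -\infty & \text{otherwise} \end{cases} \tag{14} \]
\[ E_{l,n} = -\tfrac{1}{2} \lambda\, (l_{nd} - a_n \mu_{nd})^2 + \tfrac{D}{2} \log \lambda \tag{15} \]

θ_s:

\[ q(\mu_s, \lambda_s) = \mathcal{NG}\!\left( \frac{\nu_0 m_0 + N_s \bar{\mu}_s}{\nu_0 + N_s},\; \nu_0 + N_s,\; u_0 + \frac{N_s}{2},\; k_0 + \frac{1}{2}\Bigl( N_s \Sigma_s + \frac{\nu_0 N_s}{\nu_0 + N_s} (\bar{\mu}_s - m_0)^2 \Bigr) \right) \tag{16} \]

with the statistics N_s, \bar{μ}_s and Σ_s:

\[ N_s = \sum_{d,n,t} \langle z_{n,s} \rangle \langle \phi_{d,t,n} \rangle \tag{17} \]
\[ \bar{\mu}_s = \frac{1}{N_s} \sum_{d,n,t} \langle z_{n,s} \rangle \langle \phi_{d,t,n} \rangle\, x_{d,t} \tag{18} \]
\[ \Sigma_s = \frac{1}{N_s} \sum_{d,n,t} \langle z_{n,s} \rangle \langle \phi_{d,t,n} \rangle\, (x_{d,t} - \bar{\mu}_s)^2 \tag{19} \]

Switching-state Kalman filter [11]: μ is inferred with a Kalman smoother and u with an HMM, using the statistics

\[ X_{ndl} = \sum_{t=2}^{T_d} \langle \phi_{d,t-1,n-1,1}\; \phi_{d,t,n,l} \rangle \tag{20} \]
\[ C_{nd} = \sum_{l=1}^{L} X_{ndl} \tag{21} \]
\[ M_{nd} = \frac{1}{C_{nd}} \sum_{l=1}^{L} l\, X_{ndl} \tag{22} \]

u: an HMM with emission O_{n,m} and transition T_{m,m'}:

\[ \log O_{n,m} = -\tfrac{1}{2} \operatorname{tr}\bigl(\langle \Gamma_n \rangle \langle \Lambda_m \rangle\bigr) + \tfrac{1}{2} \langle \log \det \Lambda_m \rangle \tag{23} \]
\[ \log T_{m,m'} = \langle \log \xi_{m,m'} \rangle \tag{24} \]

υ: q(υ) = Dir(υ_0 + ⟨u_1⟩).
ξ: q(ξ_{m,:}) = Dir(ξ_0 + Σ_{n>1}^{N} ⟨u_{n-1,m} u_{n,:}⟩).

μ: the forward density p(μ_n | X_{1:n}) = N(μ_n | g_n, V_n) is computed with the Kalman smoother. g and V are obtained using:

\[ \gamma = \frac{\alpha}{1+\alpha} \tag{25} \]
\[ \beta = \frac{1}{1+\alpha} \tag{26} \]
\[ \Gamma_n = (\mu_n - \beta\mu_{n-1} - \gamma)(\mu_n - \beta\mu_{n-1} - \gamma)^{T} \tag{27} \]
\[ S_n = \sum_{m=1}^{M} \langle u_{n,m} \rangle \langle \Lambda_m \rangle \tag{28} \]
\[ A_n = \operatorname{diag}(a_n) \tag{29} \]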
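The Normal-Gamma update (16) with the weighted statistics (17)-(19) can be sketched for a single state s and one feature dimension. The weights `w` stand in for the products ⟨z_{n,s}⟩⟨φ_{d,t,n}⟩ flattened over (d, n, t); that flattening, the function name and the scalar restriction are assumptions of this sketch.

```python
import numpy as np

def normal_gamma_update(x, w, m0, nu0, u0, k0):
    """Posterior Normal-Gamma parameters for weighted scalar observations.

    x : observations x_{d,t} (1-D array)
    w : responsibilities of the state for each observation (same shape)
    Returns (m, nu, u, k) in the parameter order of equation (16).
    """
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    n_s = w.sum()                                    # (17)
    mean_s = (w * x).sum() / n_s                     # (18)
    var_s = (w * (x - mean_s) ** 2).sum() / n_s      # (19)
    m = (nu0 * m0 + n_s * mean_s) / (nu0 + n_s)
    nu = nu0 + n_s
    u = u0 + n_s / 2.0
    k = k0 + 0.5 * (n_s * var_s + nu0 * n_s / (nu0 + n_s) * (mean_s - m0) ** 2)
    return m, nu, u, k

print(normal_gamma_update([1.0, 2.0, 3.0], [1.0, 1.0, 1.0], m0=0.0, nu0=1.0, u0=1.0, k0=1.0))
```

With unit weights this reduces to the textbook conjugate update; fractional weights let each frame contribute in proportion to how strongly it is assigned to state s.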

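\[ P_n = V_{n-1}^{-1} + \beta^2 S_n \tag{30} \]
\[ V_n^{-1} = S_n + A_n \Lambda_n A_n - \beta^2 S_n P_n^{-1} S_n \tag{31} \]
\[ g_n = V_n \Bigl( \beta S_n P_n^{-1} \bigl( V_{n-1}^{-1} g_{n-1} - \beta S_n \gamma \bigr) + S_n \gamma + A_n \Lambda_n M_n \Bigr) \tag{32} \]

The backward density p(X_{n+1:N} | μ_n) ∝ N(μ_n | h_n, W_n) is obtained in the same way; h and W are:

\[ Q_n = W_n^{-1} + S_n + A_n \Lambda_n A_n \tag{33} \]
\[ W_{n-1}^{-1} = \beta^2 S_n \bigl( I - Q_n^{-1} S_n \bigr) \tag{34} \]
\[ h_{n-1} = \beta\, W_{n-1} S_n \Bigl( Q_n^{-1} \bigl( W_n^{-1} h_n + S_n \gamma + A_n \Lambda_n M_n \bigr) - \gamma \Bigr) \tag{35} \]

μ_n:

\[ q(\mu_n) = \mathcal{N}\bigl( U_n ( V_n^{-1} g_n + W_n^{-1} h_n ),\; U_n \bigr), \qquad U_n = ( V_n^{-1} + W_n^{-1} )^{-1} \tag{36} \]

Computing ⟨Γ_n⟩ requires the second moments of μ_n with itself and with μ_{n-1}; their covariances are:

\[ \operatorname{cov}(\mu_n, \mu_n) = U_n \tag{37} \]
\[ \operatorname{cov}(\mu_n, \mu_{n-1}) = \beta P_n^{-1} S_n \bigl( Q_n + \beta^2 S_n^{T} P_n^{-1} S_n \bigr)^{-1} \tag{38} \]

a:

\[ q(a_n) = \mathcal{N}\!\left( \frac{\iota\kappa + \sum_{d=1}^{D} \lambda_d C_{nd}\, \mu_{nd} M_{nd}}{\iota + \sum_{d=1}^{D} \lambda_d C_{nd}\, \mu_{nd}^2},\; \Bigl( \iota + \sum_{d=1}^{D} \lambda_d C_{nd}\, \mu_{nd}^2 \Bigr)^{-1} \right) \tag{39} \]

Λ:

\[ q(\Lambda_m) = \mathcal{W}\!\left( n_0 + \sum_{n=1}^{N} \langle u_{n,m} \rangle,\; \Bigl( W_0^{-1} + \sum_{n=1}^{N} \langle u_{n,m} \rangle \langle \Gamma_n \rangle \Bigr)^{-1} \right) \tag{40} \]

The updates alternate between the HMM and the Kalman smoother.

4.

4.1
Data: Chopin Mazurka recordings [1]. Features: chroma vectors [12] and Δ-chroma (24 dimensions), from audio sampled at 44.1 kHz (frame parameters: 892 764).

Compared methods: DTW [13], and 1) HSMM replaced by an HMM, 2) HSMM "Independent", 3) "Coupled", 4) "Coupled+Dynamic".

Hyperparameters set: α, W_0, n_0, λ, ι, κ, ξ_0, υ_0.

Figure 6: Alignment error by percentile (10%, 30%, 50%, 70%, 90%) for DTW, HMM, Independent, Coupled and Coupled+Dynamic. * marks a Kruskal-Wallis test against DTW (p = .05).

4.2
J. S. Bach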
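The update for a_n in (39) is a standard conjugate Gaussian update, sketched below for one segment n. The function name and the treatment of μ_{nd} as fixed point estimates (rather than expectations under q) are assumptions made for this illustration.

```python
import numpy as np

def update_a(kappa, iota, lam, C, mu, M):
    """Posterior mean and variance of a_n, following the form of (39).

    kappa, iota : prior mean and prior precision of a_n
    lam, C, mu, M : arrays over recordings d holding lambda_d, C_nd,
                    mu_nd and M_nd for one segment n
    """
    lam, C, mu, M = (np.asarray(v, dtype=float) for v in (lam, C, mu, M))
    precision = iota + np.sum(lam * C * mu ** 2)
    mean = (iota * kappa + np.sum(lam * C * mu * M)) / precision
    return mean, 1.0 / precision

# Two recordings whose mean durations M_nd are exactly 2 * mu_nd:
mean, var = update_a(kappa=1.0, iota=1e-3,
                     lam=[1.0, 1.0], C=[5.0, 5.0],
                     mu=[1.0, 1.2], M=[2.0, 2.4])
print(mean, var)
```

With no duration evidence (all C_nd equal to 0) the function returns the prior (κ, 1/ι); as the evidence grows, the mean moves toward the value of a_n that best explains M_nd ≈ a_n μ_nd.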

Figure 7: Phrasing. Groups of recordings: d ∈ {5,6,7,8}+8 vs. d ∈ {1,2,3,4}+8; d ∈ {3,4,7,8}+8 vs. d ∈ {1,2,5,6}+8; d ∈ {2,4,6,8}+8 vs. d ∈ {1,3,5,7}+8. Legend: 0: slow, 1: medium, 2: fast.

Figure 8: Estimated u and Λ, with Λ displayed as sgn(Λ) log abs(Λ).

5.

References
[1] Sapp, C. S.: Comparative Analysis of Multiple Musical Performances, ISMIR, pp. 2 5 (2007).
[2] Stowell, D. and Chew, E.: Maximum a Posteriori Estimation of Piecewise Arcs in Tempo Time-Series, From Sounds to Music and Emotions, LNCS 7900, Springer, pp. 387-399 (2013).
[3] Konz, V.: Automated Methods for Audio-Based Music Analysis with Applications to Musicology, PhD Thesis, Saarland University (2012).
[4] Miki, S., Baba, T. and Katayose, H.: PEVI: Interface for retrieving and analyzing expressive musical performances with scape plots, SMC, pp. 748-753 (2013).
[5] Raphael, C.: A Hybrid Graphical Model for Aligning Polyphonic Audio with Musical Scores, ISMIR, pp. 387-394 (2004).
[6] Cont, A.: A Coupled Duration-Focused Architecture for Real-Time Music-to-Score Alignment, PAMI, Vol. 32, No. 6, pp. 974-987 (2010).
[7] Otsuka, T., Nakadai, K., Ogata, T. and Okuno, H. G.: Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models, ISMIR, pp. 525-530 (2011).
[8] Sako, S., Yamamoto, R. and Kitamura, T.: Ryry: A Real-Time Score-Following Automatic Accompaniment Playback System Capable of Real Performances with Errors, Repeats and Jumps, AMT, pp. 134-145 (2014).
[9] Miotto, R., Montecchio, N. and Orio, N.: Statistical Music Modeling Aimed at Identification and Alignment, AdMIRe, pp. 187-212 (2010).
[10] 2014-MUS-103 (2014).
[11] Ghahramani, Z. and Hinton, G. E.: Variational learning for switching state-space models, Neural Computation, Vol. 12, pp. 963-996 (1998).
[12] Fujishima, T.: Realtime Chord Recognition of Musical Sound: A System Using Common Lisp Music, ICMC, pp. 464-467 (1999).
[13] Hu, N., Dannenberg, R. B. and Tzanetakis, G.: Polyphonic Audio Matching and Alignment for Music Retrieval, WASPAA, pp. 185-188 (2003).