Higher-Order Correlation Analysis of Pitch Fluctuations in Sustained Normal Vowels by the Method of Surrogate Data

Σχετικά έγγραφα
Buried Markov Model Pairwise

Optimization, PSO) DE [1, 2, 3, 4] PSO [5, 6, 7, 8, 9, 10, 11] (P)

Fourier transform, STFT 5. Continuous wavelet transform, CWT STFT STFT STFT STFT [1] CWT CWT CWT STFT [2 5] CWT STFT STFT CWT CWT. Griffin [8] CWT CWT

An Automatic Modulation Classifier using a Frequency Discriminator for Intelligent Software Defined Radio

n 1 n 3 choice node (shelf) choice node (rough group) choice node (representative candidate)

Vol.4-DCC-8 No.8 Vol.4-MUS-5 No.8 4// 3 3 Hanning (T ) 3 Hanning 3T (y(t)w(t)) dt =.5 T y (t)dt. () STRAIGHT F 3 TANDEM-STRAIGHT[] 3 F F 3 [] F []. :

1 (forward modeling) 2 (data-driven modeling) e- Quest EnergyPlus DeST 1.1. {X t } ARMA. S.Sp. Pappas [4]

VOCODER VOCODER Vocal

High order interpolation function for surface contact problem

VBA Microsoft Excel. J. Comput. Chem. Jpn., Vol. 5, No. 1, pp (2006)

3: A convolution-pooling layer in PS-CNN 1: Partially Shared Deep Neural Network 2.2 Partially Shared Convolutional Neural Network 2: A hidden layer o

ΔΙΠΛΩΜΑΤΙΚΗ ΕΡΓΑΣΙΑ. του Φοιτητή του Τμήματος Ηλεκτρολόγων Μηχανικών και Τεχνολογίας Υπολογιστών της Πολυτεχνικής Σχολής του Πανεπιστημίου Πατρών

SNR F0 [2], [3], [4] F0 F0 F0 F0 F0 TUSK F0 TUSK F0 6 TUSK 6 F0 2. F0 F0 [5] [6] [7] p[8] Cepstrum [9], [10] [11] [12] [13] F0 [14] F0 [15] DIO[16] [1

Quantitative chemical analyses of rocks with X-ray fluorescence analyzer: major and trace elements in ultrabasic rocks

ΑΠΟΔΟΣΗ ΕΛΕΓΧΩΝ ΓΡΑΜΜΙΚΗΣ ΤΑΣΗΣ ΧΡΟΝΟΣΕΙΡΩΝ ΜΕ ΧΡΗΣΗ ΤΕΧΝΙΚΩΝ ΤΥΧΑΙΟΠΟΙΗΣΗΣ

Comparison of Evapotranspiration between Indigenous Vegetation and Invading Vegetation in a Bog

Stabilization of stock price prediction by cross entropy optimization

Analysis of prosodic features in native and non-native Japanese using generation process model of fundamental frequency contours

Aquinas College. Edexcel Mathematical formulae and statistics tables DO NOT WRITE ON THIS BOOKLET

CorV CVAC. CorV TU317. 1

P É Ô Ô² 1,2,.. Ò± 1,.. ±μ 1,. ƒ. ±μ μ 1,.Š. ±μ μ 1, ˆ.. Ê Ò 1,.. Ê Ò 1 Œˆ ˆŸ. ² μ Ê ² μ Ì μ ÉÓ. É μ ±, Ì μé μ Ò É μ Ò ² μ Ö

Supplementary Materials for Evolutionary Multiobjective Optimization Based Multimodal Optimization: Fitness Landscape Approximation and Peak Detection

C F E E E F FF E F B F F A EA C AEC

3.8.1 J (7) (1883~1906) (1907~1931) A ~ (10) i J C-1 ~1973 C-2

[4] 1.2 [5] Bayesian Approach min-max min-max [6] UCB(Upper Confidence Bound ) UCT [7] [1] ( ) Amazons[8] Lines of Action(LOA)[4] Winands [4] 1

{takasu, Conditional Random Field

ΚΑΤΑΣΚΕΥΑΣΤΙΚΟΣ ΤΟΜΕΑΣ

,,, (, ) , ;,,, ; -

MIDI [8] MIDI. [9] Hsu [1], [2] [10] Salamon [11] [5] Song [6] Sony, Minato, Tokyo , Japan a) b)

*,* + -+ on Bedrock Bath. Hideyuki O, Shoichi O, Takao O, Kumiko Y, Yoshinao K and Tsuneaki G

Design and Fabrication of Water Heater with Electromagnetic Induction Heating

Retrieval of Seismic Data Recorded on Open-reel-type Magnetic Tapes (MT) by Using Existing Devices

Studies on the Binding Mechanism of Several Antibiotics and Human Serum Albumin

Bundle Adjustment for 3-D Reconstruction: Implementation and Evaluation

Στοιχεία εισηγητή Ημερομηνία: 10/10/2017

ΔΗΜΟΣΙΕΥΣΕΙΣ σε περιοδικά με κριτές

BCI On Feature Extraction from Multi-Channel Brain Waves Used for Brain Computer Interface

3-dimensional motion simulation of a ship in waves using composite grid method

Vol. 31,No JOURNAL OF CHINA UNIVERSITY OF SCIENCE AND TECHNOLOGY Feb

GS3. A liner offset equation of the volumetric water content that capacitance type GS3 soil moisture sensor measured

Anomaly Detection with Neighborhood Preservation Principle

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

Outline Analog Communications. Lecture 05 Angle Modulation. Instantaneous Frequency and Frequency Deviation. Angle Modulation. Pierluigi SALVO ROSSI

Estimation, Evaluation and Guarantee of the Reverberant Speech Recognition Performance based on Room Acoustic Parameters

Ηλεκτρονικοί Υπολογιστές IV

Journal of the Institute of Science and Engineering. Chuo University

Japanese Fuzzy String Matching in Cooking Recipes

Evolution of Novel Studies on Thermofluid Dynamics with Combustion

( ) , ) , ; kg 1) 80 % kg. Vol. 28,No. 1 Jan.,2006 RESOURCES SCIENCE : (2006) ,2 ,,,, ; ;

* ** *** *** Jun S HIMADA*, Kyoko O HSUMI**, Kazuhiko O HBA*** and Atsushi M ARUYAMA***

Scrub Nurse Robot: SNR. C++ SNR Uppaal TA SNR SNR. Vain SNR. Uppaal TA. TA state Uppaal TA location. Uppaal

X g 1990 g PSRB

A life-table metamodel to support management of data deficient species, exemplified in sturgeons and shads. Electronic Supplementary Material

Cable Systems - Postive/Negative Seq Impedance

EM Baum-Welch. Step by Step the Baum-Welch Algorithm and its Application 2. HMM Baum-Welch. Baum-Welch. Baum-Welch Baum-Welch.

IPSJ SIG Technical Report Vol.2014-CE-127 No /12/6 CS Activity 1,a) CS Computer Science Activity Activity Actvity Activity Dining Eight-He

Supporting Information

Eects of Gas-Surface Interaction Model in Hypersonic Rareed Gas Flow

Resurvey of Possible Seismic Fissures in the Old-Edo River in Tokyo

([28] Bao-Feng Feng (UTP-TX), ( ), [20], [16], [24]. 1 ([3], [17]) p t = 1 2 κ2 T + κ s N -259-

Ψηφιακή Επεξεργασία Φωνής

90 [, ] p Panel nested error structure) : Lagrange-multiple LM) Honda [3] LM ; King Wu, Baltagi, Chang Li [4] Moulton Randolph ANOVA) F p Panel,, p Z

Experimental Study of Dielectric Properties on Human Lung Tissue

Development of a Seismic Data Analysis System for a Short-term Training for Researchers from Developing Countries

Reaction of a Platinum Electrode for the Measurement of Redox Potential of Paddy Soil

Hydrologic Process in Wetland

GUI

1,a) 1,b) 2 3 Sakriani Sakti 1 Graham Neubig 1 1. A Study on HMM-Based Speech Synthesis Using Rich Context Models

Biostatistics for Health Sciences Review Sheet

: Monte Carlo EM 313, Louis (1982) EM, EM Newton-Raphson, /. EM, 2 Monte Carlo EM Newton-Raphson, Monte Carlo EM, Monte Carlo EM, /. 3, Monte Carlo EM

Transient Voltage Suppression Diodes: 1.5KE Series Axial Leaded Type 1500 W

difluoroboranyls derived from amides carrying donor group Supporting Information

Numerical Methods for Civil Engineers. Lecture 10 Ordinary Differential Equations. Ordinary Differential Equations. d x dx.

Trace gas emissions from soil ecosystems and their implications in the atmospheric environment

Statistical Inference I Locally most powerful tests

ER-Tree (Extended R*-Tree)

Prey-Taxis Holling-Tanner

stability and aromaticity in the benzonitrile H 2 O complex with Na+ or Cl

1181 (real-timespeechdriven) 1 1 ( ) D FAP FAP (voiceactivationdetectionvad) D FaceGen 3- D XfaceEd MPEG-4 1 FAP 66 FAP ( ) FAP 84

Computer No.53 (1992) IBM 650. Bacon TSS JRR-2.[1] free inductin decay IBM 7044 FACOM

Maxima SCORM. Algebraic Manipulations and Visualizing Graphs in SCORM contents by Maxima and Mashup Approach. Jia Yunpeng, 1 Takayuki Nagai, 2, 1

Ψηφιακή Επεξεργασία Φωνής

Statistics 104: Quantitative Methods for Economics Formula and Theorem Review

Σχέση µεταξύ της Μεθόδου των ερµατοπτυχών και της Βιοηλεκτρικής Αντίστασης στον Υπολογισµό του Ποσοστού Σωµατικού Λίπους

Correction of chromatic aberration for human eyes with diffractive-refractive hybrid elements

ΖΩΝΟΠΟΙΗΣΗ ΤΗΣ ΚΑΤΟΛΙΣΘΗΤΙΚΗΣ ΕΠΙΚΙΝΔΥΝΟΤΗΤΑΣ ΣΤΟ ΟΡΟΣ ΠΗΛΙΟ ΜΕ ΤΗ ΣΥΜΒΟΛΗ ΔΕΔΟΜΕΝΩΝ ΣΥΜΒΟΛΟΜΕΤΡΙΑΣ ΜΟΝΙΜΩΝ ΣΚΕΔΑΣΤΩΝ

46 2. Coula Coula Coula [7], Coula. Coula C(u, v) = φ [ ] {φ(u) + φ(v)}, u, v [, ]. (2.) φ( ) (generator), : [, ], ; φ() = ;, φ ( ). φ [ ] ( ) φ( ) []

Feasible Regions Defined by Stability Constraints Based on the Argument Principle

Supplementary Appendix

Radiation observation at Dome Fuji Station, Antarctica

Engineering Tunable Single and Dual Optical. Emission from Ru(II)-Polypyridyl Complexes. Through Excited State Design

ΠΑΝΕΠΙΣΤΗΜΙΟ ΘΕΣΣΑΛΙΑΣ ΠΟΛΥΤΕΧΝΙΚΗ ΣΧΟΛΗ ΤΜΗΜΑ ΠΟΛΙΤΙΚΩΝ ΜΗΧΑΝΙΚΩΝ ΠΜΣ ΕΦΑΡΜΟΣΜΕΝΗ ΜΗΧΑΝΙΚΗ ΚΑΙ ΠΡΟΣΟΜΟΙΩΣΗ ΣΥΣΤΗΜΑΤΩΝ

ΒΙΟΓΡΑΦΙΚΟ ΣΗΜΕΙΩΜΑ. Λέκτορας στο Τμήμα Οργάνωσης και Διοίκησης Επιχειρήσεων, Πανεπιστήμιο Πειραιώς, Ιανουάριος 2012-Μάρτιος 2014.

Expansion formulae of sampled zeros and a method to relocate the zeros

ΚΛΙΜΑΤΟΛΟΓΙΑ CLIMATOLOGY

Yahoo 2. SNS Social Networking Service [3,5,12] Copyright c by ORSJ. Unauthorized reproduction of this article is prohibited.

A summation formula ramified with hypergeometric function and involving recurrence relation


Nov Journal of Zhengzhou University Engineering Science Vol. 36 No FCM. A doi /j. issn

Histogram list, 11 RANDOM NUMBERS & HISTOGRAMS. r : RandomReal. ri : RandomInteger. rd : RandomInteger 1, 6

Transcript:

a) Higher-Order Correlation Analysis of Pitch Fluctuations in Sustained Normal Vowels by the Method of Surrogate Data Isao TOKUDA a), Takaya MIYANO, and Kazuyuki AIHARA 2 3 3 2 3 1. [1] [3] Department of Computer Science and Systems Engineering, Muroran Institute of Technology, Muroran-shi, 050 0071 Japan COE Center for Promotion of the COE Program, Ritsumeikan University, Kusatsu-shi, 525 8577 Japan Department of Mathematical Informatics, The University of Tokyo, Bunkyo-ku, Tokyo, 113 8656 Japan a) E-mail: tokuda@agnld.uni-potsdam.de AR [2] [4] [5] [6] [10] [11] [12] [14] [15] [3] [2] [21], [22] 3 [18], [19] Schreiber [23] A Vol. J87 A No. 3 pp. 355 363 2004 3 355

2004/3 Vol. J87 A No. 3 Schreiber 2. 3. 4. 5. 1 4 oo, ao, ka, yo /a/ Table 1 Data length, number of extracted pitches, mean pitch period, and coefficient of variation of a sustained vowel /a/ recorded from 4 subjects (oo, ao, ka, yo). oo 800 msec 137 5.86 msec 0.873 % ao 450 msec 71 6.26 msec 0.778 % ka 900 msec 223 4.06 msec 0.773 % yo 900 msec 254 3.56 msec 1.10 % 2. 2. 1 2 oo, ao 2 ka, yo /a/ 4 10 khz 16 22.05 khz 1 2. 2 [20] {T i : i =1, 2,.., N} (1) 1 4 oo, ao, ka, yo N = 137, 71, 223, 254 2 (a) oo coefficient of variation [16] α = 100 T 1 N N (T i T ) 2 [%] (2) i=1 1 {T 1,T 2,...} Fig.1 Pitch extraction via the zero-crossing method and generation of the pitch period sequence {T 1,T 2,...}. T = 1 N Ti 4 N i=1 oo, ao, ka, yo 1 [17] 1.05±0.40% 4 4 3. [18], [19] method of surrogate [21], [22] statistical hypothesis testing 356

(a) (b) (c) (d) (e) 2 (a) oo {T 1,T 2,...,T 137} (b) (a) (c) 2 (d) 3 (e) 4 Fig.2 (a) Original pitch period sequence {T 1,T 2,..., T 137} extracted from the subject oo.(b) Surrogate data generated by random shuffling of the original data (a).(c) Surrogate data that preserves the 2nd order correlation function.(d) Surrogate data that preserves the 3rd order correlation function.(e) Surrogate data that preserves the 4th order correlation function. FT [21] Schreiber [23] {T i} 357

2004/3 Vol. J87 A No. 3 3. 1 {T i} T σ τ µ } rst (τ ) {{} m times 1 = N (m 1)τ N (m 1)τ t=1 1 σ r+s+t+ (T i T ) r (T i+τ T ) s (T i+2τ T ) t (3) 2 κ 2(τ ) κ 2(τ )=µ 11(τ ) = 1 σ 1 2 N τ N τ i=1 (T i T ) (T i+τ T ) (4) 2 κ 2(τ )=µ 11(τ ) (5) 3 4 [18], [24] 3 d κ d d (C1) [21] {T i} {T i} (C2) {T i} {T i} d κ k, κ k (k =2,...,d) E({T })= d k=2 τ {κ k (τ ) κ k(τ )} 2 (8) SA (8) {T } SA (T i,t j)(i j) {...,T i,...,t j,...} {..., T j,...,t i,...} SA l t =1/log(l) E(T i T j) 1/{1 + exp( E/t)} oo {T i} 4 {T i} 3 l 2.5 10 4 SA κ 3(τ )=µ 111(τ ) (6) 4 κ 4(τ )=µ 1111(τ ) µ 0011(τ ) µ 1100(τ ) µ 1010(τ ) µ 0101(τ ) µ 1001(τ ) µ 0110(τ ) (7) 4 [18], [24] 3. 2 {T 1,T 2,...} {T 1,T 2,...} 3 (8) Fig.3 Process of minimizing the error function (8) by the simulated annealing method. 358

2 (e) 2 (a) 4 2 3 4 4. (a) (b) 4. 1 LPC [20] {T i} 5 (S1) (S2) N (c) 4 oo 4 (a) 2 κ 2(τ ) (b) 3 κ 3(τ ) (c) 4 κ 4(τ ) Fig.4 Higher order correlation functions of the subject oo s original pitch sequence (bold line) and the surrogate data that preserves the 4th order correlation function (crosses).(a) The 2nd order correlation functions κ 2(τ ), (b) The 3rd order correlation functions κ 3(τ ), (c) The 4th order correlation functions κ 4(τ ). Fig.5 5 Speech synthesis based on waveform editting. 359

2004/3 Vol. J87 A No. 3 {T 1,T 2,...,T N} (S3) [4] (S3) (S2) [25] 4. 2 5 (a) (b) (c) 2 (d) 3 (e) 4 5 [26] 20 4 oo, ao, yo, ka 5 6 4 2 2 [3] [2] 3 AR ao 4 ao 1 {T 1,...,T 71} {T 1,...,T 71} 2 {T 1,...,T 71,T 1,...,T 71}, {T 1,...,T 71,T 1,...,T 71} 7 5. 1 360

(a) (b) (c) (d) 6 2 3 4 (a) oo (b) ao (c) yo (d) ka Fig.6 Measurement of the naturalness of the synthesized sounds. From left side, random shuffed surrogate data, 2nd order surrogate data, 3rd order surrogate data, 4th order surrogate data, and the original data.(a) subject oo, (b) subject ao, (c) subject yo, (d) subject ka. 7 2 ao Fig.7 Measurement of the naturalness of the synthesized sounds from the subject ao in the case when the pitch sequence was repeated twice. 2 2 [3] [2] 3 3 AR 2 [6] [10] [27] 361

2004/3 Vol. J87 A No. 3 3 I II LPC III AR [1] L.Dolansky and P.Tjernlund, On certain irregularities of voiced-speech waveforms, IEEE Trans.Audio Electroacoust., vol.16, pp.51 56, 1968. [2] vol.47, no.8, pp.539 544, 1991. [3] vol.47, no.12, pp.928 934, 1991. [4] vol.47, no.12, pp.903 910, 1991. [5] N.Aoki and T.Ifukube, Analysis and perception of spectral 1/f characteristics of amplitude and period fluctuations in normal sustained vowels, J.Acoust. Soc. Am., vol.106, no.1, pp.423 433, 1999. [6] H.Herzel, D.Berry, I.R.Titze, and M.Saleh, Analysis of vocal disorders with method from nonlinear dynamics, J. Speech Hear. Res., vol.37, pp.1008 1019, 1994. [7] W.Mende, H.Herzel, and I.R.Titze, Bifurcations and chaos in newborn cries, Phys.Lett.A, vol.145, pp.418 424, 1990. [8] S.S. Narayanan and A.A. Alwan, A nonlinear dynamical systems analysis of fricative consonants, J. Acoust. Soc. Am., vol.97, no.4, pp.2511 2524, 1995. [9] A.Behrman, Global and local dimensions of vocal dynamics, J. Acoust. Soc. Am., vol.105, no.1, pp.432 443, 1999. [10] M.Banbrook, S.McLaughlin, and I.Mann, Speech characterization and synthesis by nonlinear methods, IEEE Trans.Speech Audio.Process.vol.7, no.1, pp.1 17, 1999. [11] M.Sato, K.Joe, and T.Hirahara, APOLONN brings us to the real world, Proc.Int.Joint Conf. Neural Networks, vol.1, pp.581 587, 1990. [12] I.Tokuda, R.Tokunaga, and K.Aihara, A simple geometrical structure underlying speech signals of the Japanese vowel /a/, Int. J. Bif. Chaos, vol.6, no.1, pp.149 160, 1996. [13] I.Tokuda, T.Miyano, and K.Aihara, Surrogate analysis for detecting nonlinear dynamics in normal vowels, J. Acoust. Soc. Am., vol.110, no.6, pp.3207 3217, 2001. [14] NLP99-152, 2000. [15] T.Ikeguchi and K.Aihara, Estimating correlation dimensions of biological time series with a reliable method, J. Int. Fuzzy Sys., vol.5, no.1, pp.33 52, 1997. [16] N.B. Pinto and I.R. Titze, Unification of perturbation measures in speech signals, J.Acoust.Soc.Am., vol.87, pp.1278 1289, 1990. [17] R.C. Scherer, V.J. Vail, and C.G. Guo, Required number of tokens to determine representative voice perturbation values, J. Speech Hear. Res., vol.38, pp.1260 1269, 1995. [18] 2000. [19] 2002. [20] 1985. [21] J.Theiler, S.Eubank, A.Longtin, B.Galdrikian, and J.D. Farmer, Testing for nonlinearity in time series: The method of surrogate data, Physica D, vol.58, pp.77 94, 1992. [22] J.Theiler and D.Prichard, Constrained-realization Monte-Carlo method for hypothesis testing, Physica D, vol.94, pp.221 235, 1996. [23] T.Schreiber, Constrained randomization of time series data, Phys. Rev. Lett., vol.80, no.10, pp.2105 2108, 1998. [24] A.Stuart and J.K.Ord, Kendall s Advanced Theory of Statistics, Fifth Ed., Charles Griffin, 1987. [25] CAS2000-41, 2000. [26] 362

1991. [27] I.Tokuda, T.Riede, J.Neubauer, M.J.Owren, and H.Herzel, Nonlinear analysis of irregular animal vocalizations, J. Acoust. Soc. Am., vol.111, no.6, pp.2908 2919, 2002. 15 6 16 9 21 11 4 4 6 8 15 14 60 11 15 COE IEEE 52 57 ME INNS SMB 363