Analysis of prosodic features in native and non-native Japanese using generation process model of fundamental frequency contours

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS
TECHNICAL REPORT OF IEICE

H. HIRANO, W. GU, K. HIROSE, N. MINEMATSU, and G. KAWAI

Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562 Japan
Graduate School of Information Science and Technology, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033 Japan
Institute of Language and Culture Studies, Hokkaido University, Kita 17, Nishi 8, Kita-ku, Sapporo, 060-0817 Japan
E-mail: {hiran,wtgu,hirose,mine}@gavo.t.u-tokyo.ac.jp, goh@kawai.com

Abstract
We compared the F0 patterns of utterances by native and non-native (Mandarin Chinese) speakers of Japanese. Model-based analysis indicates that non-native speakers tend to transfer the prosodic characteristics of their native language into their Japanese utterances. Compared with native speakers, they show the following peculiarities: 1) a higher baseline frequency; 2) more frequent phrase commands, and hence shorter prosodic phrases; 3) a tendency to decompose a bunsetsu into multiple prosodic words, and hence redundant or incorrectly located accent commands; 4) occasionally more rapid F0 falls (mostly at the end of the utterance), and hence negative accent commands there. These peculiarities can be ascribed to interference from the tonal and syllable-timed nature of Mandarin Chinese, to the difficulty of prosodic planning while speaking a second language, and to a lack of adequate teaching of sentence intonation skills.

Key words: L2 learning of Japanese, sentence reading, F0 patterns, generation process model of F0 contours

[Sections 1 to 2.4: the Japanese body text did not survive extraction. Recoverable details: these sections cite [1] to [8]; the recording equipment is named as Sony D-38B and Sony TCD-D1 PRO (DAT), with 48 kHz sampling and 16-bit quantization; F0 was extracted with PRAAT [8]; earlier work by Hirano [5], [6] is referred to in Section 1.3.]

[Table 1: durations in [sec] given as mean (SD); the row and column labels are lost.]

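[Figure 1: Pitch (Hz) panels for non-native (n=7) and native (n=7) speakers.]

[Figure 2: Non-native speakers (Chinese). Pitch (Hz) against Time (s); time axis ends at 8.36633 s. Segment labels: silb me ga ne no P ni a u P na o ya no a ni ni P i N do no ryo: ri o na ra i ni i ku sile]

[Figure 3: Native speakers (Japanese). Pitch (Hz) against Time (s); time axis ends at 4.29423 s. Segment labels: silb me ga ne no ni a u P na o ya no a ni ni P i N do no ryo: ri o na ra i ni i ku sile]

[Sections 3 to 4.1: the Japanese body text did not survive extraction. Recoverable details: a test statistic F(1,138) = 13.37 is reported in Section 3 (the p-value is garbled); Section 4.1 introduces the generation process model of Fujisaki [9], with the pitch pattern LHHLLL(L) as an example and a reference to Figure 4.]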

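[Figure 4: schematic of the generation process model: phrase commands (magnitude Ap, timing T0) and accent commands (amplitude Aa, onset T1, end T2) drive the phrase component Gp(t) and the accent component Ga(t), which are added to log_e Fb to give log_e F0(t).]

The F0 contour is expressed as

\[ \log_e F_0(t) = \log_e F_b + \sum_{i=1}^{I} A_{pi}\, G_p(t - T_{0i}) + \sum_{j=1}^{J} A_{aj} \{ G_a(t - T_{1j}) - G_a(t - T_{2j}) \} \tag{1} \]

\[ G_p(t) = \begin{cases} \alpha^2 t \exp(-\alpha t) & (t \ge 0) \\ 0 & (t < 0) \end{cases} \tag{2} \]

\[ G_a(t) = \begin{cases} \min[\, 1 - (1 + \beta t) \exp(-\beta t),\ \gamma \,] & (t \ge 0) \\ 0 & (t < 0) \end{cases} \tag{3} \]

where
F_b: baseline frequency,
I: number of phrase commands,
J: number of accent commands,
A_pi: magnitude of the i-th phrase command,
A_aj: amplitude of the j-th accent command,
T_0i: timing of the i-th phrase command,
T_1j: onset of the j-th accent command,
T_2j: end of the j-th accent command.

The constants are fixed at α = 3.0/s, β = 20.0/s, and γ = 0.9. F0 was extracted using the method of [11].

4.2 [Japanese body text lost; the section cites [10] and PRAAT [8].]

4.3 Fb

Table 3: Fb (Hz)

            CH male   CH female   JP male   JP female
mean (Hz)   84.5      157.3       78.1      131.7
SD          12.1      8.6         4.6       9.8

The mean Fb of the Chinese female speakers (157.3 Hz) is higher than that of the Japanese female speakers (131.7 Hz).

[Figure 5: histogram of Fb (Hz) against number of occurrences for JP_male, JP_female, CH_male, and CH_female.]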
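As an illustration of Eqs. (1) to (3), the following is a minimal Python sketch that synthesizes an F0 contour from a set of phrase and accent commands. The function names, the command tuples, and the example command values are hypothetical and chosen only for illustration; they are not taken from the analysis in this report. The constants are assumed to be the values commonly used with this model (α = 3.0/s, β = 20.0/s, γ = 0.9), and the baseline of 131.7 Hz is the JP female mean from Table 3.

```python
import math

# Assumed model constants (commonly used values for this model).
ALPHA = 3.0   # 1/s, natural angular frequency of the phrase control mechanism
BETA = 20.0   # 1/s, natural angular frequency of the accent control mechanism
GAMMA = 0.9   # ceiling of the accent component


def phrase_response(t, alpha=ALPHA):
    """Eq. (2): impulse response of the phrase control mechanism."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=BETA, gamma=GAMMA):
    """Eq. (3): step response of the accent control mechanism."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def f0_contour(times, fb, phrase_cmds, accent_cmds):
    """Eq. (1): F0 in Hz at each time in `times`.

    phrase_cmds: list of (T0, Ap) pairs.
    accent_cmds: list of (T1, T2, Aa) triples.
    """
    contour = []
    for t in times:
        log_f0 = math.log(fb)
        for t0, ap in phrase_cmds:
            log_f0 += ap * phrase_response(t - t0)
        for t1, t2, aa in accent_cmds:
            log_f0 += aa * (accent_response(t - t1) - accent_response(t - t2))
        contour.append(math.exp(log_f0))
    return contour


if __name__ == "__main__":
    # Hypothetical commands: one phrase command and one accent command.
    times = [i * 0.01 for i in range(200)]  # 0.00 s to 1.99 s in 10 ms steps
    f0 = f0_contour(times, fb=131.7,
                    phrase_cmds=[(0.0, 0.5)],
                    accent_cmds=[(0.3, 0.8, 0.4)])
    for t, value in list(zip(times, f0))[::20]:
        print(f"{t:.2f} s  {value:.1f} Hz")
```

Because the components are summed in the log domain, a negative accent command (Aa < 0), such as those observed near utterance ends for the non-native speakers, can be passed to the same function without any change.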

Table 4 (caption lost; values as extracted, some zeros may be missing)

        CH male   CH female   JP male   JP female
mean    4.69      4.83        2.69      3.4
SD      1.45      1.22        .8        .65

Table 5 (caption lost; values as extracted, some zeros may be missing)

        CH male   CH female   JP male   JP female
mean    9.69      1.71        6.54      7.83
SD      2.46      3.28        1.48      1.81

[Figure 6: number of occurrences against Sentence ID (1 to 7) for CH_female, CH_male, JP_female, and JP_male, relative to the bunsetsu.]

[Figure 7: number of occurrences against Sentence ID (1 to 7) for CH_female, CH_male, JP_female, and JP_male, relative to the mora and the bunsetsu.]

[Figure 8: number of occurrences against number of accent commands in a bunsetsu for JP_male, JP_female, CH_male, and CH_female.]

[Sections 4.4 to 4.6: the Japanese body text did not survive extraction. Recoverable details: Section 4.4 refers to the group means 2.69, 3.4, 4.69, and 4.83 of Table 4; Section 4.6 mentions the finals /-en/ and /-ai/.]

[Figure: model-based analysis of two readings of the sentence "a o mo ri no ri N go o ta be ru ma de i e no ri N go wa ta be ra re na i". Each panel shows F0(t) [Hz] with the baseline Fb, the phrase commands Ap, and the accent commands Aa against TIME [s]; pauses are marked P.]

[Sections 4.7 and 5: the Japanese body text did not survive extraction; reference [12] is cited here.]

References

[1] [...] http://www.jasso.go.jp/statistics/intl student/data5.html (accessed 15 Oct. 2006).
[2] [...], pp. 29-37, 1991.
[3] [...], May 2005.
[4] [...], 1991.
[5] Hirano, H. and Kawai, G., Pitch patterns of intonational phrases and intonational phrase groups in native and non-native speech, Proc. Interspeech 2005, Lisbon, Portugal, pp. 761-764, 2005.
[6] [...], SP2005-188, pp. 23-28, 2006.
[7] Hirano, H., Gu, W., and Hirose, K., Model-based Analysis of F0 Contours of Japanese Sentences Uttered by Chinese Speakers, The 7th Phonetic Conference of China (7th PCC) and International Forum on Phonetic Frontiers, to appear, 2006.
[8] Boersma, P. and Weenink, D., Praat: doing phonetics by computer, http://www.praat.org/ (accessed 15 Oct. 2006).
[9] Fujisaki, H. and Hirose, K., Analysis of voice fundamental frequency contours for declarative sentences of Japanese, J. Acoust. Soc. Japan (E) 5 (4), pp. 233-242, 1984.
[10] [...], Vol. 43, No. 7, pp. 2155-2168, 2002.
[11] Hirose, K., Fujisaki, H., and Seto, S., A scheme for pitch extraction of speech using autocorrelation function with frame length proportional to the time lag, Proc. IEEE ICASSP, Vol. 1, pp. 149-152, 1992.
[12] Fujisaki, H., Hirose, K., Hallé, P., and Lei, H., Analysis and modeling of tonal features in polysyllabic words and sentences of the Standard Chinese, Proc. ICSLP 1990, Kobe, Japan, pp. 841-844, 1990.