Area Location and Recognition of Video Text Based on Depth Learning Method


Journal of Harbin University of Science and Technology, Vol. 21, No. 6, Dec. 2016
DOI: 10.15938/j.jhust.2016.06.012
CLC number: TP391.43. Document code: A. Article ID: 1007-2683(2016)06-0061-06

LIU Ming-zhu 1, ZHENG Yun-fei 1, FAN Jin-fei 1, YU Fang 2
1. School of Measure-control Technology and Communications Engineering, Harbin University of Science and Technology, Harbin 150080, China
2. Dehui Education Technology Service Center of Jilin Province, Dehui 130300, China

Abstract: Fast and accurate location and recognition of text areas in video images improves the efficiency and accuracy of video information processing. A Gabor filter is used to extract texture features of video images in four directions: horizontal, vertical, left-falling and right-falling. A deep belief network (DBN) is then built with a layer-wise incremental deep learning algorithm based on restricted Boltzmann machines (RBMs), and text regions are located in the texture feature images. The paper also studies the feasibility and recognition performance of combining morphological processing with an OCR character database to recognize the text in video images. Test results show that the proposed optimized deep learning algorithm, combined with the morphological character recognition method, both locates text regions in video images accurately and improves the efficiency and accuracy of character recognition.

Keywords: deep learning algorithm; video image; text area location; morphological denoising; character recognition

Received: 2015-06-29. Foundation item: 61401126. E-mail: lmz@hrbust.edu.cn. Author biography years: 1973, 1990, 1989.

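[Introduction text not recovered. Surviving fragments cite refs. [1]-[6], the 2D-Gabor filter of Daugman [5], OCR [6], the restricted Boltzmann machine (RBM), and the deep belief network (DBN) [7].]

The two-dimensional Gabor function and its Fourier transform are

$$ g(x, y) = K \exp\{-\pi [p^2 (x - x_0)^2 + q^2 (y - y_0)^2]\} \exp\{-2\pi j [u_0 (x - x_0) + v_0 (y - y_0)]\} \tag{1} $$

$$ F(u, v) = \frac{K}{pq} \exp\left\{-\pi \left[\frac{(u - u_0)^2}{p^2} + \frac{(v - v_0)^2}{q^2}\right]\right\} \exp\{-2\pi j [x_0 (u - u_0) + y_0 (v - v_0)]\} \tag{2} $$

where K is the amplitude of the Gaussian envelope, (x_0, y_0) is the center of the Gaussian envelope, (u_0, v_0) is the modulation frequency, and p and q are the scale parameters of the Gaussian envelope.

The filter parameters are chosen as [8]

$$ \lambda = (U_h / U_I)^{\frac{1}{M-1}}, \qquad p = \frac{(\lambda - 1) U_h}{(\lambda + 1)\sqrt{2\ln 2}}, \qquad q = \tan\left(\frac{\pi}{2T}\right) \left[U_h - 2\ln 2\,\frac{p^2}{U_h}\right] \left[2\ln 2 - \frac{(2\ln 2)^2 p^2}{U_h^2}\right]^{-\frac{1}{2}} \tag{3} $$

where U_h and U_I are the highest and lowest center frequencies, T is the number of orientations, M is the number of scales, and λ is the scale factor. The values used are U_I = 0.2, U_h = 0.4, T = 4 and M = 4.

[Figures 1 and 2 and the remaining text of this section are not recovered. Surviving fragments refer to Fig. 2(a) and Fig. 2(b), to Gabor filtering in 4 directions, and to the optimized Gabor filter of Wang et al. [9] with parameters λ and η.]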
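As a minimal sketch of how Eqs. (1) and (3) could be evaluated, the following computes λ, p and q from U_I, U_h, T and M and samples the Gabor function on a small grid. The function names, the kernel size, the angle set (0, 45, 90 and 135 degrees for the four directions) and the choice (u_0, v_0) = U_h (cos θ, sin θ) are assumptions made for illustration and are not taken from the paper. Only the carrier is rotated here; the Gaussian envelope stays axis-aligned, exactly as Eq. (1) is written.

```python
import cmath
import math


def gabor_scale_params(u_low, u_high, num_orient, num_scales):
    """Return (lam, p, q) computed as in Eq. (3)."""
    lam = (u_high / u_low) ** (1.0 / (num_scales - 1))
    two_ln2 = 2.0 * math.log(2.0)
    p = (lam - 1.0) * u_high / ((lam + 1.0) * math.sqrt(two_ln2))
    q = (math.tan(math.pi / (2.0 * num_orient))
         * (u_high - two_ln2 * p * p / u_high)
         / math.sqrt(two_ln2 - two_ln2 ** 2 * p * p / u_high ** 2))
    return lam, p, q


def gabor_kernel(size, p, q, u0, v0, amplitude=1.0):
    """Sample Eq. (1) on an odd size x size grid with (x0, y0) at the center."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            envelope = math.exp(-math.pi * (p * p * x * x + q * q * y * y))
            carrier = cmath.exp(-2j * math.pi * (u0 * x + v0 * y))
            row.append(amplitude * envelope * carrier)
        kernel.append(row)
    return kernel


def four_direction_bank(size=15, u_low=0.2, u_high=0.4):
    """Build one complex kernel per direction (angle set is an assumption)."""
    _, p, q = gabor_scale_params(u_low, u_high, num_orient=4, num_scales=4)
    bank = {}
    for angle_deg in (0, 45, 90, 135):
        theta = math.radians(angle_deg)
        bank[angle_deg] = gabor_kernel(
            size, p, q, u_high * math.cos(theta), u_high * math.sin(theta))
    return bank
```

Convolving an image with the real or imaginary part of each kernel would give one texture response per direction; that step is left out to keep the sketch dependency-free.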

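[Fig. 2(b) (Gabor filtering result) and Fig. 3 (RBM structure with visible layer v and hidden layer h) are not recovered.]

A deep belief network (DBN) [10] is built from restricted Boltzmann machines (RBMs). Consider a system S with n layers (S_1, S_2, ..., S_n), input I and output O: I → S_1 → S_2 → ... → S_n → O. If the output O equals the input I, the input passes through the system without loss of information, so every layer S_i is another representation of I.

The units of an RBM take the values 0 or 1, and the joint distribution P(v, h) of the visible layer v and the hidden layer h follows the Boltzmann distribution. The energy function is

$$ E(v, h \mid \theta) = -\sum_{ij} W_{ij} v_i h_j - \sum_i b_i v_i - \sum_j a_j h_j \tag{4} $$

where θ = {W, a, b}: W_ij is the weight between visible unit i and hidden unit j, b_i is the bias of visible unit i, and a_j is the bias of hidden unit j. The joint distribution is

$$ P_\theta(v, h) = \frac{1}{Z(\theta)} \exp(-E(v, h \mid \theta)) = \frac{1}{Z(\theta)} \prod_{ij} e^{W_{ij} v_i h_j} \prod_i e^{b_i v_i} \prod_j e^{a_j h_j}, \qquad Z(\theta) = \sum_{h, v} \exp(-E(v, h \mid \theta)) \tag{5} $$

The conditional distributions factorize over units:

$$ P(v \mid h) = \prod_i P(v_i \mid h), \qquad P(v_i = 1 \mid h) = \frac{1}{1 + \exp\left(-\sum_j W_{ij} h_j - b_i\right)} \tag{6} $$

$$ P(h \mid v) = \prod_j P(h_j \mid v), \qquad P(h_j = 1 \mid v) = \frac{1}{1 + \exp\left(-\sum_i W_{ij} v_i - a_j\right)} \tag{7} $$

Given the training set D = {v^{(1)}, v^{(2)}, ..., v^{(N)}}, the parameters θ = {W, a, b} are obtained by maximizing the regularized log-likelihood

$$ L(\theta) = \frac{1}{N} \sum_{n=1}^{N} \log P_\theta(v^{(n)}) - \frac{\lambda}{N} \lVert W \rVert_F^2 \tag{8} $$

whose gradient with respect to W_ij is

$$ \frac{\partial L(\theta)}{\partial W_{ij}} = E_{P_{data}}[v_i h_j] - E_{P_\theta}[v_i h_j] - \frac{2\lambda}{N} W_{ij} \tag{9} $$

where E_{P_data} is the expectation under the data distribution and E_{P_θ} is the expectation under the model distribution. [Surviving fragments here mention Hinton and Sejnowski and cite ref. [11].]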
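The conditionals in Eqs. (6) and (7) and the gradient in Eq. (9) can be sketched in a few lines. Because the model expectation E_{P_θ}[v_i h_j] is intractable in general, this sketch swaps in one step of contrastive divergence (CD-1), a common approximation; whether the paper trains its RBMs this way is not recoverable from the text, so treat that choice, and all function names, as assumptions.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def p_h_given_v(v, W, a):
    """Eq. (7): P(h_j = 1 | v) for every hidden unit j."""
    return [sigmoid(sum(W[i][j] * v[i] for i in range(len(v))) + a[j])
            for j in range(len(a))]


def p_v_given_h(h, W, b):
    """Eq. (6): P(v_i = 1 | h) for every visible unit i."""
    return [sigmoid(sum(W[i][j] * h[j] for j in range(len(h))) + b[i])
            for i in range(len(b))]


def sample_binary(probs, rng):
    """Draw one 0/1 sample per unit from its activation probability."""
    return [1 if rng.random() < p else 0 for p in probs]


def cd1_weight_gradient(v0, W, a, b, lam=0.0, n_train=1, rng=None):
    """Estimate Eq. (9) for one training vector v0.

    The data term uses v0 directly; the model term is approximated by a
    single Gibbs step (CD-1), which is an assumption of this sketch.
    """
    rng = rng or random.Random(0)
    ph0 = p_h_given_v(v0, W, a)                      # data-driven hidden probs
    h0 = sample_binary(ph0, rng)
    v1 = sample_binary(p_v_given_h(h0, W, b), rng)   # one-step reconstruction
    ph1 = p_h_given_v(v1, W, a)
    return [[v0[i] * ph0[j] - v1[i] * ph1[j] - 2.0 * lam / n_train * W[i][j]
             for j in range(len(a))]
            for i in range(len(b))]
```

A gradient-ascent step would then add a small multiple of this estimate to W; the bias updates follow the same pattern with v_i and h_j in place of v_i h_j.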

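A DBN is built by stacking RBMs and training them layer by layer [12], [13]. With input layer H_0 and hidden layers H_1, H_2, ..., H_n, the weights to be learned are W_i = {W_0, W_1, W_2, ..., W_{n-1}}. H_0 and H_1 first form an RBM in which H_0 is the visible layer v and H_1 is the hidden layer h; training this RBM with Eqs. (8) and (9) gives W_0. With W_0 fixed, H_1 is computed from Eq. (7), and the remaining weights W_1, W_2, ..., W_{n-1} are obtained in the same way from the successive layer pairs.

[Fig. 4 (DBN structure), Fig. 5 (DBN text location results, panels (b) and (c)) and the surrounding text are not recovered. Surviving fragments mention layers L_1, L_2, ..., L_n, the numbers n + 2 and 64, and H_0, H_1, H_2, ..., H_n.]

Morphological erosion and dilation are defined on the two-dimensional integer space Z^2. For an image set A and a structuring element C in Z^2, the erosion of A by C is

$$ A \ominus C = \{ z \mid (C)_z \subseteq A \} \tag{10} $$

and the dilation of A by C is

$$ A \oplus C = \{ z \mid (\hat{C})_z \cap A \neq \Phi \} \tag{11} $$

where (C)_z is the translation of C by z, \hat{C} is the reflection of C about the origin, and Φ is the empty set.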
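The set definitions in Eqs. (10) and (11) translate almost directly into code when a binary image is stored as a set of foreground pixel coordinates. The sketch below is illustrative only: the function names are hypothetical, the dilation uses the standard reflected form of the structuring element, and the opening (erosion followed by dilation) is shown as one common way to remove small noise, which is an assumption about how morphological denoising might be applied rather than the paper's stated procedure.

```python
def translate(C, z):
    """(C)_z: shift every point of the structuring element C by z."""
    return {(cx + z[0], cy + z[1]) for (cx, cy) in C}


def reflect(C):
    """Reflection of C about the origin."""
    return {(-cx, -cy) for (cx, cy) in C}


def erode(A, C):
    """Eq. (10): all z such that (C)_z is contained in A (C must be non-empty)."""
    c0 = next(iter(C))
    # Any valid z must place c0 on a pixel of A, which bounds the search.
    candidates = {(ax - c0[0], ay - c0[1]) for (ax, ay) in A}
    return {z for z in candidates if translate(C, z) <= A}


def dilate(A, C):
    """Eq. (11): all z such that the reflected C shifted by z meets A.

    The reflected element shifted by z hits a pixel a of A exactly when
    z = a + c for some c in C, so the result is the set of all such sums.
    """
    return {(ax + cx, ay + cy) for (ax, ay) in A for (cx, cy) in C}


def opening(A, C):
    """Erosion followed by dilation; removes blobs smaller than C."""
    return dilate(erode(A, C), C)
```

For example, with a 3 x 3 block of pixels plus one isolated pixel as A and a 2 x 2 square as C, `opening(A, C)` keeps the block and drops the isolated pixel.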

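[Figures 5 to 7 (DBN text location and OCR recognition results) and the accompanying text are not recovered.]

The proposed method is compared with the method of ref. [14], the method of Kim [15] and the SVM method [16]. Performance is measured by RR, PR and the F value:

$$ RR = \frac{c}{m}, \qquad PR = \frac{c}{n}, \qquad F = \frac{2 \cdot PR \cdot RR}{PR + RR} \tag{12} $$

where c, m and n are the counts listed in the result tables.

[Text of subsections 4.1 and 4.2 not recovered. Surviving fragments mention the 4-DBN configuration and the number 100.]

Table 1. Results for different DBN configurations

Configuration   n     m     c     RR /%   PR /%   F /%
2-DBN           378   302   224   74.17   59.26   65.88
3-DBN           378   334   251   75.15   66.40   70.50
4-DBN           378   364   295   81.04   78.04   79.51
5-DBN           378   369   301   81.57   79.62   80.58
6-DBN           378   372   304   81.72   80.42   81.06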
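As a quick arithmetic check of Eq. (12), the following sketch recomputes the three scores from the counts c, m and n. The function name is hypothetical; the formulas and the 4-DBN counts (n = 378, m = 364, c = 295) come from the text.

```python
def location_scores(c, m, n):
    """Return (RR, PR, F) as defined in Eq. (12), as fractions in [0, 1]."""
    rr = c / m
    pr = c / n
    f = 2.0 * pr * rr / (pr + rr)
    return rr, pr, f


# 4-DBN row: n = 378, m = 364, c = 295
rr, pr, f = location_scores(c=295, m=364, n=378)
print(f"RR = {100 * rr:.2f}%, PR = {100 * pr:.2f}%, F = {100 * f:.2f}%")
# prints: RR = 81.04%, PR = 78.04%, F = 79.51%
```

The output matches the tabulated 4-DBN values. Note that F simplifies to 2c / (m + n), so it depends only on the three counts.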

Table 2. Comparison with other methods

Method            n     m     c     RR /%   PR /%   F /%
(not recovered)   378   305   229   75.81   60.58   67.34
Kim               378   327   253   77.37   66.93   71.77
SVM               378   342   276   80.70   73.02   76.67
4-DBN             378   364   295   81.04   78.04   79.51

[Discussion of Table 2 and the conclusion (Section 5) are not recovered. Surviving fragments: "DBN", "Kim", "SVM", "F", "378", "5 059-631", "OCR", "Gabor".]

References

[1] (Authors and title not recovered) [J]. 2005, 28(3): 427-432.
[2] EPSHTEIN B, OFEK E, WEXLER Y. Detecting Text in Natural Scenes with Stroke Width Transform [C]. 2010 IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, USA: IEEE Computer Society, 2010: 2963-2970.
[3] (Authors and title not recovered; the title mentions SVM) [J]. 2010, 31(4): 916-922.
[4] CHEN X R, YUILLE A L. Detecting and Reading Text in Natural Scenes [C]. 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Washington D.C., USA: Institute of Electrical and Electronics Engineers Computer Society, 2004: 366-373.
[5] DAUGMAN J G. Complete Discrete 2-D Gabor Transforms by Neural Networks for Image Analysis and Compression [J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1988, 36(7): 1169-1179.
[6] (Authors and title not recovered) [J]. 2005, 10(1): 122-124.
[7] KAMARAINEN Joni, KYRKI Ville, KÄLVIÄINEN Heikki. Fundamental Frequency Gabor Filters for Object Recognition [J]. International Conference on Pattern Recognition, 2002, 16(1): 628.
[8] FU P, LI M, YIN T. Gabor Filter Based Text Extraction from Digital Document Image [J]. Tien Tzu Hsueh Pao / Acta Electronica Sinica, 2006, 34: 2387-2390.
[9] WANG X W, DING X Q, LIU C S. Optimized Gabor Filter Based Feature Extraction for Character Recognition [C]. 16th International Conference on Pattern Recognition, Quebec City, Canada: Institute of Electrical and Electronics Engineers Inc., 2002: 223-234.
[10] (Authors and title not recovered) [J]. 2015, 45(2): 596-599.
[11] HINTON G E, OSINDERO S, BAO K. Reducing the Dimensionality of Data with Neural Networks [C]. Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics, Barbados: Society for Artificial Intelligence and Statistics, United States, 2005: 128-135.
[12] HINTON G E. A Practical Guide to Training Restricted Boltzmann Machines [J]. Lecture Notes in Computer Science, 2012: 599-619.
[13] DENG C X, CHEN Y, BI H, et al. The Improved Algorithm of Edge Detection Based on Mathematics Morphology [J]. International Journal of Signal Processing, Image Processing and Pattern Recognition, 2014, 7(5): 309-322.
[14] JUNG K. Neural Network-based Text Location in Color Images [J]. Pattern Recognition Letters, 2001, 22(14): 1503-1515.
[15] KIM K C, BYUN H R, SONG Y J, et al. Scene Text Extraction in Natural Scene Images Using Hierarchical Feature Combining and Verification [C]. Proceedings of the 17th International Conference on Pattern Recognition, Cambridge, United Kingdom: Institute of Electrical and Electronics Engineers Inc., 2004: 679-682.
[16] YAN J Q, LI J, GAO X B. Chinese Text Location Under Complex Background Using Gabor Filter and SVM [J]. Neurocomputing, 2011, 74(17): 2998-3008.