Conference Articles

Temporal Integration for Word-Wise Caption and Scene Text Identification

Sangheeta Roy, Universiti Malaya
Palaiahnakote Shivakumara, Universiti Malaya
Umapada Pal, Indian Statistical Institute, Kolkata
Tong Lu, Nanjing University
Ainuddin Wahid Bin Abdul Wahab, Universiti Malaya

Document Type

Conference Article

Publication Title

Proceedings of the International Conference on Document Analysis and Recognition, ICDAR

Abstract

Generally video consists of edited text (i.e., caption text) and natural text (i.e., scene text), and these two texts differ from one another in nature as well as characteristics. Such different behaviors of caption and scene texts lead to poor accuracy for text recognition in video. In this paper, we explore wavelet decomposition and temporal coherency for the classification of caption and scene text. We propose wavelet of high frequency sub-bands to separate text candidates that are represented by high frequency coefficients in an input word. The proposed method studies the distribution of text candidates over word images based on the fact that the standard deviation of text candidates is high at the first zone, low at the middle zone and high at the third zone. This is extracted by mapping standard deviation values to 8 equal sized bins formed based on the range of standard deviation values. The correlation among bins at the first and second levels of wavelets is explored to differentiate caption and scene text and for determining the number of temporal frames to be analyzed. The properties of caption and scene texts are validated with the chosen temporal frames to find the stable property for classification. Experimental results on three standard datasets (ICDAR 2015, YVT and License Plate Video) show that the proposed method outperforms the existing methods in terms of classification rate and improves recognition rate significantly based on classification results.

First Page

349

Last Page

354

DOI

10.1109/ICDAR.2017.65

Publication Date

7-2-2017

Recommended Citation

Roy, Sangheeta; Shivakumara, Palaiahnakote; Pal, Umapada; Lu, Tong; and Wahab, Ainuddin Wahid Bin Abdul, "Temporal Integration for Word-Wise Caption and Scene Text Identification" (2017). Conference Articles. 225.
https://digitalcommons.isical.ac.in/conf-articles/225

This document is currently not available here.

COinS

Conference Articles

Temporal Integration for Word-Wise Caption and Scene Text Identification

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Browse

Search

Author Corner

Links

Conference Articles

Temporal Integration for Word-Wise Caption and Scene Text Identification

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Share

Browse

Search

Author Corner

Links