Journal Articles

A New Deep Wavefront Based Model for Text Localization in 3D Video

Lokesh Nandanwar, Universiti Malaya
Palaiahnakote Shivakumara, Universiti Malaya
Raghavendra Ramachandra, Norges Teknisk-Naturvitenskapelige Universitet
Tong Lu, Nanjing University
Umapada Pal, Indian Statistical Institute, Kolkata
Apostolos Antonacopoulos, University of Salford
Yue Lu, East China Normal University

Article Type

Research Article

Publication Title

IEEE Transactions on Circuits and Systems for Video Technology

Abstract

With the evolution of electronic devices such as 3D cameras, text localization in 3D video (e.g., for indexing) is increasingly drawing the attention of the multimedia and video processing community. Existing methods focus on 2D video, and their performance degrades in the presence of challenges specific to 3D video, such as shadow areas associated with text and irregularly sized and shaped text. This paper proposes the first approach that successfully addresses the challenges of 3D video in addition to those of 2D video. It employs a number of innovations. The first is Generalized Gradient Vector Flow (GGVF) for dominant point detection. The second is the Wavefront concept for detecting text candidate points from those dominant points. In addition, an Adaptive B-Spline Polygon Curve Network (ABS-Net) is proposed for accurate text localization in 3D videos by constructing tight-fitting bounding polygons from the text candidate points. Extensive experiments on custom (3D video) and standard (2D video and scene text) datasets show that the proposed method is practical and useful, and that overall it outperforms existing state-of-the-art methods.

First Page

3375

Last Page

3389

DOI

10.1109/TCSVT.2021.3110990

Publication Date

6-1-2022

Comments

Open Access, Green

Recommended Citation

Nandanwar, Lokesh; Shivakumara, Palaiahnakote; Ramachandra, Raghavendra; Lu, Tong; Pal, Umapada; Antonacopoulos, Apostolos; and Lu, Yue, "A New Deep Wavefront Based Model for Text Localization in 3D Video" (2022). Journal Articles. 3118.
https://digitalcommons.isical.ac.in/journal-articles/3118
