Recent advances in video captioning with object detection
Document Type
Book Chapter
Publication Title
Advancement of Deep Learning and Its Applications in Object Detection and Recognition
Abstract
Object detection, a primary area of computer vision, has tremendously boosted other computer vision tasks ranging from fine-grained classification to captioning. Object detection methods of the deep learning era can be broadly divided into two types: (i) two-stage region proposal-based methods and (ii) single-stage regression-based methods. In this chapter, we first give an overview of both types. Our primary focus, however, is the second part of the chapter, which describes the advances in video captioning made possible by improved object detectors.
First Page
1
Last Page
21
Publication Date
12-25-2022
Recommended Citation
Ullah, Nasib and Mohanta, Partha Pratim, "Recent advances in video captioning with object detection" (2022). Book Chapters. 125.
https://digitalcommons.isical.ac.in/book-chapters/125