Recent advances in video captioning with object detection

Document Type

Book Chapter

Publication Title

Advancement of Deep Learning and Its Applications in Object Detection and Recognition

Abstract

Object detection, a primary area of computer vision, has tremendously boosted other computer vision tasks ranging from fine-grained classification to captioning. Post Deep learning object detection methodology can be broadly segregated into two types: (i) Two-stage region proposal-based methods and (ii) Single-stage regression-based methods. In this chapter, we first overview both types of object detection methodology. However, our primary focus lies in the second part, which describes the advancements in the video captioning task due to improved object detectors.

First Page

1

Last Page

21

Publication Date

12-25-2022

This document is currently not available here.

Share

COinS