ScienceOpen: research and publishing network

For Researchers

Search
Advanced search

3

views

    

0

recommends

0

shares

Record: found
Abstract: found
Article: found

Is Open Access

Multi-View Features and Hybrid Reward Strategies for Vatex Video Captioning Challenge 2019

Preprint

Author(s): Xinxin Zhu , Longteng Guo , Peng Yao , Jing Liu , Hanqing Lu

Publication date Created: 17 October 2019

Read this article at

ScienceOpen ArXiv

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

This document describes our solution for the VATEX Captioning Challenge 2019, which requires generating descriptions for the videos in both English and Chinese languages. We identified three crucial factors that improve the performance, namely: multi-view features, hybrid reward, and diverse ensemble. Our method achieves the 2nd and the 3rd places on the Chinese and English video captioning tracks, respectively.

Related collections

Most cited references 3

Record: found
Abstract: not found
Conference Proceedings: not found

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Peter Anderson, Xiaodong He, Chris Buehler … (2018)

0 comments Cited 213 times – based on 0 reviews

Record: found
Abstract: not found
Conference Proceedings: not found

A Closer Look at Spatiotemporal Convolutions for Action Recognition

Du Tran, Heng Wang, Lorenzo Torresani … (2018)

0 comments Cited 153 times – based on 0 reviews

Record: found
Abstract: not found
Conference Proceedings: not found

Self-Critical Sequence Training for Image Captioning

Jerret Ross, Etienne Marcheret, Youssef Mroueh … (2017)

0 comments Cited 131 times – based on 0 reviews

Author and article information

Journal

Publication date Created: 17 October 2019

Article

ArXiV ID: 1910.11102

SO-VID: 1b719d7f-8a0b-48a8-808f-90b4d7983254

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

History

Custom metadata

Comments 3 pages,1 figure

Categories cs.CV cs.MM

ScienceOpen disciplines: Computer vision & Pattern recognition,Graphics & Multimedia design

Data availability:

ScienceOpen disciplines: Computer vision & Pattern recognition, Graphics & Multimedia design

Comments

Comment on this article