Source code for the paper *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
deep-learning
representation-learning
pos-tagging
encoder-decoder
video-captioning
video-to-text
msvd
video-description
msr-vtt
wacv2021
syntactic-representations
Updated Apr 16, 2021 - Python