[CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The code used in our paper "From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering", CVPR2022.
commonsense-reasoning
video-question-answering
evidence-reason
visual-understanding
video-question-answering-dataset
-
Updated
Jul 11, 2024 - Python