visual-understanding

Here are 2 public repositories matching this topic...

bcmi / Causal-VidQA

[CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The code used in our paper "From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering", CVPR2022.

commonsense-reasoning video-question-answering evidence-reason visual-understanding video-question-answering-dataset

Updated Jul 11, 2024
Python

jaleedkhan / neusire

Star

NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment

knowledge-graph image-captioning image-generation scene-graph visual-reasoning visual-genome ms-coco image-representation scene-graph-generation knowledge-enrichment scene-graph-to-image commonsense-knowledge neuro-symbolic-ai visual-understanding scene-graph-to-text scene-graph-enrichment

Updated Mar 10, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the visual-understanding topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the visual-understanding topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly