[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]
Shot2Story: a new multi-shot video understanding benchmark with comprehensive video summaries and detailed shot-level captions.
Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
A survey on video and language understanding.
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
The official GitHub page for the survey paper "Self-Supervised Learning for Videos: A Survey"