TVLT Pre-Training

After preparing the data and before running the training script, set the data_root variable in the scripts to your dataset directory, e.g.

data_root='./dataset'
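
To see which scripts need editing, it can help to list every place data_root is assigned. The helper below is only a sketch: the function name find_data_root is hypothetical, and the default directory scripts is assumed from the pretraining command in this document.

```shell
# List every line that assigns data_root in the shell scripts of a directory.
# Assumption: the scripts live in ./scripts, as the pretraining command suggests.
find_data_root() {
    local script_dir="${1:-scripts}"
    # -n prefixes each match with its line number so the edit is easy to locate.
    grep -n "data_root=" "$script_dir"/*.sh
}
```

Calling find_data_root scripts from the repository root prints the matching lines, which can then be edited by hand.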

HowTo100M

Download the HowTo100M (link) metadata files and videos, and organize the data as shown below:

Dataset
│
├── pretrain_dataset                   
│   ├── caption.json
│   └── videos_ht
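
Before launching training, the layout above can be checked with a short shell function. This is a minimal sketch, not part of the repository: the name check_howto100m is hypothetical, and it assumes data_root points at the top-level directory of the tree.

```shell
# Check that the HowTo100M files are where the tree above places them.
# Assumption: data_root is the top-level directory of that tree.
check_howto100m() {
    local data_root="${1:-./dataset}"
    local status=0
    if [ ! -f "$data_root/pretrain_dataset/caption.json" ]; then
        echo "missing file: $data_root/pretrain_dataset/caption.json" >&2
        status=1
    fi
    if [ ! -d "$data_root/pretrain_dataset/videos_ht" ]; then
        echo "missing directory: $data_root/pretrain_dataset/videos_ht" >&2
        status=1
    fi
    # Returns 0 when both entries exist, 1 otherwise.
    return "$status"
}
```

For example, check_howto100m ./dataset && echo "HowTo100M layout OK" reports success only when both the caption file and the video directory are present.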

YTTemporal

Download the YTTemporal (link) metadata files and videos, and organize the data as shown below:

Dataset
│
├── pretrain_dataset      
│   └── videos_yt
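
A similar check can confirm that the videos_yt directory exists and actually contains files. Again, this is only a sketch under assumptions: check_yttemporal is a hypothetical name, and data_root is assumed to be the top-level directory of the tree above.

```shell
# Check that the videos_yt directory exists and is not empty.
# Assumption: data_root is the top-level directory of the tree above.
check_yttemporal() {
    local data_root="${1:-./dataset}"
    local video_dir="$data_root/pretrain_dataset/videos_yt"
    if [ ! -d "$video_dir" ]; then
        echo "missing directory: $video_dir" >&2
        return 1
    fi
    # ls -A lists all entries except . and .., so empty output means an empty directory.
    if [ -z "$(ls -A "$video_dir")" ]; then
        echo "directory is empty: $video_dir" >&2
        return 1
    fi
    return 0
}
```

Running check_yttemporal ./dataset before the pretraining script catches a missing or empty video directory early, rather than partway through data loading.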

Pretraining

Pretraining Script

bash scripts/pretrain_mae_vam.sh