This is our team's code for the Datathon 2023 Challenge, based on mmaction2. You can see the demo here: Demo.
In this repo, I use the Kinetics dataset format. Please follow the folder structure below:
data
├── train
| ├── train_00001.mp4
| ├── train_00002.mp4
| ├── ...
├── val
| ├── val_00001.mp4
| ├── val_00002.mp4
| ├── ...
├── train.txt
├── val.txt
├── label.txt
- For each video in the train and val folders, please note that its frame count should be a multiple of 25 to prevent unexpected errors (see the check script after this list).
- Format of train.txt and val.txt: the number following each filename is the label of the video.
train_00001.mp4 5
train_00002.mp4 4
...
- label.txt is the file containing all the action labels used in train.txt and val.txt. For example, the competition provided 6 actions, so label.txt should be:
0
1
2
3
4
5
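Since off-size videos can fail in non-obvious ways, you may want to verify frame counts up front. Here is a minimal sketch using decord; the data/train and data/val paths are assumptions based on the layout above:

```python
import os
from decord import VideoReader

# Assumed locations, following the folder structure above
DATA_DIRS = ["data/train", "data/val"]

for data_dir in DATA_DIRS:
    for name in sorted(os.listdir(data_dir)):
        if not name.endswith(".mp4"):
            continue
        path = os.path.join(data_dir, name)
        num_frames = len(VideoReader(path))
        if num_frames % 25 != 0:
            print(f"{path}: {num_frames} frames (not a multiple of 25)")
```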
Note: If your data is similar to the MERL Shopping dataset, which has cropped videos and .mat label files, you can use tools/prepare_data_2.py to get the data structure above.
For more information about the Kinetics dataset format, you can refer to the mmaction2 guide: Prepare Datasets.
1. You can refer to the mmaction2 installation guide. (But setting up from scratch can sometimes take a long time.)
2. I have set up and saved this image to Docker Hub, and you can use it instead of setting up the environment from scratch.
- Requires Docker to run.
# Pull the image
docker pull hienhayho/mmaction2
# Start the docker container
docker run -d --gpus all --shm-size=4G -it -v path/to/your/folder:path/to/your/folder --name mmaction2 hienhayho/mmaction2:latest bash
# Execute your container
docker exec -it mmaction2 bash
# Activate venv
conda activate openmmlab
1. You can download the pretrained model from: here.
2. In the config file mvit/mvit-base-p244_32x3x1_kinetics400-rgb.py, set these values:
# Your local dataset path
ann_file_test = ... # val.txt
ann_file_train = ... # train.txt
ann_file_val = ... # val.txt
data_root = ... # train/
data_root_val = ... # val/
...
# Set the downloaded pretrained model path
load_from = ...
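For illustration, assuming the data/ folder above sits at the repository root, the filled-in values might look like this (all paths, including the checkpoint filename, are hypothetical):

```python
# Hypothetical paths, assuming the data/ layout above at the repo root
ann_file_test = 'data/val.txt'    # val.txt reused for testing
ann_file_train = 'data/train.txt'
ann_file_val = 'data/val.txt'
data_root = 'data/train/'
data_root_val = 'data/val/'

# Hypothetical filename for the downloaded pretrained checkpoint
load_from = 'checkpoints/mvit-base-p244_32x3x1_kinetics400-rgb.pth'
```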
Note: You can try different config files in config provided by mmaction2.
3. Training
CUDA_VISIBLE_DEVICES=0 python3 \
tools/train.py \
mvit/mvit-base-p244_32x3x1_kinetics400-rgb.py \
--work-dir train_mvit/
Note: If you encounter errors related to the video data, use decord to try loading the videos and remove the ones that fail, as in the sketch below.
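Here is a minimal sketch of that sweep: it opens and fully indexes each video with decord and quarantines (rather than deletes) the ones that fail. The directory names are assumptions:

```python
import os
import shutil
from decord import VideoReader

DATA_DIR = "data/train"      # assumed path; repeat for data/val
BROKEN_DIR = "data/broken"   # quarantine folder instead of deleting outright
os.makedirs(BROKEN_DIR, exist_ok=True)

for name in sorted(os.listdir(DATA_DIR)):
    if not name.endswith(".mp4"):
        continue
    path = os.path.join(DATA_DIR, name)
    try:
        reader = VideoReader(path)
        _ = reader[len(reader) - 1]  # decode the last frame to catch truncated files
    except Exception as err:
        print(f"Broken video {path}: {err}")
        shutil.move(path, os.path.join(BROKEN_DIR, name))
```

Remember to remove the corresponding lines from train.txt or val.txt after moving any videos.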
Use demo/long_video_demo.py to run inference on a video.
For example:
CUDA_VISIBLE_DEVICES=0 python3 \
demo/long_video_demo.py \
your_path_to_config_file \
your_path_to_check_point \
demo/9_3_crop.mp4 \
your_path_to_label_file \
video_demo/demo.mp4 \
--batch-size 4
For more information about the available arguments, please refer to demo/long_video_demo.py.