Just a sandbox right now to test AIHabitat.
- Include information about required habitat assets, as well as the configs
- Include
Data labelling is done with LLaVA-NeXT-Video-Qwen2-32B: clone the LLaVA-NeXT repo, put it in the root of this repo, and run the data_annotation bash script inside the dataset folder.
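
A minimal sketch of that step, assuming the usual upstream LLaVA-NeXT repo URL and that the annotation script is named `data_annotation.sh` inside `dataset/` (adjust both to match the actual files):

```bash
# Assumed repo URL and script filename; adjust to match this repo's actual layout.
git clone https://github.com/LLaVA-VL/LLaVA-NeXT.git   # place the clone at the repo root
bash dataset/data_annotation.sh                        # kicks off the labelling run
```
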
TO-DO: Think about adding an outer loop that takes in multiple scenes / instances and runs data_collection on each.
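
A hypothetical sketch of that outer loop; the scene directory, file extension, and `run_data_collection` entry point are placeholders for whatever data_collection actually exposes per scene:

```python
from pathlib import Path

SCENE_DIR = Path("data/scene_datasets")      # assumed location of the Habitat scenes
scenes = sorted(SCENE_DIR.glob("**/*.glb"))  # assumed scene file extension

def run_data_collection(scene_path: Path) -> None:
    """Placeholder for the existing single-scene data_collection routine."""
    print(f"collecting data for {scene_path}")

# Outer loop over every scene / instance found on disk.
for scene_path in scenes:
    run_data_collection(scene_path)
```
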
Common issues that I run into:
- The vision tower in LLaVA-NeXT is not working that well and keeps freezing at 12/14. Fix: load the matching CUDA/PyTorch modules and run inside the Apptainer container (see the commands after this list); inference for the 32B model also needs to be run on a large enough GPU (64 GB).
- Setting auto_batch_size in model_config seems to be the way to go; you need to be on version 4.46.0 and have enough GPU VRAM.
- There is an error where the finetuned model only outputs padding tokens; most likely it just got overfitted to output padding tokens / data pollution.
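
The environment fix for the vision-tower freeze, as used before (module versions are the ones from these notes and may differ on your cluster):

```bash
module load cuda/12.2.2 apptainer pytorch/2.0.1
apptainer run --nv $CONTAINERDIR/pytorch-2.0.1.sif   # run inference inside the PyTorch container
```
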
Common habitat issues:
- Make sure you never access env variables before calling env.reset() and then use them after the reset (see the sketch below).
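
A minimal sketch of the right ordering, assuming a standard habitat-lab `Env` and an example config path (the exact config path differs across habitat-lab versions):

```python
import habitat

# Example config path only; use whichever task config this project actually runs.
config = habitat.get_config("benchmark/nav/pointnav/pointnav_habitat_test.yaml")

with habitat.Env(config=config) as env:
    # Wrong: reading env.current_episode or observations here would give pre-reset state.
    observations = env.reset()        # reset first...
    episode = env.current_episode     # ...then read episode state and observations
    while not env.episode_over:
        observations = env.step(env.action_space.sample())
```
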
Some other ideas:
- Do model merging / MoE with PoseGPT-style models to get even better gesture recognizers, but this is not needed right now.
- Next steps: figure out how to convert natural-language planning into feasible actions, or just make the VLM output motor rotations instead (see the sketch after this list).
- Another step could be to have the VLM look at the current image and figure out where to head next, which is another important planning step.
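
A hypothetical sketch of the planning-to-actions idea from this list; the VLM call is stubbed out, and the keyword-to-action mapping (using habitat-lab's default discrete nav action names) is only an illustration:

```python
import re

# Default discrete nav action names in habitat-lab; the keyword mapping itself is made up.
ACTION_MAP = {
    "stop": "stop",
    "forward": "move_forward",
    "left": "turn_left",
    "right": "turn_right",
}

def plan_to_actions(plan_text: str) -> list[str]:
    """Map a free-form VLM plan onto discrete simulator actions, keyword by keyword."""
    words = re.findall(r"[a-z]+", plan_text.lower())
    return [ACTION_MAP[w] for w in words if w in ACTION_MAP]

# Example: plan text as a VLM might produce it.
print(plan_to_actions("Turn left, then move forward until you reach the door."))
# -> ['turn_left', 'move_forward']
```
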
Modules you need: habitat-lab and huggingface
- git-lfs: module load git-lfs

Models:
- cuda: module load cuda
- apptainer and pytorch: module load apptainer pytorch

habitat-lab:
- module load ffmpeg
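
Putting the module list together, a rough one-shot setup (names and versions mirror these notes; adjust for your cluster):

```bash
module load git-lfs                    # large checkpoints from huggingface
module load ffmpeg                     # video rendering for habitat-lab
module load cuda/12.2.2 apptainer pytorch/2.0.1
apptainer run --nv $CONTAINERDIR/pytorch-2.0.1.sif   # drop into the PyTorch container
```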