Questions about downstream tasks #15

Open
Yipinggggg opened this issue Apr 5, 2024 · 1 comment

Comments

@Yipinggggg

Hi, great work! I have a question about something I don't understand.

The backbone you used for training is a TimeSformer, which takes a sequence of frames as input, but for all the downstream tasks the input is a single frame. Maybe I haven't fully understood the code, but what role does the time dimension play in the downstream tasks?

Thank you very much!

@Kyfafyd
Member

Kyfafyd commented Apr 5, 2024

Hi @Yipinggggg
Thanks for your interest!
All of our downstream tasks take video sequences as the model input so that the model can capture temporal information.
Could you point out which part of the code is confusing?
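
To make the shape question concrete, here is a minimal sketch, not taken from this repository, of how a sequence of frames is typically packed into a clip for a video backbone. The helper name `frames_to_clip`, the frame size, the clip length, and the channels-first `(C, T, H, W)` layout are all assumptions for illustration; the actual data loaders in the code may differ. It also shows that a single frame is just the special case of a clip with a time dimension of length 1.

```python
import numpy as np

def frames_to_clip(frames):
    """Stack a list of (H, W, C) frames into one (C, T, H, W) clip.

    Hypothetical helper: the (C, T, H, W) layout is an assumption,
    not necessarily the layout used by this repository.
    """
    clip = np.stack(frames, axis=0)      # (T, H, W, C)
    return clip.transpose(3, 0, 1, 2)    # (C, T, H, W)

# Four dummy RGB frames of an assumed size 224 x 224.
frames = [np.zeros((224, 224, 3), dtype=np.float32) for _ in range(4)]

clip = frames_to_clip(frames)            # video sequence, T = 4
single = frames_to_clip(frames[:1])      # single frame, T = 1
batch = clip[None]                       # add a batch axis: (B, C, T, H, W)

print(clip.shape, single.shape, batch.shape)
```

If a downstream data loader yields tensors whose time axis has length greater than 1, the input is a video sequence rather than a single frame, which is one way to check which case applies.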
