Add pretrained MAE weights, option to load checkpoints in ViT builder #479

Closed
ebsmothers wants to merge 3 commits into main

Conversation

@ebsmothers (Contributor) commented Oct 4, 2023

Summary:
MAE fine-tuning is done on the encoder (ViT) only, so this change makes it easy to load MAE pretrained weights directly into our ViT class.
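
For context, a minimal sketch of the kind of loading this enables (the builder name, signature, and checkpoint handling below are illustrative assumptions, not the exact torchmultimodal API):

from typing import Optional

import torch
from torch import nn

# Stand-in for the real ViT encoder, which lives in
# torchmultimodal/modules/encoders/vision_transformer.py.
class TinyViT(nn.Module):
    def __init__(self, hidden_dim: int = 768) -> None:
        super().__init__()
        self.proj = nn.Linear(hidden_dim, hidden_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.proj(x)

# Hypothetical builder: construct the encoder and optionally load an
# MAE-pretrained checkpoint (URL or local path) straight into it.
def build_vit(ckpt_path: Optional[str] = None) -> nn.Module:
    model = TinyViT()
    if ckpt_path is not None:
        if ckpt_path.startswith(("http://", "https://")):
            state_dict = torch.hub.load_state_dict_from_url(ckpt_path, map_location="cpu")
        else:
            state_dict = torch.load(ckpt_path, map_location="cpu")
        model.load_state_dict(state_dict)
    return model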

Test plan:

python -m pytest -v tests/models/*
...
========== 207 passed, 25 warnings in 424.67s (0:07:04) ===========================

python -m pytest -v tests/modules/*
...
======================== 192 passed, 2 skipped, 22 warnings in 10.75s ==========================

Test instantiating ViT using MAE pretrained weights for each of the 3 checkpoints:

[Screenshot, 2023-10-05: successful instantiation for all three checkpoints]
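
(That spot-check amounts to roughly the loop below; the three checkpoint URLs are placeholders, since the exact names in the screenshot did not survive, and build_vit is the hypothetical builder sketched under the summary.)

# Placeholder URLs for the three MAE checkpoints referenced in the test plan;
# the real entries live in this PR's MAE_MODEL_MAPPING.
CHECKPOINTS = [
    "https://example.com/mae_vit_b_16.pth",
    "https://example.com/mae_vit_l_16.pth",
    "https://example.com/mae_vit_h_14.pth",
]

for url in CHECKPOINTS:
    model = build_vit(ckpt_path=url)  # hypothetical builder from the sketch above
    n_params = sum(p.numel() for p in model.parameters())
    print(f"{url}: loaded ViT with {n_params} parameters")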

@facebook-github-bot added the CLA Signed label on Oct 4, 2023.
@codecov-commenter commented Oct 4, 2023

Codecov Report

Attention: 7 lines in your changes are missing coverage. Please review.

Comparison is base (0de91e1) 72.21% compared to head (f488945) 72.18%.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #479      +/-   ##
==========================================
- Coverage   72.21%   72.18%   -0.03%     
==========================================
  Files         187      187              
  Lines       13160    13174      +14     
==========================================
+ Hits         9503     9510       +7     
- Misses       3657     3664       +7     
Files                                                  | Coverage Δ
...hmultimodal/modules/encoders/vision_transformer.py | 52.72% <66.66%> (+0.80%) ⬆️
...orchmultimodal/models/masked_auto_encoder/model.py | 92.98% <45.45%> (-5.08%) ⬇️


@@ -20,6 +20,13 @@
)


MAE_MODEL_MAPPING = {
Contributor (reviewer):

Isn't it nicer to expose a vit_mae* wrapper with pretrained=True here itself, like CLIP, rather than making the user pass the checkpoint around?

@ebsmothers (Contributor, author):

Yeah, that's fine too. I thought it was a bit weird because we would then be mixing builders across files (i.e. we'd either have a builder for MAE inside vision_transformer.py or a builder returning ViT inside MAE model.py). But maybe the second option (I think that's what you're suggesting?) isn't so bad.
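
For concreteness, the CLIP-style wrapper being discussed would look roughly like this (the builder name, mapping contents, and URL are placeholders rather than the API that actually landed):

import torch
from torch import nn

# Placeholder mapping from model name to MAE checkpoint URL; the real
# MAE_MODEL_MAPPING in this diff holds the actual entries.
MAE_MODEL_MAPPING = {
    "vit_b_16": "https://example.com/mae_pretrain_vit_base.pth",
}

# Hypothetical wrapper: the user asks for a pretrained ViT by name with
# pretrained=True and never handles the checkpoint path directly.
def vit_b_16_mae(pretrained: bool = True) -> nn.Module:
    model = nn.Sequential(nn.Linear(768, 768))  # stand-in for the real ViT builder
    if pretrained:
        state_dict = torch.hub.load_state_dict_from_url(
            MAE_MODEL_MAPPING["vit_b_16"], map_location="cpu"
        )
        model.load_state_dict(state_dict)
    return model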

@facebook-github-bot commented:
@ebsmothers has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot commented:
@ebsmothers merged this pull request in 6f32ca1.
