Merge wilds mllm #266

Draft: wants to merge 36 commits into base branch mllm
Conversation

@i-gao (Collaborator) commented Sep 21, 2023

Modular eval code

TODOs:

  • test an eval for each dataset

@liyongqi67 commented Oct 4, 2023

Has the evaluate code on this branch been tested? I ran it with the --eval_flickr30 flag, and it reported an error:

  File "/home/share/yongqi/project/open_flamingo/open_flamingo/src/helpers.py", line 240, in forward
    assert (
AssertionError: current text cannot be longer than conditioned media locations

My script is:

CUDA_VISIBLE_DEVICES=3,4,6,7 torchrun --nnodes=1 --nproc_per_node=4 --master_port=1997 ./open_flamingo/eval/evaluate.py \
    --model_family flamingo \
    --vision_encoder_path ViT-L-14 \
    --vision_encoder_pretrained openai \
    --lm_path anas-awadalla/mpt-1b-redpajama-200b-hf-style \
    --tokenizer_path anas-awadalla/mpt-1b-redpajama-200b-hf-style \
    --cross_attn_every_n_layers 1 \
    --results_file results.json \
    --precision fp32 \
    --batch_size 1 \
    --eval_flickr30 \
    --shots 0

I printed the two corresponding lengths by adding `print(x.shape[1], media_locations.shape[1])` in helpers.py just before line 240:

47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
47 47
48 47

At the last call, 48 > 47, which trips the assertion.
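
For context, the failing check in helpers.py presumably boils down to a length guard along these lines (a minimal sketch; the function name and tensor shapes here are illustrative, not the actual OpenFlamingo code):

    import torch

    def check_media_locations(x, media_locations):
        # Sketch of the guard that fires: the text sequence may not extend
        # past the cached media-location mask.
        assert x.shape[1] <= media_locations.shape[1], (
            "current text cannot be longer than conditioned media locations"
        )

    x = torch.zeros(1, 48, 512)                  # 48 text positions after a decode step
    media_locations = torch.zeros(1, 47).bool()  # mask still covers only 47 positions
    check_media_locations(x, media_locations)    # raises AssertionError, matching 48 > 47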

And if I set batch_size=2, it reports another error:

  File "/home/share/yongqi/project/open_flamingo/open_flamingo/src/helpers.py", line 273, in forward
    sim = sim.masked_fill(~text_to_media_mask, -torch.finfo(sim.dtype).max)
RuntimeError: The size of tensor a (2) must match the size of tensor b (6) at non-singleton dimension 0
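
That second error looks like a broadcasting mismatch between sim and text_to_media_mask: masked_fill needs the mask to broadcast against sim, and a leading dimension of 2 (the batch) cannot broadcast to 6 (plausibly batch * heads). A minimal reproduction, with guessed, illustrative shapes rather than the actual ones in helpers.py:

    import torch

    sim = torch.randn(6, 5, 5)                       # e.g. (batch * n_heads, T_text, T_media)
    text_to_media_mask = torch.ones(2, 5, 5).bool()  # (batch, T_text, T_media): leading dim too small

    try:
        # masked_fill requires the mask to broadcast to sim's shape;
        # size 2 cannot broadcast to size 6, so this raises RuntimeError.
        sim.masked_fill(~text_to_media_mask, -torch.finfo(sim.dtype).max)
    except RuntimeError as e:
        print(e)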

@liyongqi67

In evaluate.py at line 747, the code should be revised from

        outputs = eval_model.get_outputs(
            batch_images=batch_images,
            batch_text=batch_text,
            min_generation_length=min_generation_length,
            max_generation_length=max_generation_length,
            num_beams=num_beams,
            length_penalty=length_penalty,
        )

to

        outputs = eval_model.get_outputs(
            batch_images=batch_images,
            batch_text=batch_text,
            min_new_tokens=min_generation_length,
            max_new_tokens=max_generation_length,
            num_beams=num_beams,
            length_penalty=length_penalty,
        )

This is because min_new_tokens and max_new_tokens are the keyword argument names accepted by the underlying LM's generate().
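
For reference, a standalone Hugging Face generate() call with those keyword arguments looks like this (gpt2 is just a stand-in model for illustration):

    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")  # stand-in model
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    inputs = tokenizer("Two cats are sitting on", return_tensors="pt")
    outputs = model.generate(
        **inputs,
        min_new_tokens=8,    # lower bound on newly generated tokens (prompt excluded)
        max_new_tokens=32,   # upper bound on newly generated tokens
        num_beams=3,
        length_penalty=1.0,
    )
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))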

@i-gao (Collaborator, Author) commented Oct 4, 2023

Hi @liyongqi67, thanks for pointing out these issues! Sorry, I have not finished cleaning up this gnarly merge yet -- will get to it in the next few days.

@liyongqi67

> Hi @liyongqi67, thanks for pointing out these issues! Sorry, I have not finished cleaning up this gnarly merge yet -- will get to it in the next few days.

Many thanks for your effort.
