
[vLLM] metadata script #959

Open
wants to merge 2 commits into main
Conversation

Collaborator

@mishig25 mishig25 commented Oct 8, 2024

Follow-up to #957. To precisely show the vLLM snippet on HF models, we need a record of which models are supported by vLLM. vLLM provides a supported models list: doc page, python src.


This PR creates a GitHub cron job (runs once a day) script that:

  1. parses the python src through the AST to extract the supported architectures list (both for transformers & ggml/llama.cpp styles); see the sketch below
  2. uploads the result to huggingface/vllm-metadata. You can see a working example here.
  3. HF models will use this list to decide whether or not to show the vLLM snippet.

Find the python script here: get_vlmm_metadata.py.zip (you can test-run it locally if you want).
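
For orientation, here is a minimal sketch of that flow. The registry URL, output filename, and dataset repo type are assumptions made for illustration, and extract_models_dict is a simplified stand-in for the AST helper in the actual script (quoted in the review below):

import ast
import json
import os

import requests
from huggingface_hub import HfApi

# Assumed location of the vLLM model registry source; the real script may point elsewhere.
VLLM_REGISTRY_URL = "https://raw.githubusercontent.com/vllm-project/vllm/main/vllm/model_executor/models/registry.py"

def extract_models_dict(source_code: str) -> dict:
    # Merge every *_MODELS literal dict defined at module level in the registry source.
    models: dict = {}
    for node in ast.walk(ast.parse(source_code)):
        if isinstance(node, ast.Assign):
            for target in node.targets:
                if isinstance(target, ast.Name) and target.id.endswith("_MODELS"):
                    try:
                        models.update(ast.literal_eval(node.value))
                    except ValueError:
                        # _MODELS itself is built by unpacking the sub-dicts, so it is not a literal; skip it.
                        pass
    return models

source_code = requests.get(VLLM_REGISTRY_URL).text
models_dict = extract_models_dict(source_code)
architectures = [item for tup in models_dict.values() for item in tup]

with open("architectures.json", "w") as f:
    json.dump(architectures, f, indent=2)

# Push the result so HF models can check architecture membership.
HfApi(token=os.environ["HF_VLLM_METADATA_PUSH"]).upload_file(
    path_or_fileobj="architectures.json",
    path_in_repo="architectures.json",
    repo_id="huggingface/vllm-metadata",
    repo_type="dataset",  # assumption: the metadata lives in a dataset repo
)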

@mishig25 mishig25 changed the title vlmm_metadata [vLLM] metadata script Oct 8, 2024
Collaborator Author

mishig25 commented Oct 8, 2024

@simon-mo could you confirm that this list is the correct way to find vLLM-supported models, for both transformers and ggml/llama.cpp architectures?

_MODELS = {
    **_TEXT_GENERATION_MODELS,
    **_EMBEDDING_MODELS,
    **_MULTIMODAL_MODELS,
    **_SPECULATIVE_DECODING_MODELS,
}

@simon-mo

simon-mo commented Oct 8, 2024

Yes!

Member

@Vaibhavs10 Vaibhavs10 left a comment


Looks dope! I didn't test the script locally, but went through the exported dataset - looks amazing!

source_code = response.text

models_dict = extract_models_dict(source_code)
architectures = [item for tup in models_dict.values() for item in tup]
Member


Suggested change
architectures = [item for tup in models_dict.values() for item in tup]
architectures = sorted(list({item for tup in models_dict.values() for item in tup}))

Maybe, if we want to remove duplicates and assuming tuple order does not matter (i.e., llama does not have to appear before LlamaForCausalLM).
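
A quick illustration of the difference, using a made-up models_dict (the second entry is hypothetical, just to force a duplicate):

models_dict = {
    "LlamaForCausalLM": ("llama", "LlamaForCausalLM"),
    "MistralForCausalLM": ("llama", "MistralForCausalLM"),  # hypothetical entry sharing the "llama" tag
}

[item for tup in models_dict.values() for item in tup]
# -> ['llama', 'LlamaForCausalLM', 'llama', 'MistralForCausalLM']

sorted({item for tup in models_dict.values() for item in tup})
# -> ['LlamaForCausalLM', 'MistralForCausalLM', 'llama']

(The list() wrapper in the suggestion is optional, since sorted already returns a list.)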

Contributor

@osanseviero osanseviero left a comment


Very cool 🔥

- name: Execute Python script
  env:
    HF_VLLM_METADATA_PUSH: ${{ secrets.HF_VLLM_METADATA_PUSH }}
  run: |
Contributor


Can we move this code to a script rather than having the Python code in the yaml? It will be easier to maintain, update, and review

Collaborator Author


"it will be easier to review"

agree with this point

"it will be easier to maintain, update"

I think maintaining a separate python script would be painful. We would need to find a place to put this python script and tell the yaml job to download and run it (which can introduce other security issues, since we would be running whatever is downloaded).

from huggingface_hub import HfApi

def extract_models_sub_dict(parsed_code, sub_dict_name):
    class MODELS_SUB_LIST_VISITOR(ast.NodeVisitor):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
class MODELS_SUB_LIST_VISITOR(ast.NodeVisitor):
class ModelsSubListVisitor(ast.NodeVisitor):
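
For context, a rough sketch of what this helper might look like with the suggested naming; this is not the exact code from the PR, just the general ast.NodeVisitor pattern it appears to use:

import ast

def extract_models_sub_dict(parsed_code, sub_dict_name):
    # Grab the literal dict assigned to `sub_dict_name` at module level.
    class ModelsSubListVisitor(ast.NodeVisitor):
        def __init__(self):
            self.result = {}

        def visit_Assign(self, node):
            for target in node.targets:
                if isinstance(target, ast.Name) and target.id == sub_dict_name:
                    # Sub-dicts like _TEXT_GENERATION_MODELS are plain literals,
                    # so literal_eval can materialize them directly.
                    self.result = ast.literal_eval(node.value)
            self.generic_visit(node)

    visitor = ModelsSubListVisitor()
    visitor.visit(parsed_code)
    return visitor.result

Calling it once per sub-dict name and merging the results would reproduce the combined _MODELS mapping without importing vLLM itself.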
