Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SYCL] fix multi-gpu issue on sycl #8554

Merged
merged 11 commits into from
Jul 25, 2024
Merged

Conversation

ClarkChin08
Copy link
Contributor

@ClarkChin08 ClarkChin08 commented Jul 18, 2024

fix multi-gpu crash issue by filter the platforms of sycl.

Signed-off-by: Chen Xi <xi2chen@intel.com>
@github-actions github-actions bot added the SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language label Jul 18, 2024
@ClarkChin08 ClarkChin08 changed the title fix multi-gpu issue on sycl [SYCL] fix multi-gpu issue on sycl Jul 18, 2024
Chen Xi added 3 commits July 18, 2024 07:13
Signed-off-by: Chen Xi <xi2chen@intel.com>
Signed-off-by: Chen Xi <xi2chen@intel.com>
Signed-off-by: Chen Xi <xi2chen@intel.com>
@airMeng
Copy link
Collaborator

airMeng commented Jul 18, 2024

@luoyu-intel

@airMeng
Copy link
Collaborator

airMeng commented Jul 18, 2024

@ClarkChin08 can you attach the measurements results? like llama3-70B on 8 GPUs, memory consumption on each GPU, performance?

@ClarkChin08
Copy link
Contributor Author

This is the llama2-70b memory consumption data and the performance data:
0. run command: ./build/bin/llama-cli -m ../llama-2-70b-chat.Q4_0.gguf -p "how to build a website?" -n 400 -e -ngl 81 -sm layer

  1. memory consumption
    image

image
2. performance
image

Signed-off-by: Chen Xi <xi2chen@intel.com>
ggml/src/ggml-sycl/dpct/helper.hpp Outdated Show resolved Hide resolved
ggml/src/ggml-sycl/dpct/helper.hpp Outdated Show resolved Hide resolved
@mofosyne mofosyne added Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level Review Complexity : High Generally require indepth knowledge of LLMs or GPUs and removed Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level labels Jul 19, 2024
Signed-off-by: Chen Xi <xi2.chen@intel.com>
Signed-off-by: Chen Xi <xi2.chen@intel.com>
@ClarkChin08
Copy link
Contributor Author

image
This is the new performance table with input=6 and output=32

Copy link
Collaborator

@OuadiElfarouki OuadiElfarouki left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All good thank you!

Signed-off-by: Chen Xi <xi2.chen@intel.com>
@github-actions github-actions bot added the documentation Improvements or additions to documentation label Jul 25, 2024
Signed-off-by: Chen Xi <xi2.chen@intel.com>
Signed-off-by: Chen Xi <xi2.chen@intel.com>
@airMeng airMeng merged commit ed67bcb into ggerganov:master Jul 25, 2024
53 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation Review Complexity : High Generally require indepth knowledge of LLMs or GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants