Skip to content

Eutenacity/python-ggml-matmul

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

python-ggml-matmul

ggml-matmul with pytorch

quick start

  1. copy your cuda path to setup.py line24
  2. install pytorch >= 2.1.2
  3. install ggml (cd ggml-master, mkdir build, cd build, cmake .., export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/path/to/cuda/lib64, cmake --build . --config Release -j 8)
  4. pip install .
  5. test.py is the example
  6. my_gguf.py is copied from https://github.com/kvcache-ai/ktransformers/blob/main/ktransformers/util/custom_gguf.py
  7. gguf file is downloaded from https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF

changes in ggml

  1. ggml-master\src\ggml-cuda.cu line 2270-2275
  2. ggml-master\include\ggml-cuda.h line 45

to do

supoort to pytorch cuda graph (I have no idea now)

About

python-api for ggml-matmul

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published