python-ggml-matmul

ggml-matmul with pytorch

quick start

set your cuda path in setup.py (line 24)
install pytorch >= 2.1.2
build ggml: cd ggml-master && mkdir build && cd build && cmake .. && export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/path/to/cuda/lib64 && cmake --build . --config Release -j 8
pip install .
see test.py for a usage example
my_gguf.py is copied from https://github.com/kvcache-ai/ktransformers/blob/main/ktransformers/util/custom_gguf.py
gguf file is downloaded from https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF
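
The cuda path needed for setup.py and LD_LIBRARY_PATH can often be found automatically. The snippet below is a minimal sketch and not part of this repo: find_cuda_home is a hypothetical helper, and the CUDA_HOME / CUDA_PATH environment variables and the /usr/local/cuda default are common conventions assumed here, so your system may differ.

```python
import os
import shutil
from pathlib import Path
from typing import Optional


def find_cuda_home(default: str = "/usr/local/cuda") -> Optional[str]:
    """Return a likely CUDA install directory, or None if none is found.

    Hypothetical helper (assumption): not provided by this repo.
    """
    # 1. Environment variables that are commonly set for CUDA installs.
    for var in ("CUDA_HOME", "CUDA_PATH"):
        value = os.environ.get(var)
        if value and Path(value).is_dir():
            return value

    # 2. nvcc on PATH; it normally lives at <cuda>/bin/nvcc.
    nvcc = shutil.which("nvcc")
    if nvcc:
        return str(Path(nvcc).resolve().parent.parent)

    # 3. Conventional default location (assumption; Linux only).
    if Path(default).is_dir():
        return default

    return None


if __name__ == "__main__":
    cuda_home = find_cuda_home()
    if cuda_home is None:
        print("cuda not found; set the path in setup.py by hand")
    else:
        print("cuda path for setup.py:", cuda_home)
        print("dir for LD_LIBRARY_PATH:", os.path.join(cuda_home, "lib64"))
```

Run it before editing setup.py and paste the printed path in. If it reports nothing, the path has to be set by hand as described in the steps above.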

support for pytorch cuda graph (I have no idea now)
