Skip to content
View Rayrtfr's full-sized avatar

Block or report Rayrtfr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. FasterTransformer FasterTransformer Public

    Forked from void-main/FasterTransformer

    Transformer related optimization, including BERT, GPT

    C++ 17 1

  2. fastertransformer_backend fastertransformer_backend Public

    Forked from void-main/fastertransformer_backend

    Python 9

  3. llama.cpp llama.cpp Public

    Forked from ggerganov/llama.cpp

    Port of Facebook's LLaMA model in C/C++

    C++ 1 1

  4. llama2-webui llama2-webui Public

    Forked from liltom-eth/llama2-webui

    Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference.

    Python 1

  5. Llama2-Chinese Llama2-Chinese Public

    Forked from LlamaFamily/Llama-Chinese

    最好的中文Llama大模型

    Python

  6. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python