Popular repositories
- FasterTransformer (forked from void-main/FasterTransformer)
  Transformer-related optimization, including BERT and GPT
- fastertransformer_backend (forked from void-main/fastertransformer_backend)
  Python · 9
- llama2-webui (forked from liltom-eth/llama2-webui)
  Run Llama 2 locally with a gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supports Llama-2-7B/13B/70B with 8-bit and 4-bit quantization, GPU inference (6 GB VRAM), and CPU inference.
  Python · 1
- vllm (forked from vllm-project/vllm)
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python