Token-wise, real-time display inference module for Llama 2 and other LLMs.
- Install the dependencies:

cd llm-tokenwise-inference
pip install -r requirements.txt
- Run the following program in IPython or a Jupyter notebook.
from llminferencepkg import TokenWiseLLM

model = TokenWiseLLM("path/to/model")  # local path or a Hugging Face repo ID
model.inference("Question")            # prints each token as it is generated
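For context, token-wise display generally works by emitting each decoded token to the terminal as soon as it is sampled, rather than waiting for the full completion. The sketch below illustrates the display side of that loop with a plain generator; `stream_tokens` and the fixed token list are hypothetical stand-ins (not part of `llminferencepkg`), where a real `TokenWiseLLM` would pull tokens from the model's sampling loop instead.

```python
import sys
import time

def stream_tokens(tokens, delay=0.0):
    """Print tokens one at a time and return the assembled text.

    Illustrative only: `tokens` stands in for the stream a model's
    sampling loop would produce.
    """
    pieces = []
    for tok in tokens:
        sys.stdout.write(tok)
        sys.stdout.flush()  # flush per token so output appears in real time
        pieces.append(tok)
        time.sleep(delay)   # optional pacing; 0 by default
    sys.stdout.write("\n")
    return "".join(pieces)

if __name__ == "__main__":
    stream_tokens(["Hello", ",", " world", "!"])
```

The per-token `flush()` is the key detail: without it, stdout buffering can hold back output until generation finishes, defeating the real-time effect.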