#

ggml

Here are 91 public repositories matching this topic...

llama.cpp

ggerganov / llama.cpp

LLM inference in C/C++

Updated Nov 22, 2024
C++

rustformers / llm

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

rust ai ml llm ggml

Updated Jun 24, 2024
Rust

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Updated Nov 22, 2024
Python

LostRuins / koboldcpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

llama language-model gemma mistral koboldai llm llamacpp ggml koboldcpp gguf

Updated Nov 22, 2024
C++

leejet / stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

flux ai cplusplus image-generation diffusion text2image image2image img2img txt2img latent-diffusion stable-diffusion ggml flux-dev flux-schnell

Updated Oct 24, 2024
C++

RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

machine-learning deep-learning quantization language-model llm rwkv ggml

Updated Aug 7, 2024
C++

guinmoon / LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

macos swift ios ai llama gpt-2 rwkv ggml gptneox starcoder

Updated Nov 20, 2024
Swift

RahulSChand / gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

gpu pytorch llama quantization language-model huggingface llm llamacpp ggml llama2

Updated Nov 2, 2024
JavaScript

PABannier / bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech generation

machine-learning text-to-speech inference tts ggml

Updated Nov 16, 2024
C++

abacaj / mpt-30B-inference

Run inference on MPT-30B using CPU

ggml ctransformers mpt-30b

Updated Jun 30, 2023
Python

Maknee / minigpt4.cpp

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

c machine-learning deep-learning cpp quantization multimodal ggml minigpt4

Updated Aug 8, 2023
C++

azkadev / whisper

Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models

Updated Sep 18, 2024
C++

the-crypt-keeper / can-ai-code

Self-evaluating interview for AI coders

ai transformers humaneval llm langchain llama-cpp ggml

Updated Nov 21, 2024
Python

monatis / clip.cpp

CLIP inference in plain C/C++ with no extra dependencies

c cpp image-search clip multimodal ggml

Updated Aug 18, 2024
C++

LLaMA-Cult-and-More

shm007g / LLaMA-Cult-and-More

Large Language Models for All, 🦙 Cult and More, Stay in touch !

tensorflow transformers pytorch llama gpt alpaca loralib vicuna deepspeed gpt4 llm chatgpt ggml gptq

Updated Jun 1, 2023
HTML

azkadev / bark

WIP Library Text To Speech From Suno AI's Bark in C/C++ for fast inference

dart machine-learning text-to-speech ai deep-learning clone neural-network voice fake tts bark ggml

Updated Apr 13, 2024
C++

azkadev / general_ai

GENERAL Ai Library For DART & Flutter

dart machine-learning library ai deep-learning ml artificial-intelligence flutter whisper piper azkadev stable-diffusion ggml

Updated Apr 13, 2024
C++

staghado / vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

c cpu ai computer-vision cpp image-classification edge-computing vision-transformer whisper-cpp llamacpp ggml

Updated Apr 11, 2024
C++

mayooear / private-chatbot-mpt30b-langchain

Chat with your data privately using MPT-30b

gpt llm langchain ggml

Updated Jun 29, 2023
Python

mgonzs13 / llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

cpp embeddings llama gpt ros2 vlm reranking llm langchain llava llamacpp ggml gguf rerank llavacpp

Updated Nov 22, 2024
C++

Improve this page

Add a description, image, and links to the ggml topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ggml topic, visit your repo's landing page and select "manage topics."