[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
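For orientation, a minimal sketch of the dense-and-sparse idea this entry refers to: a handful of outlier weights are kept in full precision as a sparse matrix, while the remaining dense part is quantized to low bit-width. SqueezeLLM itself uses sensitivity-based non-uniform (k-means) quantization; the symmetric uniform 4-bit grid below is a simplifying assumption for illustration only.

```python
import numpy as np

def dense_and_sparse_quantize(W, outlier_frac=0.005, bits=4):
    """Split W into a low-bit dense part plus full-precision sparse outliers."""
    W = W.astype(np.float32)
    # Treat the k largest-magnitude weights as outliers.
    k = max(1, int(outlier_frac * W.size))
    thresh = np.partition(np.abs(W).ravel(), -k)[-k]
    outlier_mask = np.abs(W) >= thresh

    sparse = np.where(outlier_mask, W, 0.0)   # kept in full precision
    dense = np.where(outlier_mask, 0.0, W)    # quantized below

    # Symmetric uniform quantization of the dense remainder.
    levels = 2 ** (bits - 1) - 1
    scale = np.abs(dense).max() / levels
    scale = scale if scale > 0 else 1.0
    q = np.clip(np.round(dense / scale), -levels, levels).astype(np.int8)
    return q, scale, sparse

def dequantize(q, scale, sparse):
    return q.astype(np.float32) * scale + sparse

W = np.random.randn(256, 256)
q, s, sp = dense_and_sparse_quantize(W)
print("max abs error:", np.abs(W - dequantize(q, s, sp)).max())
```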
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
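A rough sketch of the core observation behind KV cache quantization: keys tend to have outlier channels, so they are quantized per-channel, while values are quantized per-token. KVQuant itself additionally uses pre-RoPE key quantization, non-uniform datatypes, and outlier isolation; only the axis choice is reproduced here.

```python
import numpy as np

def quantize_along(x, axis, bits=4):
    """Symmetric uniform quantization with one scale per slice along `axis`."""
    levels = 2 ** (bits - 1) - 1
    scale = np.abs(x).max(axis=axis, keepdims=True) / levels
    scale = np.where(scale == 0, 1.0, scale)
    q = np.clip(np.round(x / scale), -levels, levels).astype(np.int8)
    return q, scale

seq_len, head_dim = 1024, 128
K = np.random.randn(seq_len, head_dim).astype(np.float32)
V = np.random.randn(seq_len, head_dim).astype(np.float32)

qK, sK = quantize_along(K, axis=0)  # per-channel: one scale per key dimension
qV, sV = quantize_along(V, axis=1)  # per-token: one scale per cached token

K_hat = qK * sK
print("key reconstruction RMSE:", np.sqrt(((K - K_hat) ** 2).mean()))
```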
OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
This repository features a custom-built decoder-only language model with a total of 37 million parameters 🔥. I train the model to ask questions about a given context.
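The repository does not spell out its architecture here, so as a point of reference, this is an illustrative PyTorch sketch of a decoder-only transformer whose config is guessed purely to land near a ~37M parameter budget (vocab size, depth, and width below are assumptions, not the repo's actual values):

```python
import torch
import torch.nn as nn

class TinyDecoderLM(nn.Module):
    def __init__(self, vocab=16000, d=512, heads=8, layers=8, ctx=512):
        super().__init__()
        self.tok = nn.Embedding(vocab, d)
        self.pos = nn.Embedding(ctx, d)
        block = nn.TransformerEncoderLayer(
            d_model=d, nhead=heads, dim_feedforward=4 * d,
            batch_first=True, norm_first=True)
        self.blocks = nn.TransformerEncoder(block, num_layers=layers)
        self.head = nn.Linear(d, vocab, bias=False)
        self.head.weight = self.tok.weight  # weight tying saves parameters

    def forward(self, ids):
        T = ids.size(1)
        x = self.tok(ids) + self.pos(torch.arange(T, device=ids.device))
        # Causal mask so each position attends only to earlier tokens.
        mask = nn.Transformer.generate_square_subsequent_mask(T).to(ids.device)
        x = self.blocks(x, mask=mask)
        return self.head(x)

model = TinyDecoderLM()
print(sum(p.numel() for p in model.parameters()) / 1e6, "M params")
```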
Overview of self-supervised learning for tiny models, including distillation-based methods (a.k.a. self-supervised distillation) and non-distillation methods.
Code for "On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models"
Help us define the Pareto front of small models for MNIST classification. Frugal AI.
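The Pareto front mentioned above can be computed directly: a (size, accuracy) point is on the front if no other model is both smaller and at least as accurate. The model names and numbers below are made up for illustration.

```python
def pareto_front(models):
    """models: list of (name, num_params, accuracy).
    Smaller and more accurate is better."""
    front = []
    for name, p, acc in models:
        dominated = any(
            p2 <= p and a2 >= acc and (p2, a2) != (p, acc)
            for _, p2, a2 in models
        )
        if not dominated:
            front.append((name, p, acc))
    return sorted(front, key=lambda m: m[1])

candidates = [
    ("logreg", 7_850, 0.92),
    ("cnn-tiny", 30_000, 0.97),
    ("mlp-small", 50_000, 0.96),  # dominated by cnn-tiny
    ("cnn-big", 500_000, 0.99),
]
print(pareto_front(candidates))
```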
Testing the Phi-3-Vision model running locally.