Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Aphrodite Engine to Local Apps #723

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

AlpinDale
Copy link

This PR adds Aphrodite Engine to the list of local apps.

Aphrodite is a tensor-parallel LLM inference engine based on vLLM, with support for almost all transformers models and quantization formats. It currently supports:

  • Hugging Face Transformers
  • GGUF
  • ExLlamaV2
  • GPTQ
  • AWQ
  • Bitsandbytes
  • Smoothquant+
  • EETQ
  • AQLM
  • QuIP#

Deeplink support is not planned because it's a CLI-only app. This is my first time writing TypeScript, please let me know if I've made a mistake. Cheers!

Here's the SVG, if needed.
pygchisel

@Vaibhavs10
Copy link
Member

Thanks for the contribution @AlpinDale - massive fan of your Hub work too! Sorry for the delay on this!

cc: @julien-c

@AlpinDale
Copy link
Author

I noticed there was a PR for vLLM which streamlined the quantization stuff a lot better. I'll probably update this PR to follow that.

@julien-c
Copy link
Member

ah i had missed that PR, thanks for pinging @Vaibhavs10!

aphrodite-engine looks cool 🔥

Copy link
Member

@Vaibhavs10 Vaibhavs10 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hey hey @AlpinDale - not sure if you're still planning to, but we'll launch a new batch of apps soon! Would be cool to include this as well.

Let me know if you need any help! 🤗

@AlpinDale
Copy link
Author

Thanks for reminding me @Vaibhavs10 ! I'll work on this again tonight and hopefully we can finish it up.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants