Replies: 1 comment 5 replies
-
Hi and thanks for your interest! It depends on what you're using Aphrodite for. Aphrodite creates an OpenAI-compatible API, meaning any frontend that works with an OpenAI API (and allows configuring the base OpenAI url to http://localhost), you should be fine. If you want a notebook style UI, I recommend mikupad, otherwise Open WebUI and LobeChat are good alternatives. As for the second question, you can create a shell script and make it an executable. For example, save this in a #/bin/bash
aphrodite run NousResearch/Meta-Llama-3.1-8B-Instruct -tp 4 --enable-chunked-prefill [other args go here as needed] Depending on your desktop environment and your file manager, there should be an option for making it an executable when you right click on it. You can also do it from the terminal by running On a somewhat related note, we will be getting support for asymmetric multi-GPU soon, e.g. allowing you to launch aphrodite on 5 GPUs using Tensor Parallelism. |
Beta Was this translation helpful? Give feedback.
-
I'm relatively new to AI. I recently completed a 5x 3090 build (and will be adding more as budget permits). I use it for work and learning (I have an interest in fine tuning).
I have two questions:
I am growing bored of Oobabooga and read that Aphrodite works great for multi-gpu rigs like mine. However, it has been a challenge to set up (due to my relative inexperience) and I had to resort to having ChatGPT help me. We got it set up correctly but I'm struggling to find a halfway decent GUI front end. Silly Tavern looks like role play stuff and that's of no interest to me. OpenWeb UI looked very nice but was unable to get it working with Aphrodite.
Is there a recommended front end for Aphrodite?
Question 2:
Is there a quicker way to switch between LLM models instead of using Terminal? Or do you guys just create script that you double click instead of having to type all that out into terminal when loading a model? If I'm not mistaken you have to load the model with all the parameters/etc?
And does anyone know if there is a halfway decent guide to optimizing Aphrodite? I mostly use GGUF and EXL2 models. I am also in the process of setting up my first fine-tune project.
Beta Was this translation helpful? Give feedback.
All reactions