SDv2 Dreambooth LoRA fine-tuning API #312
Conversation
@@ -37,20 +41,16 @@ class RayRuntimeSpec:
 @dataclass
-class RayExecutor:
+class RayExecutor(metaclass=SingletonMetaclass):
What was the motivation for making this a singleton? Did we ever have more than one executor before?
It was already a singleton: the .get() classmethod returned a singleton instance. This just provides a simpler way to instantiate singleton classes.
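For reference, here's a minimal sketch of what a metaclass-based singleton along the lines of SingletonMetaclass could look like (the name comes from the diff; the body below is an assumption, not the actual nos implementation):

```python
import threading


class SingletonMetaclass(type):
    """Metaclass that returns the same instance for every instantiation of a class.

    Sketch only: the real SingletonMetaclass in nos may differ.
    """

    _instances = {}
    _lock = threading.Lock()

    def __call__(cls, *args, **kwargs):
        # Lazily create the single instance the first time the class is called.
        if cls not in cls._instances:
            with cls._lock:
                if cls not in cls._instances:
                    cls._instances[cls] = super().__call__(*args, **kwargs)
        return cls._instances[cls]


class RayExecutor(metaclass=SingletonMetaclass):
    pass


assert RayExecutor() is RayExecutor()  # every call returns the same executor
```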
from nos.logging import logger

RUNTIME_ENVS = {
Would like to discuss this more I think. Is there a way we can do this all inside of a single env?
The need for this is to avoid polluting the main repo with all the dependencies, especially for training. For diffusers, we needed a specific revision, which made it difficult to support in the base conda env.
oof, yea this might make the case for dedicated training containers, each with the dependencies needed for a particular training flow.
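To make the isolation concrete, here's a hedged sketch of how a per-task RUNTIME_ENVS mapping could feed Ray's runtime_env mechanism so that training-only dependencies (e.g. a pinned diffusers revision) never touch the base conda env; the keys, pins, and task function below are illustrative, not the actual nos configuration:

```python
import ray

# Illustrative only: keys and pins are assumptions, not the real nos RUNTIME_ENVS.
RUNTIME_ENVS = {
    "inference": {"pip": []},  # base env already carries the inference deps
    "dreambooth-lora": {
        # Training-only deps, including a pinned diffusers revision, stay out of the base env.
        "pip": [
            "git+https://github.com/huggingface/diffusers.git@<pinned-revision>",
            "peft",
        ],
    },
}


@ray.remote(num_gpus=1, runtime_env=RUNTIME_ENVS["dreambooth-lora"])
def train_dreambooth_lora(config: dict) -> str:
    """Runs inside the isolated runtime env with the pinned training deps."""
    ...
```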
self,
prompts: Union[str, List[str]],
num_images: int = 1,
num_inference_steps: int = 50,
Will want to move these to the config eventually (or maybe expose in the API, so we can kick off longer training jobs)
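One option (a sketch only; the field names and defaults are assumptions, not the actual nos config) is to hoist these hard-coded keyword defaults into a config object that the API can also accept and override per request:

```python
from dataclasses import dataclass


@dataclass
class StableDiffusionInferenceConfig:
    """Sketch of a config object for the defaults currently hard-coded as kwargs."""

    num_images: int = 1
    num_inference_steps: int = 50
    guidance_scale: float = 7.5  # assumed field, shown only to illustrate the idea


# The method could then take `config: StableDiffusionInferenceConfig` instead of
# bare kwargs, and the API could pass a larger config to kick off longer jobs.
```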
@@ -108,3 +108,14 @@ service InferenceService {
 // TODO (spillai): To be implemented later (for power-users)
 // rpc DeleteModel(DeleteModelRequest) returns (DeleteModelResponse) {}
 }

+message TrainingRequest {
Going to be a hassle to represent the full state needed for training over gRPC, I think. As discussed, training might be something that doesn't run through the client (i.e. deeper integrations with pixeltable so it can run nos server code directly).
Yes, agreed. I'm not sure if we want to represent this here, since it's not standard across training tasks.
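If some of this does end up going over the wire, one possibility (purely an illustration, not what this PR implements) is to keep the typed surface minimal and ship the task-specific knobs as an opaque JSON payload, since they aren't standard across training tasks:

```python
import json
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class TrainingJobSpec:
    """Illustrative client-side spec; not the actual TrainingRequest proto."""

    method: str                                            # e.g. "stable-diffusion-dreambooth-lora"
    inputs: Dict[str, Any] = field(default_factory=dict)   # task-specific, intentionally untyped

    def to_wire(self) -> bytes:
        # Only `method` is structured; the task-specific payload travels as opaque bytes.
        return json.dumps({"method": self.method, "inputs": self.inputs}).encode()


spec = TrainingJobSpec(
    method="stable-diffusion-dreambooth-lora",
    inputs={"instance_prompt": "a photo of sks dog", "max_train_steps": 500},
)
payload = spec.to_wire()
```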
@@ -9,12 +11,19 @@

def cached_repo(
Is there a way to implement the training flow without pulling/installing these repos in their entirety? Is this so the whole thing can be dynamic and not require each env to be declared as a pip dependency?
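For context, here's a rough sketch of what a cached_repo helper along these lines might do (the signature, cache layout, and shallow-clone behavior are assumptions; the implementation in this PR may differ): clone the repo once at a pinned branch/tag into a local cache and reuse it on later calls, so at least the dynamic pull isn't repeated:

```python
import hashlib
import subprocess
from pathlib import Path

CACHE_DIR = Path.home() / ".cache" / "nos" / "repos"  # assumed cache location


def cached_repo(repo_url: str, branch: str = "main") -> Path:
    """Shallow-clone `repo_url` at `branch` into a local cache and return the path.

    Sketch only: the actual cached_repo in this PR may have a different signature.
    """
    key = hashlib.sha256(f"{repo_url}@{branch}".encode()).hexdigest()[:16]
    target = CACHE_DIR / key
    if not target.exists():
        target.parent.mkdir(parents=True, exist_ok=True)
        # --depth 1 keeps the download small; a full fetch would be needed for arbitrary SHAs.
        subprocess.run(
            ["git", "clone", "--depth", "1", "--branch", branch, repo_url, str(target)],
            check=True,
        )
    return target
```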
Summary
Related issues
Checks
- make lint: I've run make lint to lint the changes in this PR.
- make test: I've made sure the tests (make test-cpu or make test) are passing.