v0.9.5 - now with more robust flavour
Finetuning Terminus XL Velocity v2
What's Changed
- New cropping logic is now working across the board for parquet/json backends. Images are always cropped now, even when
cropped=false
, if necessary to maintain 8px or 64px alignment with the resulting dataset.- Resulting image sizes and aspect ratios did not change for
resolution_type=area
- Resulting image sizes and aspect ratios did change for
resolution_type=pixel
- This was necessary to avoid stretching/squeezing images when aligning to 64px interval
- Resulting image sizes and aspect ratios did not change for
- Discord webhook support, see the TUTORIAL for information.
- "Sensible defaults" are now set for
minimum_image_size
,maximum_image_size
, andtarget_downsample_size
to avoid unexpected surprises mostly when usingcrop=true
, but also for some benefits when usingcrop=false
as well. - Image upscaling restrictions have been relaxed, but it will refuse to upscale an image beyond 25%, and instead asks you to change the dataset configuration values.
- Image quality when training SDXL models has substantially improved thanks to the minimisation of the microconditioning input ranges:
Finetuning a particularly poorly-performing Terminus checkpoint with reduced high frequency patterning - Single subject dreambooth was benchmarked on SDXL with 30 diverse images, achieving great results in just 500 steps.
Commits
- Convert image to accepted format for calculate_luminance by @Beinsezii in #376
- vae cache fix for SDXL / legacy SD training
- epoch / resume step fix for a corner case where the path to the training data includes the dataset name by @bghira in #377
- when crop=false, we will crop from the intermediary size to the target size instead of squishing
- set default min_image_size, maximum_image_size, and target_downsample_size values to 100%, 150%, and 150% of the value set for resolution by @bghira in #378
- resolved bugged-out null embed when dropout is disabled
- discord webhook support
- cuda/rocm: bugfix for eval on final legacy (sd 1.5/2.1) training validations
- avoid stretching/squeezing images by always cropping to maintain 8/64px alignment
- set default values for minimum_image_size, maximum_image_size, and target_downsample_size by @bghira in #379
Full Changelog: v0.9.5-beta...v0.9.5-beta2