v0.9.5 - now with more robust flavour

@bghira bghira released this 06 May 20:58
· 1974 commits to release since this release
eddbfa6

Finetuning Terminus XL Velocity v2

What's Changed

  • New cropping logic now works across the board for parquet/json backends. Images are now always cropped, even when crop=false, whenever that is necessary to keep the resulting dataset aligned to 8px or 64px.
    • Resulting image sizes and aspect ratios did not change for resolution_type=area
    • Resulting image sizes and aspect ratios did change for resolution_type=pixel
    • This was necessary to avoid stretching/squeezing images when aligning to a 64px interval
  • Discord webhook support, see the TUTORIAL for information.
  • "Sensible defaults" are now set for minimum_image_size, maximum_image_size, and target_downsample_size to avoid surprises, mostly when using crop=true, though crop=false benefits as well.
  • Image upscaling restrictions have been relaxed, but the trainer still refuses to upscale an image by more than 25% and instead asks you to change the dataset configuration values.
  • Image quality when training SDXL models has substantially improved thanks to the minimisation of the microconditioning input ranges:
    Finetuning a particularly poorly-performing Terminus checkpoint with reduced high frequency patterning
  • Single subject dreambooth was benchmarked on SDXL with 30 diverse images, achieving great results in just 500 steps.
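
The alignment cropping and the 25% upscale limit described above can be illustrated with a rough sketch. This is not the project's actual code: the function names `aligned_crop_box` and `exceeds_upscale_limit` are hypothetical, the crop is assumed to be centred, and the limit is assumed to mean "target edge more than 1.25x the source edge".

```python
def aligned_crop_box(width: int, height: int, alignment: int = 64) -> tuple[int, int, int, int]:
    """Hypothetical helper: return a centred (left, top, right, bottom) crop box
    whose width and height are multiples of `alignment`.

    Cropping the excess pixels keeps the aspect ratio of the remaining content
    intact, whereas resizing to the aligned size would stretch or squeeze it.
    """
    if width < alignment or height < alignment:
        raise ValueError("image is smaller than one alignment step")
    # Round each side down to the nearest multiple of the alignment.
    new_w = (width // alignment) * alignment
    new_h = (height // alignment) * alignment
    # Centre the crop (an assumption; other crop styles are possible).
    left = (width - new_w) // 2
    top = (height - new_h) // 2
    return (left, top, left + new_w, top + new_h)


def exceeds_upscale_limit(source_edge: int, target_edge: int, limit: float = 0.25) -> bool:
    """Hypothetical check: True if reaching `target_edge` would require
    upscaling `source_edge` by more than `limit` (25% by default)."""
    return target_edge > source_edge * (1 + limit)
```

For example, a 1000x700 image would be cropped to 960x640 rather than resized, and an 800px edge could be upscaled to 1000px under this reading of the limit but not to 1001px.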

Commits

  • Convert image to accepted format for calculate_luminance by @Beinsezii in #376
  • vae cache fix for SDXL / legacy SD training
  • epoch / resume step fix for a corner case where the path to the training data includes the dataset name by @bghira in #377
  • when crop=false, we will crop from the intermediary size to the target size instead of squishing
  • set default min_image_size, maximum_image_size, and target_downsample_size values to 100%, 150%, and 150% of the value set for resolution by @bghira in #378
  • resolved bugged-out null embed when dropout is disabled
  • discord webhook support
  • cuda/rocm: bugfix for eval on final legacy (sd 1.5/2.1) training validations
  • avoid stretching/squeezing images by always cropping to maintain 8/64px alignment
  • set default values for minimum_image_size, maximum_image_size, and target_downsample_size by @bghira in #379
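
As a rough illustration of the defaults mentioned in #378, the three values could be derived from resolution as below. The function name `default_size_limits` and the returned dictionary are hypothetical; only the 100% / 150% / 150% ratios come from the commit message.

```python
def default_size_limits(resolution: float) -> dict[str, float]:
    """Hypothetical helper: derive default size limits from `resolution`.

    Ratios taken from the commit message: 100%, 150%, and 150% of the
    configured resolution. The unit (pixels or megapixels) is whatever
    unit `resolution` itself uses.
    """
    return {
        "minimum_image_size": resolution * 1.0,
        "maximum_image_size": resolution * 1.5,
        "target_downsample_size": resolution * 1.5,
    }
```

With resolution set to 1024, this sketch gives a minimum of 1024 and a maximum and downsample target of 1536.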

Full Changelog: v0.9.5-beta...v0.9.5-beta2