Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compatibility #1

Open
inferno46n2 opened this issue Jun 3, 2024 · 6 comments
Open

Compatibility #1

inferno46n2 opened this issue Jun 3, 2024 · 6 comments

Comments

@inferno46n2
Copy link

Hello,

So this require a uniquely trained video model to run with? Did your team also finetune a SVD model to use with this?

Confused what type of video diffusion models accepts these compressed latents.

@yzhang2016
Copy link

The tuned SVD model will be released soon. The original SVD can also be used, but its performance is worse than the tuned one.

@frankchieng
Copy link

how to generate more frames with SVD and CV-VAE?

@sijeh
Copy link
Collaborator

sijeh commented Jun 4, 2024

Hello,

So this require a uniquely trained video model to run with? Did your team also finetune a SVD model to use with this?

Confused what type of video diffusion models accepts these compressed latents.

Any downstream models obtained from SD1.5 and SD2.1, such as various image diffusion community models, as well as video models, such as SVD, Videocrafter, and Animatediff, can be used with CV-VAE. In addition, compatibility can be further enhanced by performing a small amount of fine-tuning on the diffusion model.

@sijeh
Copy link
Collaborator

sijeh commented Jun 4, 2024

The tuned SVD model will be released soon. The original SVD can also be used, but its performance is worse than the tuned one.

The inference code and model weights of SVD will be coming soon.

@radna0
Copy link

radna0 commented Jul 10, 2024

Is it possible to fine-tune Open-Sora or Open-Sora-PLan with CV-VAE, Have your team tried comparing this with SVD? @sijeh

@sijeh
Copy link
Collaborator

sijeh commented Jul 12, 2024

Is it possible to fine-tune Open-Sora or Open-Sora-PLan with CV-VAE, Have your team tried comparing this with SVD? @sijeh

Open-Sora and Open-Sora-Plan are initialized from pixelart-alpha, which also uses the SD2.1 VAE. However, they later trained their own video VAE, and the latent space is no longer compatible with SD2.1.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants