Minor output alignment issue #2317

Open
matthew99a opened this issue Sep 20, 2024 · 0 comments
matthew99a commented Sep 20, 2024

When vc.pipeline splits the input into more than one segment, each additional segment shifts all subsequent audio output by one frame (i.e. the length of self.window, 0.01 s). As a result, the final audio is also slightly shorter than the input. The effect is barely perceptible, but it can be verified in audio editing software.
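
To make the description above concrete, here is a rough model of how the offset would accumulate if each segment after the first adds exactly one frame of shift. The function name and the linear model are my own assumptions for illustration; only the 0.01 s frame length comes from the description.

```python
FRAME_SECONDS = 0.01  # one frame, as stated in the description above


def cumulative_offset_seconds(num_segments):
    """Offset after the last segment, assuming one frame of shift per extra segment.

    This is a simplified model for illustration, not code from the project.
    """
    extra_segments = max(num_segments - 1, 0)
    return extra_segments * FRAME_SECONDS


for n in (1, 2, 5):
    print(n, round(cumulative_offset_seconds(n), 2))
```

Under this model a single-segment input has no offset, which matches the observation that the problem only appears with more than one segment.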

After experimenting with several attempted fixes, I found that this effect can be drastically reduced by changing lines 388, 405, 423 and 440 of infer/modules/vc/pipeline.py from "self.t_pad_tgt : -self.t_pad_tgt" to "(self.t_pad_tgt - self.window) : -(self.t_pad_tgt - self.window)".
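
A minimal, standalone sketch of the two slices, to show how much extra audio the proposed one keeps per segment. The helper names and the numeric values are hypothetical and chosen only for illustration; this is not the actual code in pipeline.py.

```python
def trim_original(segment, t_pad_tgt):
    # Slice as currently written: drop t_pad_tgt samples from each end.
    return segment[t_pad_tgt:-t_pad_tgt]


def trim_proposed(segment, t_pad_tgt, window):
    # Proposed slice: drop (t_pad_tgt - window) samples from each end.
    # Assumes t_pad_tgt > window; a pad of 0 would make the slice empty.
    pad = t_pad_tgt - window
    return segment[pad:-pad]


# Hypothetical values, not taken from the project.
T_PAD_TGT = 1000
WINDOW = 160
segment = list(range(5000))

original = trim_original(segment, T_PAD_TGT)
proposed = trim_proposed(segment, T_PAD_TGT, WINDOW)
extra_samples = len(proposed) - len(original)
print(len(original), len(proposed), extra_samples)  # 3000 3320 320
```

With these values the proposed slice keeps 2 * WINDOW more samples per segment than the original one.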

With this change, the difference between input and output length for a 4-minute audio file dropped from 0.04 seconds to 0.002 seconds.
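
For anyone who wants to check the length difference without an audio editor, something like the following should work for uncompressed PCM WAV files. It uses only Python's standard `wave` module; the function names are my own, and the script assumes both files are passed on the command line.

```python
import sys
import wave


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def length_difference_seconds(input_path, output_path):
    """Positive result means the output is shorter than the input."""
    return wav_duration_seconds(input_path) - wav_duration_seconds(output_path)


if __name__ == "__main__" and len(sys.argv) == 3:
    diff = length_difference_seconds(sys.argv[1], sys.argv[2])
    print(f"input is {diff:.4f} s longer than output")
```

Durations are computed per file from its own sample rate, so the input and output may use different rates.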

matthew99a changed the title from "Output alignment issue" to "Minor output alignment issue" on Sep 20, 2024