Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Larger segments as input #13

Open
ivandon15 opened this issue Jul 11, 2024 · 1 comment
Open

Larger segments as input #13

ivandon15 opened this issue Jul 11, 2024 · 1 comment

Comments

@ivandon15
Copy link

Hi DiffLinker Team,

Thank you for your great effort on this.
And I was wondering if the model is possible for sampling linkers for larger molecules (like connecting 2 10-mer peptides).
I tried using existing model (I know it's not appropriate, I just want to check if the model can run without error) , the model raised NanError during sample_p_zs_given_zt_only_linker. Is it because the inputs contains too many atoms?

Thank you for you patient and help!

@ivandon15
Copy link
Author

I might have found the solution.
The DDPM class calls the GCL class, which uses the unsorted_segment_sum function to process node features.
Within this function, there is a normalization_factor. The default value for normalization_factor is set to 100, likely because the original model was applied to small molecules (is that right?).
However, when I use peptides as input, the large number of atoms causes an explosion in the second step of diffusion. When I tried setting the normalization_factor to a larger int the model no longer produced NaN errors.
Right?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant