Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What method was used in the paper to convert the predictions into valid pairs? #34

Open
rebeccaronai opened this issue Jan 18, 2024 · 1 comment

Comments

@rebeccaronai
Copy link

I saw in the paper it states that "we converted the predicted matrix back to valid pairs by merging predictions to chromosomes and counting the discretized intensity value." I also saw that someone raised the issue and it was closed stating it takes a lot of effort. I am wondering if you can specify what method was used in the paper so that it can be replicated by others and FitHiC can be used for loop calling?

@tanjimin
Copy link
Owner

Hi @rebeccaronai , I'll describe the main idea here:

  1. After predicting with C.Origami, you will have a 2mb by 2mb or 256 by 256 bin matrix.
  2. For each pixel at location $i$, $j$ in the 256x256 grid, you can perform exponential transformation ((exp x) - 1) to convert log1p reads to reads $n$.
  3. The reads $n$ you get will be the predicted number of Hi-C valid pairs between location $i$ and $j$. Then you can convert $I$ and $j$ to actual genomic regions and add $n$ lines of $i$ $j$ interaction. You can do this for all regions across the genome then you will end up with a predicted valid pairs file.
  4. You can then run FitHiC on this predicted valid pairs file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants