-
Notifications
You must be signed in to change notification settings - Fork 9
Add a HPOA tab parser #27
Comments
@cmungall I started implementing this in pxftools. Should I move it here? Also, some questions about the format. I have been putting the disease as the cc @drseb |
There are actually two separate TSVs - HPOA and the phenote-HPO TSV. When I wrote the ticket I believe I intended HPOA, but the phenote TSV is now higher priority for webphenote migration. @DoctorBud had the same questions, for verifying that the d2p forms were complete w.r.t phenote functionality, For HPOA, we don't have GeneID, SexID, GenotypeID. Where we have these in phenote, they have only been used historically, and get ignored when @drseb compiles the phenote TSVs into HPOA-TSV. We should do the same for our purposes here. We will have the original phenote TSVs archived if we ever want to go back and use information that has been populated here. |
I saw that @pnrobinson still puts information into gene field. not sure why!? can you explain? I think it is better to pull gene-disease links from other sources (e.g. orphanet) |
Download a pxftools release and you can convert the Phenote TSVs to phenopackets like this:
Creator and date are left out until we add those to the API. |
Sure, it's doable in the Monarch Jenkins. If you want to go ahead with this instance, let me know where to get the source files. |
I've noticed that I'm missing some evidence code translations for IDs used in the files. The HPO TSVs use IDs like IEA, ITM, TAS, ICE, etc. I am using ECO and searching for exact synonyms. But many aren't in there. What is the source for these? |
Aha, I've found many of them in the HPO documentation. But |
I fixed TAE -> TAS. Not sure about TEA. Has been used intentionally and very often. @pnrobinson ? |
If we have a lot of TEA codes, then this must be an odd Phenote Bug. |
Redundant functionality with
monarch-initiative/dipper#277
But useful to have in one library
The text was updated successfully, but these errors were encountered: