Allow for more file types & formatting #3
Comments
What are the most commonly used file formats for this kind of thing? Is it CSV?
Asking around, I'm told that tab-delimited text files and comma-delimited files (Excel) are the ones being used.
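If tab- and comma-delimited files are the two main cases, reading both could look roughly like the sketch below. It only uses Python's standard `csv` module; the function names `detect_delimiter` and `read_rows` are made up for illustration and are not part of this project, and guessing the delimiter from the first line alone is a simplifying assumption.

```python
import csv


def detect_delimiter(first_line):
    """Guess the delimiter by counting tabs and commas in the first line.

    Assumption: the file is either tab- or comma-delimited; ties fall back to comma.
    """
    return "\t" if first_line.count("\t") > first_line.count(",") else ","


def read_rows(text):
    """Split delimited text into rows of fields (hypothetical helper)."""
    lines = text.splitlines()
    if not lines:
        return []
    delimiter = detect_delimiter(lines[0])
    # csv.reader accepts any iterable of strings, so a list of lines works.
    return list(csv.reader(lines, delimiter=delimiter))
```

The same call would then handle either kind of file, e.g. `read_rows(open(path).read())`, without the user having to say which one it is.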
Nice idea. Should be fairly simple to do.
Okay, nice ;)
I also read somewhere that ISA-Tab (http://isa-tools.org/) is the most common file format for metabolomics data.
http://regexr.com/ for advanced users.
Should we still use only one metabolite dataset, or can we (and do we want to) include other kinds of datasets (proteomics, (environmental) chemistry, toxicology)? And do we want to use another dataset to validate the RP?
There are many file types that are commonly used for large sets of metabolites or other compounds. BioPAX, Octave, SciLab, XML are just a few. It would be great to support all of these formats.
Moreover, the formatting of the dataset can vary greatly, and the programme currently only allows for the CAS ID followed by the IUPAC name in brackets. A customisable regex string passed as a method parameter would be a much better way to harvest the data from the source file.
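
A minimal sketch of what such a parameter might look like, in Python. Everything here is an assumption for illustration: `harvest_compounds`, `DEFAULT_PATTERN` and the named groups `cas` and `name` are hypothetical and not taken from the existing code, and "brackets" is assumed to mean round parentheses, as in `50-00-0 (formaldehyde)`.

```python
import re

# Assumed default layout: CAS number followed by the IUPAC name in round
# brackets, e.g. "50-00-0 (formaldehyde)".
DEFAULT_PATTERN = r"(?P<cas>\d{2,7}-\d{2}-\d)\s*\((?P<name>.+)\)"


def harvest_compounds(lines, pattern=DEFAULT_PATTERN):
    """Return (cas, name) tuples for every line the pattern matches.

    `pattern` must define the named groups `cas` and `name`; lines that do
    not match are skipped silently.
    """
    regex = re.compile(pattern)
    results = []
    for line in lines:
        match = regex.search(line)
        if match:
            results.append((match.group("cas"), match.group("name").strip()))
    return results
```

A file with a different layout, say the name first and the CAS ID after a tab, would then only need a different pattern:

```python
rows = harvest_compounds(
    ["ethanol\t64-17-5"],
    pattern=r"(?P<name>[^\t]+)\t(?P<cas>\d{2,7}-\d{2}-\d)",
)
```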