-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
avoid caching on metalnx server? #364
Comments
It's not clear what you mean? |
Meanwhile, I got this to serve as an example: > ls -ltr tmp-ticket-files/
total 5089404
-rw-r----- 1 root root 5211545600 Oct 1 04:51 backup-20160419.tar The base path is I hope this helps to clarify? |
Are you downloading multiple data objects at the same time? |
This is a two-fold yes:
As data objects are potentially huge, a potential solution might not to cache any data and to connect the request to the irods server directly. As for the second item, I do not feel able to assess the feasibility. edit: PS: Thank you for looking into this! |
I'm surprised you mentioned Please provide a set of steps that will allow us to reproduce what you're seeing locally? |
Hi Kory, yes we use tickets in metalnx to grant public access to the data. Before we used the irods-rest, which is not developed any further. In irods-rest there was no disk caching implemented, irods-rest used streams. We now experinece trouble, if large files/folders get downloaded from outside our institution with wget or curl, it downloads the file only partly and stops with Error 500 (Server internal error). This seems to be a timeout problem in metalnx, that the file gets deleted before it is fully downloaded (see attached docker log). If that's the case increasing the internal timeout could help, but I didn't find it. Kind regards, The file from the above log is ~6GB large:
|
Hi,
On our cluster, we use iRODS to store big data (obviously). With the introduction of Metalnx there is a minor issue: all data get cached on the VM running Metalnx, so that any
curl
orwget
accumulates too much on that machine for all request exceeding a few gigabytes.Now, I am not the admin of said VM, but can this be configured away, such that the VM only mediates the request to the iRODS server and does not cache the data in between?
Cheers
Christian
The text was updated successfully, but these errors were encountered: