-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DAG, stderr, stdout logs not being retrieved and displayed in Metaflow UI #81
Comments
Do you see the tasks for each run in the UI? When you go to the DAG tab, what do you see? Do you see stderr, stdout, or cards showing on the task view? |
Have a look at the javascript console? Do you see any errors? Also take a look at the Network tab in your developer tools. Can you see any errors for requests (e.g. to /dag)? If you'd like, you can move this discussion across to the #ask-metaflow slack channel at https://outerbounds-community.slack.com/ |
Can you take a look at the metaflow-service logs? For around the time that the request to Logs look like -
Generally this type of error is because MFGUI can't talk to S3 due to permissions and/or auth. Hopefully we can see some evidence in the metaflow-service logs. |
Description
We've got metaflow and metaflow UI deployed on AWS (on local IP, so not publicly accessible), but the logs aren't being retrieved. There was an initially an issue with our ServiceInfoUI container not having enough memory, but this was upped. The RDS burst balance was also too low, but upping the storage to 1000 GiB removed this queue, and changed the error message to a generic error, so I don't think this is the issue any more.
The RDS is accessible, and appears to be storing the logs. The logs are also available from the relevant S3 buckets, Step Functions and Batch.
I can't find exactly where the UI is trying to pull data from, so not sure whether it's a permissions issue with access to the RDS, but the S3 bucket seems to be accessible. As far as I can see, the permissions/configuration is the same as the metaflow UI CF template, so was interested to know if anyone else had had/is having this issue.
Steps to Reproduce
Expected behavior:
DAG, stderr and stdlog display the error messages being logged in CloudWatch in the UI.
Actual behavior:
Error messages don't appear:
Reproduces how often:
Every time the UI is used. I've previously looked at a public example from Outerbounds, but can't view that at the moment. This one wasn't having the same issue a couple of months ago.
Versions
Application version: 1.1.4
Service version: 2.3.2
My machine: MacOS 12.6
Viewing on Safari: v16.0
The text was updated successfully, but these errors were encountered: