How to reuse S3 File assets? #89

Closed
wangzhihao opened this issue Jul 25, 2020 · 1 comment

Comments

@wangzhihao
Collaborator

wangzhihao commented Jul 25, 2020

We have some S3 file assets, such as metastore-sync.sh and several jars. The jars can be up to 200 MB.

Currently we upload them every time a job is submitted. This has two drawbacks:

  1. It wastes time, since most of the time the files are unchanged.
  2. It wastes storage in the long term, since the uploaded files are neither cleaned up nor overwritten.
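One way to address the first drawback, sketched here under assumptions, is to derive the object key from the file's content hash and skip the upload when that key already exists. The names `content_key`, `upload_if_missing`, and `InMemoryStore` are hypothetical and not part of this project; a real store would presumably wrap an S3 client (for example boto3's `head_object` and `upload_file`).

```python
import hashlib
from pathlib import Path


def content_key(path, prefix="assets"):
    """Build an object key from the file's SHA-256 so identical files share one key."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so a 200 MB jar is never held in memory at once.
        for chunk in iter(lambda: f.read(1024 * 1024), b""):
            digest.update(chunk)
    return f"{prefix}/{digest.hexdigest()}/{Path(path).name}"


class InMemoryStore:
    """Hypothetical stand-in for a bucket, used only to illustrate the flow."""

    def __init__(self):
        self.objects = {}
        self.uploads = 0

    def exists(self, key):
        return key in self.objects

    def put(self, key, path):
        self.objects[key] = Path(path).read_bytes()
        self.uploads += 1


def upload_if_missing(store, path, prefix="assets"):
    """Upload the file only if no object with the same content key exists yet."""
    key = content_key(path, prefix)
    if not store.exists(key):
        store.put(key, path)
    return key
```

With this scheme, submitting the same jar twice results in a single upload, and a changed jar lands under a new key instead of silently replacing the old one.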

We currently use CDK assets for the upload. We cannot model the files as CDK resources, because CDK/CloudFormation only supports the S3 bucket as a resource, not individual objects.

CDK assets allow users to manually specify which assets should not be rebuilt (see link1 and link2), but that is not very helpful to us.
CDK assets also cannot garbage collect themselves (see here).

One promising approach is to use a CloudFormation custom resource to define a resource for an individual S3 object. Here is an example.
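Separately from the linked example, the lifecycle logic of such a custom resource might look roughly like the sketch below. `RequestType`, `ResourceProperties`, and `PhysicalResourceId` are standard fields of a CloudFormation custom resource event; the property names `SourceKey` and `Key`, the `DictStore` class, and the store interface are hypothetical. Returning a dict with `PhysicalResourceId` assumes a wrapper such as the CDK Provider framework sends the response; a plain Lambda-backed custom resource would instead have to send its result to the event's `ResponseURL`.

```python
class DictStore:
    """Hypothetical in-memory stand-in for a bucket."""

    def __init__(self, objects=None):
        self.objects = dict(objects or {})

    def exists(self, key):
        return key in self.objects

    def copy(self, source_key, key):
        self.objects[key] = self.objects[source_key]

    def delete(self, key):
        self.objects.pop(key, None)


def on_event(event, store):
    """Handle one lifecycle event for a resource that represents a single object."""
    request_type = event["RequestType"]

    if request_type in ("Create", "Update"):
        props = event["ResourceProperties"]
        key = props["Key"]  # hypothetical property: the target object key
        if not store.exists(key):
            # Hypothetical property: where the file was staged before deployment.
            store.copy(props["SourceKey"], key)
        # The key doubles as the physical ID, so a changed key reads as a replacement.
        return {"PhysicalResourceId": key}

    if request_type == "Delete":
        physical_id = event["PhysicalResourceId"]
        store.delete(physical_id)
        return {"PhysicalResourceId": physical_id}

    raise ValueError(f"Unsupported RequestType: {request_type}")
```

The design choice that matters here is using the object key as the physical resource ID. When an update returns a different physical ID, CloudFormation treats it as a replacement and later sends a Delete for the old ID, which would clean up the outdated object.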

@wangzhihao
Collaborator Author

We plan to garbage collect the resources via #94.
