Read arbitrary number of directory/file paths from standard input or a file #130

Open
ao-apps opened this issue Feb 28, 2023 · 0 comments
ao-apps commented Feb 28, 2023

Feature Request

We would like rdfind to have an option to read the set of paths from standard input.

This could possibly be the default when no paths are given on the command line.

There could also be a switch that names a file to read the paths from, with - meaning standard input.

By default, paths would be split on newlines. A -0 switch would split on the NUL character instead.

This is comparable to rsync's --files-from and --from0 / -0 options.
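
To make the requested splitting behavior concrete, here is a minimal sketch in Python. It is not rdfind code (rdfind is written in C++), and the names `read_paths`, `nul_separated`, and `chunk_size` are hypothetical, chosen only for illustration. It assumes that empty entries (for example blank lines) should be skipped and that a final path without a trailing separator still counts.

```python
import io
from typing import BinaryIO, Iterator


def read_paths(stream: BinaryIO, nul_separated: bool = False,
               chunk_size: int = 65536) -> Iterator[bytes]:
    """Yield paths from a binary stream, split on newline or NUL.

    Hypothetical helper: paths stay as bytes so that unusual
    filenames survive unchanged.
    """
    sep = b"\0" if nul_separated else b"\n"
    pending = b""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        pending += chunk
        # Everything before the last separator is complete; the rest
        # may be the start of a path that continues in the next chunk.
        *complete, pending = pending.split(sep)
        for path in complete:
            if path:  # assumption: skip empty entries such as blank lines
                yield path
    if pending:  # final path without a trailing separator
        yield pending


if __name__ == "__main__":
    # Stand-in for standard input; a real tool would pass sys.stdin.buffer.
    demo = io.BytesIO(b"build/a.jar\0build/b.jar\0")
    for p in read_paths(demo, nul_separated=True):
        print(p.decode())
```

Reading the stream in chunks keeps memory use flat no matter how many paths arrive, which is the point of avoiding the command-line length limit. NUL separation matters because filenames may legally contain newlines.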

Motivation

We are de-duplicating specific build artifacts within a large set of Jenkins/Maven build directories. This results in more files than can be passed to a single command via xargs, even with --max-chars set to its maximum value. xargs therefore runs rdfind multiple times, and since each invocation only sees its own subset of the files, de-duplication is incomplete.

We are not able to safely de-duplicate entire directories, only specific filename patterns. For example, we can safely de-duplicate timestamped snapshots, which are never overwritten. However, the *-SNAPSHOT.{aar,jar,war,…} files cannot be de-duplicated since they are overwritten in-place by Maven.
