GitHub - svenwiltink/sparsecat: CLI tool that reduces bandwidth usage when transmitting sparse files.

SparseCat

Goal

Skip the holes in sparse files when transmitting large files over the network. Holes can be detected using the filesystem's seek capabilities. Instead of transmitting these zero bytes and wasting precious bandwidth, only the sections of the file that contain data are sent.

Example usage

# create sparse image
truncate -s150G image.raw

# add some random data to the sparse file
dd if=/dev/urandom bs=4M count=10 conv=notrunc seek=30 of=image.raw

# send the sparse file and reconstruct it on the other host. The amount
# of data transmitted will only be 40MB instead of 150G
sparsecat -if image.raw | pv | ssh GLaDOS sparsecat -r -of image.raw

But how does it work?

Sparsecat uses the SEEK_HOLE and SEEK_DATA capabilities of lseek on Linux. See the lseek(2) man page for more information. Before sending the data inside a file, Sparsecat creates a small header containing the size of the source file. The data sections follow this header. Each section consists of the offset in the target file and the length of the data section, followed by the data itself. The wire format is identical to that of ceph rbd export-diff.

When receiving a Sparsecat stream, the Decoder detects whether the target is an *os.File. If it is, and the file is capable of seeking, a fast path is used and the sparseness of the target file is preserved. When the target is not a file, such as an io.Copy to a buffer, Sparsecat pads the output with zero bytes, as if it were outputting the entire file.

