-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add stac-geoparquet and stac-arrow #287
Conversation
aab7827
to
49bf5ba
Compare
49bf5ba
to
f1219de
Compare
@kylebarron redirecting your attention here (if you want). Pending a couple of fixups, I've got a working to/from stac-geoparquet converter: cargo install stac-cli
stac translate in.json out.parquet
stac translate out.parquet back.json And in Rust: let file = std::fs::File::open("data.parquet").unwrap();
let item_collection = stac_geoparquet::from_reader(file).unwrap();
let file = std::fs::File::create("out.parquet").unwrap();
stac_geoparquet::to_writer(file, item_collection.into()).unwrap(); Rough benchmarking (same procedure as in #256 (comment)) indicates that we're ~50% faster than the simplest Python converter: $ hyperfine --warmup 3 'target/release/stac translate ~/Desktop/longmont.json ~/Desktop/longmont-rs.parquet' 'python benchmark.py'
Benchmark 1: target/release/stac translate ~/Desktop/longmont.json ~/Desktop/longmont-rs.parquet
Time (mean ± σ): 1.114 s ± 0.020 s [User: 0.918 s, System: 0.177 s]
Range (min … max): 1.095 s … 1.157 s 10 runs
Benchmark 2: python benchmark.py
Time (mean ± σ): 1.736 s ± 0.086 s [User: 1.475 s, System: 0.212 s]
Range (min … max): 1.657 s … 1.925 s 10 runs
Summary
target/release/stac translate ~/Desktop/longmont.json ~/Desktop/longmont-rs.parquet ran
1.56 ± 0.08 times faster than python benchmark.py Caveats:
|
f1219de
to
468104e
Compare
Did you hit specific issues with latest main of geoarrow-rs, or just that it requires some changes to your code? |
468104e
to
262b2e1
Compare
Yeah, I hadn't opened an issue yet, but it looks like the |
Ah yes good catch. If you can create an issue to track that would be helpful |
Done, geoarrow/geoarrow-rs#682 |
Co-authored-by: Kyle Barron <kylebarron2@gmail.com>
d785499
to
9fe4a85
Compare
Closes
Description
A re-start of the arrow/parquet work. geoarrow-rs seems to be in a bit of flux (especially w.r.t. 2 vs. 3 dimensions) so we're on a sorta-random SHA right now there.
Checklist
cargo fmt
)cargo test