Replies: 1 comment 2 replies
-
The readers automatically perform IO pushdown, they will only fetch the byte ranges needed, this includes column projection, and extends through to row group and page pruning, late materialization, etc... The readers aim to be batteries included, you shouldn't need to worry about pages, column chunks, etc... it will just do the right thing |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Can read data from the provided column indexes.But the entire parquet data needs to be provided.
Can we directly read individual column? Only transmit relevant data for the column?
FYI, in arrow2 can only read column data.
PageReader can consume all pages.
Beta Was this translation helpful? Give feedback.
All reactions