-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for parquet_writer_version
session property
#10573
Conversation
✅ Deploy Preview for meta-velox canceled.
|
c422238
to
7c619ce
Compare
velox/dwio/parquet/writer/Writer.cpp
Outdated
auto parquetDataPageVersionString = | ||
getParquetDataPageVersion(config, kParquetSessionDataPageVersion); | ||
|
||
if (parquetDataPageVersionString == "PARQUET_1_0") { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe having a helper method with a switch to do the mapping, and throwing in case you get an invalid value instead of defaulting to v2?
parquetDataPageVersionString = toParquetDataPage(config);
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see what you mean, but also realized I could just be doing the mapping in getParquetDataPageVersion()
now, in combination with validation, and directly return a arrow::ParquetDataPageVersion
value. Let me know if that looks ok.
ec3c78b
to
3bec581
Compare
Reopened due to conflicts and rebasing - #11151 |
Allow the Presto session property
parquet_writer_version
, which is currently ignored by Velox, to toggle the parquet writer datapage version (V1 or V2). The value can be set as a session property or can be provided in the Hive config. Defaults to V2.