You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When utilizing Velox to read data from Spark, we've observed that certain data types are not represented identically between Spark and Parquet files. This discrepancy results in a runtime error when the data returned by the Parquet reader differs from what Spark anticipates. We've identified the following types as problematic:
u8 -> i16
u16 -> i32
u32 -> i64
u64 -> decimal(20, 0)
DateType ignores rebaseMode conf
TimeStampType ignores rebaseMode conf
For instance, while reading columns through Velox, Gluten creates a Velox scan node based on the format expected by Spark. However, due to the incompatible data representation, an error arises as exemplified by the following log:
The conversations done by Spark here sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/ParquetVectorUpdaterFactory.java needs to be honored.
…ckend (oap-project#355)
* [CH-70][CH-82][CH-69] Upgrade spark 3.2.2 and delta for ClickHouse Backend
1. Upgrade Spark 3.2.2
2. Upgrade Delta
3. Support AQE for ClickHouse Backend
* Add the parameters for clickhouse backend ut
Bug description
When utilizing Velox to read data from Spark, we've observed that certain data types are not represented identically between Spark and Parquet files. This discrepancy results in a runtime error when the data returned by the Parquet reader differs from what Spark anticipates. We've identified the following types as problematic:
For instance, while reading columns through Velox, Gluten creates a Velox scan node based on the format expected by Spark. However, due to the incompatible data representation, an error arises as exemplified by the following log:
This error states that a BIGINT type was returned for the field n0_0 at position 0, while a DECIMAL(20,0) was expected.
System information
Velox System Info v0.0.2
Commit: 3fa47086d3f81927a21d4da2cabff40bfd73331c
CMake Version: 3.25.2
System: Linux-5.4.0-1109-azure
Arch: x86_64
C++ Compiler: /usr/bin/c++
C++ Compiler Version: 9.4.0
C Compiler: /usr/bin/cc
C Compiler Version: 9.4.0
CMake Prefix Path: /usr/local;/usr;/;/usr;/usr/local;/us
Relevant logs
No response
The text was updated successfully, but these errors were encountered: