Skip to content

kedro-datasets-1.8.0

Compare
Choose a tag to compare
@merelcht merelcht released this 24 Oct 10:30
· 240 commits to main since this release
0c1f472

Major features and improvements

  • Moved PartitionedDataSet and IncrementalDataSet from the core Kedro repo to kedro-datasets and renamed to PartitionedDataset and IncrementalDataset.
  • Added polars.LazyPolarsDataset, a GenericDataSet using polars's Lazy API.
  • Renamed polars.GenericDataSet to polars.EagerPolarsDataset to better reflect the difference between the two dataset classes.
  • Added a deprecation warning when using polars.GenericDataSet or polars.GenericDataset that these have been renamed to polars.EagerPolarsDataset
  • Delayed backend connection for pandas.SQLTableDataset, pandas.SQLQueryDataset, and snowflake.SnowparkTableDataset. In practice, this means that a dataset's connection details aren't used (or validated) until the dataset is accessed. On the plus side, the cost of connection isn't incurred regardless of when or whether the dataset is used.

Bug fixes and other changes

  • Fix erroneous warning when using an cloud protocol file path with SparkDataSet on Databricks.
  • Updated PickleDataset to explicitly mention cloudpickle support.

Community contributions

Many thanks to the following Kedroids for contributing PRs to this release: