kedro-datasets-1.8.0
merelcht
released this
24 Oct 10:30
·
240 commits
to main
since this release
Major features and improvements
- Moved
PartitionedDataSet
andIncrementalDataSet
from the core Kedro repo tokedro-datasets
and renamed toPartitionedDataset
andIncrementalDataset
. - Added
polars.LazyPolarsDataset
, aGenericDataSet
using polars's Lazy API. - Renamed
polars.GenericDataSet
topolars.EagerPolarsDataset
to better reflect the difference between the two dataset classes. - Added a deprecation warning when using
polars.GenericDataSet
orpolars.GenericDataset
that these have been renamed topolars.EagerPolarsDataset
- Delayed backend connection for
pandas.SQLTableDataset
,pandas.SQLQueryDataset
, andsnowflake.SnowparkTableDataset
. In practice, this means that a dataset's connection details aren't used (or validated) until the dataset is accessed. On the plus side, the cost of connection isn't incurred regardless of when or whether the dataset is used.
Bug fixes and other changes
- Fix erroneous warning when using an cloud protocol file path with SparkDataSet on Databricks.
- Updated
PickleDataset
to explicitly mentioncloudpickle
support.
Community contributions
Many thanks to the following Kedroids for contributing PRs to this release: