Releases: alteryx/evalml
v0.31.0 Aug. 19, 2021
Enhancements
- Updated the high variance check in AutoMLSearch to be robust to a variety of objectives and cv scores #2622
- Use Woodwork's outlier detection for the `OutliersDataCheck` #2637
- Added ability to utilize instantiated components when creating a pipeline #2643
- Sped up the all-NaN and unknown check in `infer_feature_types` #2661
Fixes
Changes
- Deleted `_put_into_original_order` helper function #2639
- Refactored time series pipeline code using a time series pipeline base class #2649
- Renamed `dask_tests` to `parallel_tests` #2657
- Removed commented out code in `pipeline_meta.py` #2659
Documentation Changes
- Add complete install command to README and Install section #2627
Testing Changes
Breaking Changes
- `TimeSeriesRegressionPipeline` no longer inherits from `TimeSeriesRegressionPipeline` #2649
v0.30.2 Aug. 16, 2021
Fixes
- Updated changelog and version numbers to match the release. Release 0.30.1 was released erroneously without a change to the version numbers. 0.30.2 replaces it.
v0.30.1 Aug. 13, 2021
Enhancements
- Added `DatetimeFormatDataCheck` for time series problems #2603
- Added `ProphetRegressor` to estimators #2242
- Updated `ComponentGraph` to handle not calling samplers' transform during predict, and updated samplers' transform methods such that `fit_transform` is equivalent to `fit(X, y).transform(X, y)` #2583
- Updated `ComponentGraph` `_validate_component_dict` logic to be stricter about input values #2599
- Patched bug in `xgboost` estimators where predicting on a feature matrix of only booleans would throw an exception. #2602
- Updated `ARIMARegressor` to use relative forecasting to predict values #2613
- Updated to support Woodwork 0.5.1 #2610
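
The `fit_transform` equivalence noted in #2583 can be illustrated with a toy sampler. This is only a sketch: `ToyUndersampler` is a hypothetical stand-in written for this example, not an EvalML class, and it uses plain Python lists rather than Woodwork or pandas data.

```python
class ToyUndersampler:
    """Hypothetical sampler (not part of EvalML): keeps every `step`-th row."""

    def __init__(self, step=2):
        self.step = step
        self._keep = None

    def fit(self, X, y):
        # Record which row positions to keep, based on the training data.
        self._keep = list(range(0, len(X), self.step))
        return self

    def transform(self, X, y=None):
        # Apply the recorded row selection to both features and target.
        X_out = [X[i] for i in self._keep]
        y_out = [y[i] for i in self._keep] if y is not None else None
        return X_out, y_out

    def fit_transform(self, X, y):
        # Defined so that it is exactly fit(X, y).transform(X, y).
        return self.fit(X, y).transform(X, y)


X = [[1], [2], [3], [4], [5]]
y = [0, 1, 0, 1, 0]

one_step = ToyUndersampler().fit_transform(X, y)
two_step = ToyUndersampler().fit(X, y).transform(X, y)
```

Both calls return the same resampled features and target, which is the property the release note describes.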
Fixes
- Updated `get_best_sampler_for_data` to consider all non-numeric datatypes as categorical for SMOTE #2590
- Fixed inconsistent test results from `TargetDistributionDataCheck` #2608
- Adopted vectorized pd.NA checking for Woodwork 0.5.1 support #2626
Changes
- Renamed SMOTE samplers to SMOTE oversampler #2595
- Changed `partial_dependence` and `graph_partial_dependence` to raise a `PartialDependenceError` instead of `ValueError`. This is not a breaking change because `PartialDependenceError` is a subclass of `ValueError` #2604
- Cleaned up code duplication in `ComponentGraph` #2612
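
The reasoning behind #2604 (raising a subclass does not break existing `except ValueError` handlers) can be sketched in plain Python. The exception class below is a local stand-in defined for illustration, and `partial_dependence_stub` is a hypothetical function; neither is the library's actual implementation.

```python
class PartialDependenceError(ValueError):
    """Local stand-in for the library's exception, defined only for this example."""


def partial_dependence_stub(feature):
    # Hypothetical function that mimics raising the new, more specific error.
    raise PartialDependenceError(f"cannot compute partial dependence for {feature!r}")


def call_with_legacy_handler(feature):
    # Code written before the change catches ValueError; because the new
    # exception subclasses ValueError, this handler still catches it.
    try:
        partial_dependence_stub(feature)
    except ValueError as err:
        return f"handled: {err}"
    return "no error"


result = call_with_legacy_handler("age")
```

New code can catch `PartialDependenceError` specifically, while older handlers keep working unchanged.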
Documentation Changes
- To avoid local docs build error, only add warning disable and download headers on ReadTheDocs builds, not locally #2617
Testing Changes
- Changed the lint CI job to only check against python 3.9 via the `-t` flag #2586
- Installed Prophet in linux nightlies test and fixed `test_all_components` #2598
- Refactored and fixed all `make_pipeline` tests to assert correct order and address new Woodwork Unknown type inference #2572
- Removed `component_graphs` as a global variable in `test_component_graphs.py` #2609
Breaking Changes
- Renamed SMOTE samplers to SMOTE oversampler. Please use `SMOTEOversampler`, `SMOTENCOversampler`, `SMOTENOversampler` instead of `SMOTESampler`, `SMOTENCSampler`, and `SMOTENSampler` #2595
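
When updating code or configuration that refers to the old class names, the rename in #2595 amounts to a simple lookup. The helper below is a hypothetical sketch (`migrate_component_names` is not an EvalML function); only the old and new names themselves come from the release note.

```python
# Old class name -> new class name, as listed in the breaking change above.
RENAMED_SAMPLERS = {
    "SMOTESampler": "SMOTEOversampler",
    "SMOTENCSampler": "SMOTENCOversampler",
    "SMOTENSampler": "SMOTENOversampler",
}


def migrate_component_names(names):
    """Hypothetical helper: replace old sampler class names, leave others as-is."""
    return [RENAMED_SAMPLERS.get(name, name) for name in names]


migrated = migrate_component_names(["Imputer", "SMOTENCSampler", "SMOTESampler"])
```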
v0.30.0 Aug. 4, 2021
Enhancements
- Added `LogTransformer` and `TargetDistributionDataCheck` #2487
- Issue a warning to users when a pipeline parameter passed in isn't used in the pipeline #2564
- Added Gini coefficient as an objective #2544
- Added `repr` to `ComponentGraph` #2565
- Added components to extract features from `URL` and `EmailAddress` Logical Types #2550
- Added support for `NaN` values in `TextFeaturizer` #2532
- Added `SelectByType` transformer #2531
- Added separate thresholds for percent null rows and columns in `HighlyNullDataCheck` #2562
- Added support for `NaN` natural language values #2577
Fixes
- Raised error message for types `URL`, `NaturalLanguage`, and `EmailAddress` in `partial_dependence` #2573
Changes
- Updated `PipelineBase` implementation for creating pipelines from a list of components #2549
- Moved `get_hyperparameter_ranges` to `PipelineBase` class from automl/utils module #2546
- Renamed `ComponentGraph`'s `get_parents` to `get_inputs` #2540
- Removed `ComponentGraph.linearized_component_graph` and `ComponentGraph.from_list` #2556
- Updated `ComponentGraph` to enforce requiring `.x` and `.y` inputs for each component in the graph #2563
Documentation Changes
- Added documentation for `DaskEngine` and `CFEngine` parallel engines #2560
- Improved detail of `TextFeaturizer` docstring and tutorial #2568
Testing Changes
- Added test that makes sure `split_data` does not shuffle for time series problems #2552
Breaking Changes
- Moved `get_hyperparameter_ranges` to `PipelineBase` class from automl/utils module #2546
- Renamed `ComponentGraph`'s `get_parents` to `get_inputs` #2540
- Removed `ComponentGraph.linearized_component_graph` and `ComponentGraph.from_list` #2556
- Updated `ComponentGraph` to enforce requiring `.x` and `.y` inputs for each component in the graph #2563
v0.29.0 Jul. 22, 2021
Enhancements
- Updated 1-way partial dependence support for datetime features #2454
- Added details on how to fix error caused by broken ww schema #2466
- Added ability to use built-in pickle for saving AutoMLSearch #2463
- Updated our components and component graphs to use latest features of ww 0.4.1, e.g. `concat_columns` and drop in-place. #2465
- Added new, concurrent.futures based engine for parallel AutoML #2506
- Added support for new Woodwork `Unknown` type in AutoMLSearch #2477
- Updated our components with an attribute that describes if they modify features or targets and can be used in list API for pipeline initialization #2504
- Updated `ComponentGraph` to accept X and y as inputs #2507
- Removed unused `TARGET_BINARY_INVALID_VALUES` from `DataCheckMessageCode` enum and fixed formatting of objective documentation #2520
Fixes
- Fixed `FraudCost` objective and reverted threshold optimization method for binary classification to `Golden` #2450
- Added custom exception message for partial dependence on features with scales that are too small #2455
- Ensures the typing for Ordinal and Datetime ltypes are passed through _retain_custom_types_and_initalize_woodwork #2461
- Updated to work with Pandas 1.3.0 #2442
- Updated to work with sktime 0.7.0 #2499
Changes
- Updated XGBoost dependency to `>=1.4.2` #2484, #2498
- Added a `DeprecationWarning` about deprecating the list API for `ComponentGraph` #2488
- Updated `make_pipeline` for AutoML to create dictionaries, not lists, to initialize pipelines #2504
- No longer installing graphviz on windows in our CI pipelines because release 0.17 breaks windows 3.7 #2516
Documentation Changes
- Moved docstrings from `__init__` to class pages, added missing docstrings for missing classes, and updated missing default values #2452
- Build documentation with sphinx-autoapi #2458
- Change `autoapi_ignore` to only ignore files in `evalml/tests/*` #2530
Testing Changes
- Fixed flaky dask tests #2471
- Removed shellcheck action from `build_conda_pkg` action #2514
- Added a tmp_dir fixture that deletes its contents after tests run #2505
- Added a test that makes sure all pipelines in `AutoMLSearch` get the same data splits #2513
- Condensed warning output in test logs #2521
Breaking Changes
- `NaN` values in the `Natural Language` type are no longer supported by the Imputer with the pandas upgrade. #2477
v0.28.0 Jul. 2, 2021
Enhancements
- Added support for showing an Individual Conditional Expectations plot when graphing Partial Dependence #2386
- Exposed `thread_count` for Catboost estimators as `n_jobs` parameter #2410
- Updated Objectives API to allow for sample weighting #2433
Fixes
- Deleted unreachable line from `IterativeAlgorithm` #2464
Changes
- Pinned Woodwork version between 0.4.1 and 0.4.2 #2460
- Updated psutil minimum version in requirements #2438
- Updated `log_error_callback` to not include filepath in logged message #2429
Documentation Changes
Testing Changes
v0.27.0 Jun. 23, 2021
Enhancements
- Adds force plots for prediction explanations #2157
- Removed self-reference from `AutoMLSearch` #2304
- Added support for nonlinear pipelines for `generate_pipeline_code` #2332
- Added `inverse_transform` method to pipelines #2256
- Add optional automatic update checker #2350
- Added `search_order` to `AutoMLSearch`'s `rankings` and `full_rankings` tables #2345
- Updated threshold optimization method for binary classification #2315
- Updated demos to pull data from S3 instead of including demo data in package #2387
- Upgrade woodwork version to v0.4.1 #2379
Fixes
- Preserve user-specified woodwork types throughout pipeline fit/predict #2297
- Fixed `ComponentGraph` appending target to `final_component_features` if there is a component that returns both X and y #2358
- Fixed partial dependence graph method failing on multiclass problems when the class labels are numeric #2372
- Added `thresholding_objective` argument to `AutoMLSearch` for binary classification problems #2320
- Added change for `k_neighbors` parameter in SMOTE Oversamplers to automatically handle small samples #2375
- Changed naming for `Logistic Regression Classifier` file #2399
- Pinned pytest-timeout to fix minimum dependence checker #2425
- Replaced `Elastic Net Classifier` base class with `Logistic Regression` to avoid `NaN` outputs #2420
Changes
- Cleaned up `PipelineBase`'s `component_graph` and `_component_graph` attributes. Updated `PipelineBase` `__repr__` and added `__eq__` for `ComponentGraph` #2332
- Added and applied `black` linting package to the EvalML repo in place of `autopep8` #2306
- Separated `custom_hyperparameters` from pipelines and added them as an argument to `AutoMLSearch` #2317
- Replaced `allowed_pipelines` with `allowed_component_graphs` #2364
- Removed private method `_compute_features_during_fit` from `PipelineBase` #2359
- Updated `compute_order` in `ComponentGraph` to be a read-only property #2408
- Unpinned PyZMQ version in requirements.txt #2389
- Uncapping LightGBM version in requirements.txt #2405
- Updated minimum version of plotly #2415
- Removed `SensitivityLowAlert` objective from core objectives #2418
Documentation Changes
- Fixed lead scoring weights in the demos documentation #2315
- Fixed start page code and description dataset naming discrepancy #2370
Testing Changes
- Update minimum unit tests to run on all pull requests #2314
- Pass token to authorize uploading of codecov reports #2344
- Add `pytest-timeout`. All tests that run longer than 6 minutes will fail. #2374
- Separated the dask tests out into separate github action jobs to isolate dask failures. #2376
- Refactored dask tests #2377
- Added the combined dask/non-dask unit tests back and renamed the dask only unit tests. #2382
- Sped up unit tests and split into separate jobs #2365
- Change CI job names, run lint for python 3.9, run nightlies on python 3.8 at 3am EST #2395 #2398
- Set fail-fast to false for CI jobs that run for PRs #2402
Breaking Changes
- `AutoMLSearch` will accept `allowed_component_graphs` instead of `allowed_pipelines` #2364
- Removed `PipelineBase`'s `_component_graph` attribute. Updated `PipelineBase` `__repr__` and added `__eq__` for `ComponentGraph` #2332
- `pipeline_parameters` will no longer accept `skopt.space` variables since hyperparameter ranges will now be specified through `custom_hyperparameters` #2317
v0.25.0 Jun. 2, 2021
Enhancements
- Upgraded minimum woodwork to version 0.3.1. Previous versions will not be supported #2181
- Added a new callback parameter for `explain_predictions_best_worst` #2308
Fixes
Changes
- Deleted the `return_pandas` flag from our demo data loaders #2181
Documentation Changes
Testing Changes
- Ignoring `test_saving_png_file` while building conda package #2323
Breaking Changes
v0.24.2 May. 24, 2021
Enhancements
- Added oversamplers to AutoMLSearch #2213 #2286
- Added dictionary input functionality for `Undersampler` component #2271
- Changed the default parameter values for `Elastic Net Classifier` and `Elastic Net Regressor` #2269
Fixes
- Set default `n_jobs` to 1 for `StackedEnsembleClassifier` and `StackedEnsembleRegressor` until fix for text-based parallelism in sklearn stacking can be found #2295
Changes
- Updated `start_iteration_callback` to accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290
Documentation Changes
Testing Changes
- Use codecov action to update coverage reports #2238
- Removed MarkupSafe dependency version pin from requirements.txt and moved instead into RTD docs build CI #2261
Breaking Changes
- Updated `start_iteration_callback` to accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290
v0.24.1 May. 17, 2021
Enhancements
- Integrated `ARIMARegressor` into AutoML #2009
- Updated `HighlyNullDataCheck` to also perform a null row check #2222
- Set `max_depth` to 1 in calls to featuretools dfs #2231
Fixes
- Removed data splitter sampler calls during training #2253
- Set minimum required version for pyzmq, colorama, and docutils #2254
- Changed BaseSampler to return None instead of y #2272
Changes
- Updated pipeline `repr()` and `generate_pipeline_code` to return pipeline instances without generating custom pipeline class #2227
Documentation Changes
- Capped Sphinx version under 4.0.0 #2244
Testing Changes
- Change number of cores for pytest from 4 to 2 #2266
- Add minimum dependency checker to generate minimum requirement files #2267