19 Aug 20:05

chukarsten

ed74174

v0.31.0

v0.31.0 Aug. 19, 2021

Enhancements

Updated the high variance check in AutoMLSearch to be robust to a variety of objectives and cv scores #2622
Use Woodwork's outlier detection for the OutliersDataCheck #2637
Added ability to utilize instantiated components when creating a pipeline #2643
Sped up the all Nan and unknown check in infer_feature_types #2661

Fixes

Changes

Deleted _put_into_original_order helper function #2639
Refactored time series pipeline code using a time series pipeline base class #2649
Renamed dask_tests to parallel_tests #2657
Removed commented out code in pipeline_meta.py #2659

Documentation Changes

Add complete install command to README and Install section #2627

Testing Changes

Breaking Changes

TimeSeriesRegressionPipeline no longer inherits from TimeSeriesRegressionPipeline #2649

Assets 2

16 Aug 20:15

chukarsten

v0.30.2

2b0f42e

v0.30.2

v0.30.2 Aug. 16, 2021

Fixes

Updated changelog and version numbers to match the release. Release 0.30.1 was released erroneously without a change to the version numbers. 0.30.2 replaces it.

Assets 2

13 Aug 21:13

chukarsten

v0.30.1

948e3b4

v0.30.1

v0.30.1 Aug. 13, 2021

Enhancements

Added DatetimeFormatDataCheck for time series problems #2603
Added ProphetRegressor to estimators #2242
Updated ComponentGraph to handle not calling samplers' transform during predict, and updated samplers' transform methods s.t. fit_transform is equivalent to fit(X, y).transform(X, y) #2583
Updated ComponentGraph _validate_component_dict logic to be stricter about input values #2599
Patched bug in xgboost estimators where predicting on a feature matrix of only booleans would throw an exception. #2602
Updated ARIMARegressor to use relative forecasting to predict values #2613
Updated to support Woodwork 0.5.1 #2610

Fixes

Updated get_best_sampler_for_data to consider all non-numeric datatypes as categorical for SMOTE #2590
Fixed inconsistent test results from TargetDistributionDataCheck #2608
Adopted vectorized pd.NA checking for Woodwork 0.5.1 support #2626

Changes

Renamed SMOTE samplers to SMOTE oversampler #2595
Changed partial_dependence and graph_partial_dependence to raise a PartialDependenceError instead of ValueError. This is not a breaking change because PartialDependenceError is a subclass of ValueError #2604
Cleaned up code duplication in ComponentGraph #2612

Documentation Changes

To avoid local docs build error, only add warning disable and download headers on ReadTheDocs builds, not locally #2617

Testing Changes

Changed the lint CI job to only check against python 3.9 via the -t flag #2586
Installed Prophet in linux nightlies test and fixed test_all_components #2598
Refactored and fixed all make_pipeline tests to assert correct order and address new Woodwork Unknown type inference #2572
Removed component_graphs as a global variable in test_component_graphs.py #2609

Breaking Changes

Renamed SMOTE samplers to SMOTE oversampler. Please use SMOTEOversampler, SMOTENCOversampler, SMOTENOversampler instead of SMOTESampler, SMOTENCSampler, and SMOTENSampler #2595

Assets 2

04 Aug 22:12

chukarsten

v0.30.0

455210b

v0.30.0

v0.30.0 Aug. 4, 2021

Enhancements

Added LogTransformer and TargetDistributionDataCheck #2487
Issue a warning to users when a pipeline parameter passed in isn't used in the pipeline #2564
Added Gini coefficient as an objective #2544
Added repr to ComponentGraph #2565
Added components to extract features from URL and EmailAddress Logical Types #2550
Added support for NaN values in TextFeaturizer #2532
Added SelectByType transformer #2531
Added separate thresholds for percent null rows and columns in HighlyNullDataCheck #2562
Added support for NaN natural language values #2577

Fixes

Raised error message for types URL, NaturalLanguage, and EmailAddress in partial_dependence #2573

Changes

Updated PipelineBase implementation for creating pipelines from a list of components #2549
Moved get_hyperparameter_ranges to PipelineBase class from automl/utils module #2546
Renamed ComponentGraph's get_parents to get_inputs #2540
Removed ComponentGraph.linearized_component_graph and ComponentGraph.from_list #2556
Updated ComponentGraph to enforce requiring .x and .y inputs for each component in the graph #2563

Documentation Changes

Added documentation for DaskEngine and CFEngine parallel engines #2560
Improved detail of TextFeaturizer docstring and tutorial #2568

Testing Changes

Added test that makes sure split_data does not shuffle for time series problems #2552

Breaking Changes

Moved get_hyperparameter_ranges to PipelineBase class from automl/utils module #2546
Renamed ComponentGraph's get_parents to get_inputs #2540
Removed ComponentGraph.linearized_component_graph and ComponentGraph.from_list #2556
Updated ComponentGraph to enforce requiring .x and .y inputs for each component in the graph #2563

Assets 2

22 Jul 13:56

chukarsten

v0.29.0

5f22f03

v0.29.0

v0.29.0 Jul. 22, 2021

Enhancements

Updated 1-way partial dependence support for datetime features #2454
Added details on how to fix error caused by broken ww schema #2466
Added ability to use built-in pickle for saving AutoMLSearch #2463
Updated our components and component graphs to use latest features of ww 0.4.1, e.g. concat_columns and drop in-place. #2465
Added new, concurrent.futures based engine for parallel AutoML #2506
Added support for new Woodwork Unknown type in AutoMLSearch #2477
Updated our components with an attribute that describes if they modify features or targets and can be used in list API for pipeline initialization #2504
Updated ComponentGraph to accept X and y as inputs #2507
Removed unused TARGET_BINARY_INVALID_VALUES from DataCheckMessageCode enum and fixed formatting of objective documentation #2520

Fixes

Fixed FraudCost objective and reverted threshold optimization method for binary classification to Golden #2450
Added custom exception message for partial dependence on features with scales that are too small #2455
Ensures the typing for Ordinal and Datetime ltypes are passed through _retain_custom_types_and_initalize_woodwork #2461
Updated to work with Pandas 1.3.0 #2442
Updated to work with sktime 0.7.0 #2499

Changes

Updated XGBoost dependency to >=1.4.2 #2484, #2498
Added a DeprecationWarning about deprecating the list API for ComponentGraph #2488
Updated make_pipeline for AutoML to create dictionaries, not lists, to initialize pipelines #2504
No longer installing graphviz on windows in our CI pipelines because release 0.17 breaks windows 3.7 #2516

Documentation Changes

Moved docstrings from __init__ to class pages, added missing docstrings for missing classes, and updated missing default values #2452
Build documentation with sphinx-autoapi #2458
Change autoapi_ignore to only ignore files in evalml/tests/* #2530

Testing Changes

Fixed flaky dask tests #2471
Removed shellcheck action from build_conda_pkg action #2514
Added a tmp_dir fixture that deletes its contents after tests run #2505
Added a test that makes sure all pipelines in AutoMLSearch get the same data splits #2513
Condensed warning output in test logs #2521

Breaking Changes

NaN values in the Natural Language type are no longer supported by the Imputer with the pandas upgrade. #2477

Assets 2

02 Jul 22:14

dsherry

v0.28.0

a3aa403

v0.28.0

v0.28.0 Jul. 2, 2021

Enhancements

Added support for showing a Individual Conditional Expectations plot when graphing Partial Dependence #2386
Exposed thread_count for Catboost estimators as n_jobs parameter #2410
Updated Objectives API to allow for sample weighting #2433

Fixes

Deleted unreachable line from IterativeAlgorithm #2464

Changes

Pinned Woodwork version between 0.4.1 and 0.4.2 #2460
Updated psutils minimum version in requirements #2438
Updated log_error_callback to not include filepath in logged message #2429

Documentation Changes

Sped up docs #2430
Removed mentions of DataTable and DataColumn from the docs #2445

Testing Changes

Added slack integration for nightlies tests #2436
Changed build_conda_pkg CI job to run only when dependencies are updates #2446
Updated workflows to store pytest runtimes as test artifacts #2448
Added AutoMLTestEnv test fixture for making it easy to mock automl tests #2406

Assets 2

23 Jun 14:35

chukarsten

v0.27.0

fbbc8b3

v0.27.0

v0.27.0 Jun. 23, 2021

Enhancements

Adds force plots for prediction explanations #2157
Removed self-reference from AutoMLSearch #2304
Added support for nonlinear pipelines for generate_pipeline_code #2332
Added inverse_transform method to pipelines #2256
Add optional automatic update checker #2350
Added search_order to AutoMLSearch's rankings and full_rankings tables #2345
Updated threshold optimization method for binary classification #2315
Updated demos to pull data from S3 instead of including demo data in package #2387
Upgrade woodwork version to v0.4.1 #2379

Fixes

Preserve user-specified woodwork types throughout pipeline fit/predict #2297
Fixed ComponentGraph appending target to final_component_features if there is a component that returns both X and y #2358
Fixed partial dependence graph method failing on multiclass problems when the class labels are numeric #2372
Added thresholding_objective argument to AutoMLSearch for binary classification problems #2320
Added change for k_neighbors parameter in SMOTE Oversamplers to automatically handle small samples #2375
Changed naming for Logistic Regression Classifier file #2399
Pinned pytest-timeout to fix minimum dependence checker #2425
Replaced Elastic Net Classifier base class with Logistsic Regression to avoid NaN outputs #2420

Changes

Cleaned up PipelineBase's component_graph and _component_graph attributes. Updated PipelineBase __repr__ and added __eq__ for ComponentGraph #2332
Added and applied black linting package to the EvalML repo in place of autopep8 #2306
Separated custom_hyperparameters from pipelines and added them as an argument to AutoMLSearch #2317
Replaced allowed_pipelines with allowed_component_graphs #2364
Removed private method _compute_features_during_fit from PipelineBase #2359
Updated compute_order in ComponentGraph to be a read-only property #2408
Unpinned PyZMQ version in requirements.txt #2389
Uncapping LightGBM version in requirements.txt #2405
Updated minimum version of plotly #2415
Removed SensitivityLowAlert objective from core objectives #2418

Documentation Changes

Fixed lead scoring weights in the demos documentation #2315
Fixed start page code and description dataset naming discrepancy #2370

Testing Changes

Update minimum unit tests to run on all pull requests #2314
Pass token to authorize uploading of codecov reports #2344
Add pytest-timeout. All tests that run longer than 6 minutes will fail. #2374
Separated the dask tests out into separate github action jobs to isolate dask failures. #2376
Refactored dask tests #2377
Added the combined dask/non-dask unit tests back and renamed the dask only unit tests. #2382
Sped up unit tests and split into separate jobs #2365
Change CI job names, run lint for python 3.9, run nightlies on python 3.8 at 3am EST #2395 #2398
Set fail-fast to false for CI jobs that run for PRs #2402

Breaking Changes

AutoMLSearch will accept allowed_component_graphs instead of allowed_pipelines #2364
Removed PipelineBase's _component_graph attribute. Updated PipelineBase __repr__ and added __eq__ for ComponentGraph #2332
pipeline_parameters will no longer accept skopt.space variables since hyperparameter ranges will now be specified through custom_hyperparameters #2317

Assets 2

02 Jun 15:09

chukarsten

v0.25.0

961584a

v0.25.0

v0.25.0 Jun. 2, 2021

Enhancements

Upgraded minimum woodwork to version 0.3.1. Previous versions will not be supported #2181
Added a new callback parameter for explain_predictions_best_worst #2308

Fixes

Changes

Deleted the return_pandas flag from our demo data loaders #2181

Documentation Changes

Testing Changes

Ignoring test_saving_png_file while building conda package #2323

Breaking Changes

Deleted the return_pandas flag from our demo data loaders #2181
Upgraded minimum woodwork to version 0.3.1. Previous versions will not be supported #2181
Due to the weak-ref in woodwork, set the result of infer_feature_types to a variable before accessing woodwork #2181

Assets 2

25 May 01:00

chukarsten

v0.24.2

bcfd02f

v0.24.2

v0.24.2 May. 24, 2021

Enhancements

Added oversamplers to AutoMLSearch #2213 #2286
Added dictionary input functionality for Undersampler component #2271
Changed the default parameter values for Elastic Net Classifier and Elastic Net Regressor #2269

Fixes

Set default n_jobs to 1 for StackedEnsembleClassifier and StackedEnsembleRegressor until fix for text-based parallelism in sklearn stacking can be found #2295

Changes

Updated start_iteration_callback to accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290

Documentation Changes

Testing Changes

Use codecov action to update coverage reports #2238
Removed MarkupSafe dependency version pin from requirements.txt and moved instead into RTD docs build CI #2261

Breaking Changes

Updated start_iteration_callback to accept a pipeline instance instead of a pipeline class and no longer accept pipeline parameters as a parameter #2290

Assets 2

17 May 15:58

chukarsten

v0.24.1

93e043c

v0.24.1

v0.24.1 May. 17, 2021

Enhancements

Integrated ARIMARegressor into AutoML #2009
Updated HighlyNullDataCheck to also perform a null row check #2222
Set max_depth to 1 in calls to featuretools dfs #2231

Fixes

Removed data splitter sampler calls during training #2253
Set minimum required version for for pyzmq, colorama, and docutils #2254
Changed BaseSampler to return None instead of y #2272

Changes

Updated pipeline repr() and generate_pipeline_code to return pipeline instances without generating custom pipeline class #2227

Documentation Changes

Capped Sphinx version under 4.0.0 #2244

Testing Changes

Change number of cores for pytest from 4 to 2 #2266
Add minimum dependency checker to generate minimum requirement files #2267
(evalml_dev) RM-MB-151:evalml karsten.chu$ tools/format_release_notes.sh

v0.24.1 May. 17, 2021

Enhancements

Integrated ARIMARegressor into AutoML #2009
Updated HighlyNullDataCheck to also perform a null row check #2222
Set max_depth to 1 in calls to featuretools dfs #2231

Fixes

Removed data splitter sampler calls during training #2253
Set minimum required version for for pyzmq, colorama, and docutils #2254
Changed BaseSampler to return None instead of y #2272

Changes

Updated pipeline repr() and generate_pipeline_code to return pipeline instances without generating custom pipeline class #2227

Documentation Changes

Capped Sphinx version under 4.0.0 #2244

Testing Changes

Change number of cores for pytest from 4 to 2 #2266
Add minimum dependency checker to generate minimum requirement files #2267

Assets 2

Releases: alteryx/evalml

v0.31.0

v0.31.0 Aug. 19, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

Breaking Changes

v0.30.2

v0.30.2 Aug. 16, 2021

Fixes

v0.30.1

v0.30.1 Aug. 13, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

Breaking Changes

v0.30.0

v0.30.0 Aug. 4, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

Breaking Changes

v0.29.0

v0.29.0 Jul. 22, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

Breaking Changes

v0.28.0

v0.28.0 Jul. 2, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

v0.27.0

v0.27.0 Jun. 23, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

Breaking Changes

v0.25.0

v0.25.0 Jun. 2, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

Breaking Changes

v0.24.2

v0.24.2 May. 24, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

Breaking Changes

v0.24.1

v0.24.1 May. 17, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes

v0.24.1 May. 17, 2021

Enhancements

Fixes

Changes

Documentation Changes

Testing Changes