Enanchment in CF Generation #259

giandos200 · 2022-01-13T11:59:09Z

Hi, I created this pull request to give some hints on CF generation @gaugup.
Regarding Random, although there is very clear and fast, finding combinations between feature sampling and substitution is unclear. The Loop inside, instead of gradually replacing more features actually in your code, only replaces one feature as :
selected_features = np.random.choice(self.features_to_vary, (sample_size, 1), replace=True)
1 should be replaced by num_features_to_vary and then .loc instead of .at.

This method is slower but certainly more complete and still faster than Genetic/KDtree (I have deliberately left it commented out for you).
If you want to leave a single variation, I suggest changing .at to ._get_value in the replacement for faster access.
As far as genetic is concerned, in the case of datasets with many features, a random initialization is very slow and seems never to end. For this reason, I suggest increasing the population of the KDtree initialization (which is also lowering the initialization time a lot). In addition, I recommend switching to a binary search in the case of requests for a large number of CFs.

gaugup · 2022-01-13T16:43:45Z

Thanks @giandos200 for this PR. I executed all the gates. Could you please examine the failures and re-submit to make all the tests and linting pass? It looks like your PR has a lot of changes which are out of scope of the performance improvement. It will be great if you could clean all this and send out a commit focusing just on the perf improvements.

Regards,

giandos200 · 2022-01-13T17:32:23Z

It should be ok now, @gaugup . Maybe I have an older version because I have never changed the imports.

gaugup

Please take a look at the failing gates.

dice_ml/explainer_interfaces/dice_genetic.py

dice_ml/explainer_interfaces/dice_random.py

Signed-off-by: Gaurav Gupta <gaugup@microsoft.com> Signed-off-by: giandos200 <giando95menico@gmail.com>

Signed-off-by: giandos200 <giando95menico@gmail.com>

…@gmail.com> Signed-off-by: giandos200 <giando95menico@gmail.com>

Signed-off-by: giandos200 <giando95menico@gmail.com>

giandos200 requested review from amit-sharma and gaugup as code owners January 13, 2022 11:59

giandos200 force-pushed the master branch from 86dc77f to 2a88433 Compare January 13, 2022 12:00

giandos200 force-pushed the master branch from b440eb1 to 4c355dc Compare January 13, 2022 17:29

giandos200 force-pushed the master branch 14 times, most recently from 19f2e47 to 785bc4a Compare January 14, 2022 13:29

gaugup requested changes Jan 14, 2022

View reviewed changes

dice_ml/explainer_interfaces/dice_genetic.py Outdated Show resolved Hide resolved

dice_ml/explainer_interfaces/dice_genetic.py Outdated Show resolved Hide resolved

giandos200 force-pushed the master branch from 447c339 to 00dda52 Compare January 14, 2022 16:20

giandos200 requested a review from gaugup January 14, 2022 16:22

giandos200 commented Jan 15, 2022

View reviewed changes

dice_ml/explainer_interfaces/dice_random.py Show resolved Hide resolved

giandos200 force-pushed the master branch from 92fda2c to 09b4550 Compare January 15, 2022 13:57

gaugup and others added 5 commits January 24, 2022 14:27

Add flake8-breakpoint to avoid code checkin with active breakpoints

0f8c954

Signed-off-by: Gaurav Gupta <gaugup@microsoft.com> Signed-off-by: giandos200 <giando95menico@gmail.com>

Enanchment in CF Generation

7874455

Signed-off-by: giandos200 <giando95menico@gmail.com>

import update

7e9cf74

Signed-off-by: giandos200 <giando95menico@gmail.com>

import update

25d858c

Signed-off-by: giandos200 <giando95menico@gmail.com>

import update

4502327

Signed-off-by: giandos200 <giando95menico@gmail.com>

giandos200 and others added 8 commits January 24, 2022 14:27

notebook updated

b07e017

Signed-off-by: giandos200 <giando95menico@gmail.com>

all test passed and updated

dc2c6ae

Signed-off-by: giandos200 <giando95menico@gmail.com>

adding signoff Signed-off-by: Giandomenico Cornacchia <giando95menico…

7d84a63

…@gmail.com> Signed-off-by: giandos200 <giando95menico@gmail.com>

Update Benchmarking_different_CF_explanation_methods.ipynb

b046fe0

Signed-off-by: giandos200 <giando95menico@gmail.com>

benchUpdated

25f1263

Signed-off-by: giandos200 <giando95menico@gmail.com>

review

dd098bc

Signed-off-by: giandos200 <giando95menico@gmail.com>

flake8 E125/W292 bestpractice reviewed

ef3de56

Signed-off-by: giandos200 <giando95menico@gmail.com>

update

b12c369

Signed-off-by: giandos200 <giando95menico@gmail.com>

giandos200 force-pushed the master branch from 632865b to b12c369 Compare January 24, 2022 13:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enanchment in CF Generation #259

Enanchment in CF Generation #259

giandos200 commented Jan 13, 2022

gaugup commented Jan 13, 2022

giandos200 commented Jan 13, 2022

gaugup left a comment

Enanchment in CF Generation #259

Are you sure you want to change the base?

Enanchment in CF Generation #259

Conversation

giandos200 commented Jan 13, 2022

gaugup commented Jan 13, 2022

giandos200 commented Jan 13, 2022

gaugup left a comment

Choose a reason for hiding this comment