Mismatch in Feature Names Between Classifier's Training and Prediction Phases #17

rakshit-upadhyay214 · 2024-10-03T06:16:24Z

When using the model for classification, predict raises a ValueError because the feature names passed at inference time do not match the feature names seen during training (fit).

Steps to Reproduce:

Load the trained RF_mining_model.pkl.
Attempt to make predictions using test data obtained from the original dataset's train-test split.
Observe the ValueError due to mismatched feature names.
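
Before retraining anything, it may help to see exactly which names differ. The helper below is a minimal sketch in plain Python, not code from this repository; `diff_feature_names` and the column names are hypothetical and used only for illustration.

```python
def diff_feature_names(fit_names, predict_names):
    """Summarise how predict-time column names differ from fit-time ones."""
    fit_names = [str(n) for n in fit_names]
    predict_names = [str(n) for n in predict_names]
    fit_set, predict_set = set(fit_names), set(predict_names)
    return {
        # Names the model was trained on but that are absent at predict time.
        "missing": [n for n in fit_names if n not in predict_set],
        # Names present at predict time that the model never saw.
        "unexpected": [n for n in predict_names if n not in fit_set],
        # Same names on both sides, but in a different column order.
        "order_differs": fit_set == predict_set and fit_names != predict_names,
    }


# Hypothetical column names, for illustration only.
report = diff_feature_names(
    ["depth", "grade", "hardness"],
    ["depth", "Grade", "hardness"],
)
print(report)
```

Assuming the pickled model is a scikit-learn estimator that was fitted on a pandas DataFrame, the fit-time names should be available as `model.feature_names_in_`, and the predict-time names as `X_test.columns`.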

Expected Behavior: The feature names should be identical at training and prediction time.

Proposed Solution: Train the classifier on the exact feature names as they appear in the dataset.

github-actions · 2024-10-03T06:16:48Z

👋 Thank you for raising an issue! We appreciate your effort in helping us improve. Our team will review it shortly. Stay tuned!

Devanik21 · 2024-10-03T13:44:33Z

Can you specify which model it is: GBoost or RForest?

rakshit-upadhyay214 · 2024-10-03T16:26:12Z

It was Random Forest.

Devanik21 self-assigned this Oct 3, 2024

Devanik21 added the "bug" label (Something isn't working) Oct 3, 2024
