Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

issues in ISRO SPACE mining .py file #12

Open
Sakeebhasan123456 opened this issue Oct 2, 2024 · 4 comments
Open

issues in ISRO SPACE mining .py file #12

Sakeebhasan123456 opened this issue Oct 2, 2024 · 4 comments

Comments

@Sakeebhasan123456
Copy link
Contributor

Sakeebhasan123456 commented Oct 2, 2024

@Devanik21 i analyze the mining python file and see lots of abnormalities ,i would to work on it

there is list of problem s that i can solve

Data Leakage:** Pre-trained models may have been trained using the full dataset, leading to inflated performance.

Overfitting:** Models show unrealistically high accuracy, suggesting they are overfitting to the training data.

Imbalanced Dataset:** The target variable might be imbalanced, which could bias the model’s predictions.

I### nadequate Evaluation:** Overreliance on accuracy without considering metrics like ROC AUC or F1-score.

No Cross-Validation:** Cross-validation is missing, which could lead to overestimating model performance.

Missing Feature Scaling:** Numerical features are not scaled, which may affect the performance of some models.

Categorical Encoding:** The categorical feature 'Celestial Body' is not properly encoded.

Unused Imports:** Libraries like TensorFlow and Keras are imported but not used.

Inconsistent Random Seeds:** Random seeds are not consistently set, making results non-reproducible.

No Hyperparameter Tuning:** Models are used with default parameters without sufficient tuning.

@Devanik21 Devanik21 changed the title issus in ISRO SPACE minning .py file issues in ISRO SPACE mining .py file Oct 2, 2024
@Devanik21
Copy link
Owner

@Devanik21 i analyze the mining python file and see lots of abnormalities ,i would to work on it

there is list of problem s that i can solve

Data Leakage:** Pre-trained models may have been trained using the full dataset, leading to inflated performance.

Overfitting:** Models show unrealistically high accuracy, suggesting they are overfitting to the training data.

Imbalanced Dataset:** The target variable might be imbalanced, which could bias the model’s predictions.

I### nadequate Evaluation:** Overreliance on accuracy without considering metrics like ROC AUC or F1-score.

No Cross-Validation:** Cross-validation is missing, which could lead to overestimating model performance.

Missing Feature Scaling:** Numerical features are not scaled, which may affect the performance of some models.

Categorical Encoding:** The categorical feature 'Celestial Body' is not properly encoded.

Unused Imports:** Libraries like TensorFlow and Keras are imported but not used.

Inconsistent Random Seeds:** Random seeds are not consistently set, making results non-reproducible.

No Hyperparameter Tuning:** Models are used with default parameters without sufficient tuning.

Thanks .
well said, plz try with all of them with overfitting issue, cross validation , hyper parameter tuning in priority.

@JahnaviDhanaSri
Copy link

Hi @Devanik21

I would like to take on the task of resolving the identified issues in the ISRO SPACE mining .py file.
I will analyze the model's training process to implement regularization techniques and potentially adjust the model architecture to reduce overfitting.
I intend to introduce k-fold cross-validation to ensure more reliable model evaluation and performance metrics.
I will utilize techniques like Grid Search or Random Search to optimize hyperparameters and enhance the model's predictive accuracy.

@Devanik21
Copy link
Owner

Hi @Devanik21

I would like to take on the task of resolving the identified issues in the ISRO SPACE mining .py file. I will analyze the model's training process to implement regularization techniques and potentially adjust the model architecture to reduce overfitting. I intend to introduce k-fold cross-validation to ensure more reliable model evaluation and performance metrics. I will utilize techniques like Grid Search or Random Search to optimize hyperparameters and enhance the model's predictive accuracy.

hello , are you working on it?

@Devanik21 Devanik21 added enhancement New feature or request gssoc-ext level3 labels Oct 6, 2024
@JahnaviDhanaSri
Copy link

Yeah I am working on it

@Devanik21 Devanik21 pinned this issue Oct 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants