{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":661359797,"defaultBranch":"main","name":"datSci","ownerLogin":"oshinrathor","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2023-07-02T15:46:00.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/112712889?v=4","public":true,"private":false,"isOrgOwned":false},"refInfo":{"name":"","listCacheKey":"v0:1688314468.0","currentOid":""},"activityList":{"items":[{"before":"82db7f57f7e4cc7d72d4af51b7ea0eaf845ff067","after":"ce6840cb486a85f9cc2355aeb0afad367d407b28","ref":"refs/heads/main","pushedAt":"2024-08-04T13:47:26.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload","shortMessageHtmlLink":"Add files via upload"}},{"before":"8e1130bca276a73980d18b4adbc267f39b9bda04","after":"82db7f57f7e4cc7d72d4af51b7ea0eaf845ff067","ref":"refs/heads/main","pushedAt":"2024-08-04T13:22:28.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload","shortMessageHtmlLink":"Add files via upload"}},{"before":"25c71c6d2aad1d49879663e2c0757aad99862ade","after":"8e1130bca276a73980d18b4adbc267f39b9bda04","ref":"refs/heads/main","pushedAt":"2024-05-12T18:58:39.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload\n\nAims to address bias in a dataset by employing statistical analysis and natural language processing techniques. 
Initially, the team imports the dataset and creates a graph to visualize the distribution of bias labels. They identify a significant bias towards one label (82.2% to 17.8%) and set out to mitigate it. Through statistical analysis, they determine mean ratings and positive feedback counts for different department categories. Utilizing this information, they update the dataset, reassigning labels based on deviation from department-specific mean values. This statistical approach significantly reduces bias, achieving a more balanced distribution (62% to 38%). Further preprocessing involves text normalization, stemming, and lemmatization to reduce feature space. TF-IDF vectorization is employed to calculate term frequency-inverse document frequency weights, enriching the dataset representation. In conclusion, the code successfully mitigates bias through a comprehensive statistical and NLP-based approach, enhancing the dataset's utility for subsequent analysis and modeling tasks.","shortMessageHtmlLink":"Add files via upload"}},{"before":"2c78fe83b3063106098f5893f2efcc21ca606254","after":"25c71c6d2aad1d49879663e2c0757aad99862ade","ref":"refs/heads/main","pushedAt":"2024-05-12T18:52:04.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload\n\nA project focused on customer segmentation and analysis using RFM (Recency, Frequency, Monetary) modeling. The project starts with data loading and cleaning, where missing values are handled, and data types are adjusted. Then, RFM attributes are derived from the dataset, including monetary, frequency, and recency. Outlier analysis is conducted to ensure data quality. Subsequently, the features are scaled using StandardScaler for model compatibility. 
The core of the project lies in model building using K-means clustering, where an optimal number of clusters is determined using the elbow method. Finally, the K-means model is trained and saved, and customer segments are visualized through box plots. The end goal of the project is to provide actionable insights for marketing strategies and customer relationship management, achieved by effectively segmenting customers based on their transactional behavior.","shortMessageHtmlLink":"Add files via upload"}},{"before":"46e56a9332e8280713c7b29252355257669f0b7a","after":"2c78fe83b3063106098f5893f2efcc21ca606254","ref":"refs/heads/main","pushedAt":"2024-05-02T15:27:50.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload\n\nThis GitHub repository hosts a language identification project employing natural language processing techniques. Using Python and libraries like Pandas, NumPy, NLTK, and Scikit-learn, it accurately identifies the language of text inputs. The project includes data visualization, text preprocessing, model training with a Multinomial Naive Bayes classifier, and evaluation. 
It also offers functions for saving and loading trained models, with a user-friendly testing feature.","shortMessageHtmlLink":"Add files via upload"}},{"before":"75e5f19896ebff041f1891318882ebbe343beadc","after":"46e56a9332e8280713c7b29252355257669f0b7a","ref":"refs/heads/main","pushedAt":"2024-05-01T13:42:36.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload","shortMessageHtmlLink":"Add files via upload"}},{"before":"37796de9a1041edaf7a30622cffc297e1c77ffd7","after":"75e5f19896ebff041f1891318882ebbe343beadc","ref":"refs/heads/main","pushedAt":"2024-05-01T13:38:36.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Delete vectorizer.pkl","shortMessageHtmlLink":"Delete vectorizer.pkl"}},{"before":"b22831042381054da48fd21823d4f983c25ab6a4","after":"37796de9a1041edaf7a30622cffc297e1c77ffd7","ref":"refs/heads/main","pushedAt":"2024-05-01T13:38:19.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Delete spam.csv","shortMessageHtmlLink":"Delete spam.csv"}},{"before":"b6f1afd4f35021dc574d505c92e1c7dfd8d4a7fa","after":"b22831042381054da48fd21823d4f983c25ab6a4","ref":"refs/heads/main","pushedAt":"2024-05-01T13:38:03.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Delete smsSpamClsfr.ipynb","shortMessageHtmlLink":"Delete 
smsSpamClsfr.ipynb"}},{"before":"b73537cd243aad1a3694e84c9cbc103d23d03962","after":"b6f1afd4f35021dc574d505c92e1c7dfd8d4a7fa","ref":"refs/heads/main","pushedAt":"2024-05-01T13:37:49.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Delete model.pkl","shortMessageHtmlLink":"Delete model.pkl"}},{"before":"bda80beb7113709dbb8db30eb24601adae842de5","after":"b73537cd243aad1a3694e84c9cbc103d23d03962","ref":"refs/heads/main","pushedAt":"2024-05-01T13:36:58.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload","shortMessageHtmlLink":"Add files via upload"}},{"before":"acdc88ca56b16e577093162c317c5492ed01c84d","after":"bda80beb7113709dbb8db30eb24601adae842de5","ref":"refs/heads/main","pushedAt":"2023-07-29T16:15:04.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Created using Colaboratory","shortMessageHtmlLink":"Created using Colaboratory"}},{"before":"952b62c57c8543eadfcc7319abf83ebf43c109be","after":"acdc88ca56b16e577093162c317c5492ed01c84d","ref":"refs/heads/main","pushedAt":"2023-07-28T15:06:13.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Created using Colaboratory","shortMessageHtmlLink":"Created using 
Colaboratory"}},{"before":null,"after":"952b62c57c8543eadfcc7319abf83ebf43c109be","ref":"refs/heads/main","pushedAt":"2023-07-02T16:14:28.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"oshinrathor","name":"OSHIN RATHOR","path":"/oshinrathor","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/112712889?s=80&v=4"},"commit":{"message":"Add files via upload","shortMessageHtmlLink":"Add files via upload"}}],"hasNextPage":false,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEkU_EvgA","startCursor":null,"endCursor":null}},"title":"Activity · oshinrathor/datSci"}