Welcome to my GitHub repository for machine learning projects and experiments! This repository contains a variety of Jupyter Notebook files showcasing different machine learning techniques, data analysis, and data engineering tasks. Feel free to explore the projects and experiments I've worked on.
-
Anomaly Detection Using AutoEncoder with Pytorch
- Detect anomalies in data using an AutoEncoder model implemented with PyTorch.
-
Bubble Detection for the Crypto Market
- Analyze and detect bubbles in the cryptocurrency market using data analysis and visualization techniques.
-
- Perform data engineering tasks and exploratory data analysis to gain insights from various datasets.
-
- Convert 2D images to 3D depth maps using a machine learning approach.
-
Feature Importance with XGBoost vs ppscore
- Compare feature importance techniques between XGBoost and ppscore libraries.
-
Fine-Tune LLaMA 2 with QLoRA for QA Datasets
- Fine-tune the LLaMA 2 model using QLoRA for question-answering datasets.
-
- Build a complete machine learning pipeline including preprocessing, model training, and evaluation.
-
McKinsey ProHack Challenge 2020 with XGBoost
- Participate in the McKinsey ProHack Challenge 2020 using XGBoost with optimization techniques.
-
- Explore various natural language processing (NLP) cases and techniques.
-
Quantize Llama 2.7B Model and Fine-Tune on QA Datasets
- Quantize the Llama 2.7B model and fine-tune it on question-answering datasets.
-
Quantum Method for Portfolio Optimization
- Utilize a quantum approach for portfolio optimization.
-
Rule-Based Turkish Language Sentiment Analysis
- Perform sentiment analysis on Turkish text using a rule-based approach.
-
- Build a Seasonal Autoregressive Integrated Moving Average (SARIMA) model for time series forecasting.
-
- Mine and analyze data related to conflicts in Turkey.
-
- Perform text classification on Turkish text using machine learning techniques.
-
- Tutorial on working with vector databases in the context of machine learning.
-
- Experiment and explore the Llama 2.13B model.
-
- Analyze trends in medical data.
-
- Optimize investment portfolios using machine learning techniques.
-
- Classify stroke and non-stroke cases using machine learning.
-
- Analyze tapu (land registry) and kadastro (cadastre) data.
The project notebooks are implemented using various libraries and frameworks. To reproduce the results or run the notebooks, please refer to the requirements.txt
file for the necessary dependencies.
Feel free to reach out if you have any questions or suggestions. Happy exploring and learning!
Note: Some notebooks were created using Colaboratory, and others were developed locally. Make sure you have the required libraries and packages installed before running the notebooks.