Skip to content
View fahmizainal17's full-sized avatar
πŸ“ˆ
Learning and Progressing Every Second ⚑
πŸ“ˆ
Learning and Progressing Every Second ⚑

Block or report fahmizainal17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
fahmizainal17/README.md

Greetings to everyone, I'm Fahmi Zainal

Data Scientist | Analytics Engineer | Survey | Digital Marketing | Software Development | ML & AI | ETL | Databricks | API Deployment | Azure & AWS DevOps Practitioner | Second Lieutenant Reserved Officer πŸŽ–

fahmizainal17

fahmizainal17


🌱 Professional Development

INVOKE SOLUTIONS, Kuala Lumpur (November 2023 – Now)

Data Scientist

Surveys Making, Automation and Optimization:

  • Led the design and implementation of the IVR Data Cleaner, Questionnaire Definer, and Keypress Decoder, significantly improving the efficiency of data processes from 8 hours to 30 minutes.
  • Spearheaded and handled survey projects for call sampling, weighting, and visualization through crosstabs and charts utilizing Google Colab, Azure Databricks, and in-house web applications.
  • Developed and integrated Unified Survey web applications using Streamlit, enhancing collaboration by merging functionalities with team-developed applications in Streamlit and Shiny Apps utilizing FastAPI, supported by Docker, AWS Services, and GitHub Actions (CI/CD).
  • Architected data structure and profiling for utilizing data mining techniques and Geocoding API to map the locality (latitude and longitude) and postcode and other demographics data.
  • Taught and guided interns for in-house training in conducting surveys and utilizing Streamlit for proof of concept.

Machine Learning Development:

  • Prepared data, pipelines, designed, and developed predictive models to analyze ROAS Benchmark and Campaign Benchmark, consumer behaviors, and market trends utilizing MongoDB, Databricks, Azure Blob Storage for workflow, and using advanced matching and machine learning techniques, thus enhancing the company’s revenue by up to 10% by engaging potential regular clients.
  • Created innovative EDA and model deployment to predict Personal Attributes such as Salary and Income Group, improving demographic targeting accuracy to up to 90% accuracy.

ROAS Dashboard Application:

  • Streamlined and innovated solutions for the ROAS Dashboard application to deliver more precise and filtered results. Enhanced interactivity with users by providing clear graphs for visualization.
  • Worked with Frontend Developer and interacted by creating API Endpoints of the functions to pass the data as of difference in packages usage.

Computer Vision Applications Maintenance:

  • Maintained computer vision projects of FGV oil palm company to optimize the pesticide application.
  • Maintained document parsing automation by implementing TensorFlow and YOLOv5 technologies, facilitating the digitization of invoices and receipts into structured formats for detailed analysis.

Data Warehouse Management:

  • Oversaw the maintenance of the INVOKE’s data warehouse, ensuring robust integration and customization of dashboards to optimize data analytics capabilities.

EXCELERATE ASIA, Kuala Lumpur (September 2023 – October 2023)

Data Analyst (General Assembly)

  • Workforce Dynamics Insights Project: Enhanced data integrity and workforce management through expert data wrangling with Excel and development of Interactive Tableau Dashboards, providing crucial insights into Retention, Compensation Fairness, and Diversity.
  • Global Superstore SQL Project: Leveraged PostgreSQL for robust data management and extraction to Tableau, analyzing trends in product returns to support strategic business decisions thus completing analytics bootcamp training.

πŸ† Achievements

KAGGLE WORLD COMPETITION, Online (April 2022 – June 2023)

Data Scientist

  • Achieved top 28% ranking out of 1908 teams in a Binary Prediction Competition on Smoker Status Using Bio-Signals, leveraging logistic regression, random forest, and tree-boosted algorithms such as Gradient Boosting, LightGBM, XGBoost, and CatBoost.
  • Planned the work meticulously, conducting nightly meetings to discuss strategies, optimize models using Optuna for hyperparameter tuning, and implement a weighted voting classifier for enhanced performance, resulting in an ROC-AUC score of 0.87178 which is 87% accuracy.

πŸ›‘ Leadership

MALAYSIAN ARMED FORCE, Kuala Lumpur (September 2019 – February 2023)

Second Lieutenant Reserved Army

  • Leadership: Commanded a platoon of 89 cadets, achieving high training ratings and improving administrative efficiency, reducing errors by 5%.
  • Communication: Excelled in high-pressure scenarios, coordinating team efforts and earning recognition for leadership during critical simulations.

  • πŸ“ I regularly write articles on www.linkedin.com/in/muhammadfahmibinmohdzainal

  • πŸ’¬ Ask me about Data Science, Data Analysis, Physics, Military, Silat Cekak, Skop Production Movies πŸ˜…

  • πŸ“« How to reach me fahmizainal9@gmail.com

  • πŸ“„ Know about my experiences www.linkedin.com/in/muhammadfahmibinmohdzainal

  • ⚑ Fun fact I love listening to podcasts, reading articles, watching educational content to upgrade my skills in Data Analytics, Data Science, Data Engineering, and Business because that is what I'm passionate about. Not to mention, I love fishing and snorkeling.

  • πŸ” My Expertise

Area Technologies/Tools
IVR and Calls Analytics Google Colab, Streamlit Web Application, Visual Studio Code, Google Sheet API, Google Sheet Dashboard, DataBricks, PySpark
Languages Python, SQL, R, Html5, CSS3, JavaScript
Cloud & Databases Databricks, AWS API Gateway, AWS Fargate, AWS S3, AWS EC2, AWS ECR, AWS ECS, PostgreSQL
Data Processing PySpark
Dashboarding Excel, Google Sheet, Google Colab, PowerBI, Tableau
Web Development Streamlit Web App, FastAPI
Containerization Docker

Connect with me:

fahmizainal17_ muhammadfahmibinmohdzainal fahmizainal fahmizainal fahmizainal17 @fahmizainal7695

Languages and Tools:

aws azure cplusplus firebase gcp hadoop html5 javascript matlab opencv pandas postgresql python pytorch scikit_learn seaborn selenium tensorflow

🌟 Support My Work:

Buy me a coffee
fahmizainal17 fahmizainal17
fahmizainal17

πŸš€ My Wakatime This Week!

wakatime GitHub followers GitHub Stars

πŸ“Š This Week I Spent My Time On:

From: 08 September 2024 - To: 15 September 2024

Total Time: 10 hrs 34 mins

Python   10 hrs 30 mins  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–“   99.10 %
TOML     3 mins          β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   00.54 %
Other    2 mins          β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   00.36 %

Pinned Loading

  1. Fahmi_Zainal_Portfolio Fahmi_Zainal_Portfolio Public

    A dynamic portfolio website built with Streamlit to showcase Fahmi Zainal 's professional journey, achievements, and projects in an engaging and interactive way.

    Python 1

  2. FastAPI_ROAS_Dashboard_Project FastAPI_ROAS_Dashboard_Project Public

    This project separates the backend from the Streamlit frontend, providing a robust API built with FastAPI. It includes comprehensive backend testing and endpoint testing using Pytest, ensuring the …

    Python

  3. Advanced_SQL_Use_Cases_for_Vehicle_Auction_Analysis_Project Advanced_SQL_Use_Cases_for_Vehicle_Auction_Analysis_Project Public

    This project showcases advanced SQL queries and data analysis techniques applied to a dataset of vehicle auctions. The dataset includes data from multiple competitors over several months, with deta…

    2

  4. Streamlit_IVR_Data_Cleaning_Automation_Project Streamlit_IVR_Data_Cleaning_Automation_Project Public

    A web application built with Streamlit for automating the cleaning of IVR (Interactive Voice Response) data, primarily used for analytics purposes.

    Python 2

  5. Tensorflow_Flood_Prediction_Project Tensorflow_Flood_Prediction_Project Public

    This project leverages TensorFlow and Keras to build and train a neural network model for predicting flood probability based on various environmental and socio-economic factors.

    Jupyter Notebook 2

  6. HR_Analytics_Identifying_Key_Factors_Contributing_to_High_Employee_Attrition_Rates_Project HR_Analytics_Identifying_Key_Factors_Contributing_to_High_Employee_Attrition_Rates_Project Public

    This project uses HR analytics to identify key factors contributing to high employee attrition rates, helping organizations understand and mitigate turnover issues.

    1