Researcher Data Scientist with over 3 years of experience in data, specializing in time series forecasting, causal inference and spatial analysis using Python.
I have professional experience in market and business data analysis, which has contributed to the development of strong analytical and data communication skills.
My Master’s degree in Applied Economics provided me with a solid foundation in statistical and mathematical modeling, as well as research skills. My undergraduate degree in engineering developed my ability to solve complex problems and learn autonomously.
I work on data science projects utilizing Machine Learning models in Python, covering areas such as sales forecasting, demand price elasticity, as well as customer classification and clustering. These projects demonstrate my ability to apply algorithms with best coding practices to extract insights and guide strategic decisions.
-
Master's degree in Applied Economics with a focus on time series and spatial econometrics. The dissertation analyzes how Brazilian soybean exports relate to macroeconomic variables, providing insights into the economic impacts of global changes.
-
Bachelor's degree in Agricultural Engineering with a focus on agribusiness. For my final project, I used R to create price forecasting models for key agricultural commodities using the ARIMA methodology, published in the Journal of Economics and Agribusiness.
Currently working on the expansion of dengue fever in the Brazilian Legal Amazon to identify spatiotemporal patterns. Previously, I analyzed the dynamics of the ethanol market for Ipiranga's intelligence sector.
During my time at Ipiranga, I was responsible for analyzing and communicating market information, particularly focusing on ethanol market dynamics for the Supply & Trading team. In addition to my data expertise, I'm proficient in English and experienced in using the Microsoft Office suite.
I'm a proactive individual that is willing to learn new skills and work in a fast-paced and collaborative environment.
Here I highlight some of the projects I have worked on throughout my professional career.
In addition to my data science work, I also have a strong interest in econometrics. Here are some of my econometric projects:
-
Using ARIMA to Forecast Olist Revenue: Applied the Autoregressive Integrated Moving Average model to predict Olist's revenue for the next 14 days, providing the company with robust forecasting models to improve financial planning and resource optimization. Read more on Medium.
-
Price Elasticity of Demand Analysis: Analyzed price-demand elasticity to understand how price variations impact consumer demand for elastic products. This project provides a brief overview of how this economic theory can be applied to real-world scenarios and be a valuable resource for businesses. Explore the insights with StreamlitApp.
These projects showcase my expertise in applying econometric techniques to real-world problems, particularly in forecasting and demand analysis.
I'm passionate about leveraging data to solve business problems. Here are some of my recent projects:
-
InStyle - Customer Classification: Used different Machine Learning algorithms to train and predict customer satisfaction. The project provided insights and recommendations to classify customers and predict dissatisfaction, enabling InStyle to address customer satisfaction issues and improve overall experience.
-
Cardiovascular Disease Detector: Created a model to enhance the accuracy of cardiovascular disease diagnosis, improving from 65% to 75% accuracy, resulting in a significant revenue increase for Cardio Catch Disease.
-
Sales Forecasting: Employed various Machine Learning models and chose the best performing model to forecast sales for Rossmann stores. The project provided a comprehensive step-by-step Data Science project to optimize the resources allocation to renovate Rossmann stores.
-
Taxi Destination Prediction: Developed a predictive model to infer the final destination of taxi rides, reducing empty mileage and optimizing operational planning.
- Customer Clustering for Marketing Campaign: The goal is to enhance business and marketing strategies by clustering customers using the K-Means algorithm. This helps understand customer behaviors and preferences, enabling personalized strategies for each group. Benefits include more effective marketing, better customer experience, and optimized resource allocation for maximum business impact.
- Beyond "You May Also Like": Understanding Consumer Choices: Conducted an Exploratory Data Analysis to understand purchases, products, consumer temporal behavior, and the price-quantity relationship. Used Market Basket Analysis to identify association patterns among products, classifying those frequently bought together, and Recommendation System to generate product suggestions for customers based on their similarity to others. This project equips Rex London with information to enhance customer experience through personalized recommendations, potentially increasing sales.
-
Credit Score: Built an internal risk model to optimize profitability and business security while balancing market expansion, providing key insights on loan approval strategies through credit score decile analysis. Streamlit.
-
Fraud Detection: Developed a fraud detection model using Exploratory Data Analysis, data preparation techniques, and various Machine Learning models, achieving high accuracy and recall to confidently identify fraud.
-
Credit Score Analysis: Conducted a comprehensive credit score analysis to increase bank profitability by implementing and comparing Machine Learning models, resulting in a detailed client decile classification for optimal profit and risk management.
- Exploratory Data Analysis for Olist: Performed a comprehensive exploratory data analysis for Olist, uncovering key business insights related to consumer behavior, customer satisfaction, sales patterns, and regional differences across states. The analysis provided quality and actionable informations to support strategic decision-making.
All my projects are structured around solving business problems, employing regression, clustering, and classification algorithms to deliver optimal solutions. You can read more about them on my Portfolio Page and find the codes on their repositories.
- Programming Languages: Python, R, SQL
- Feature Engineering: Encoding, Scaling, Imputers, Resampling
- Machine Learning: Regression, Classification, Clustering, Cross-Validation, Hyperparameter Tuning, Random Search
- Metrics: Confusion Matrix, Accuracy, Precision, Recall, F1 Score, MAE, MAPE, RMSE, R²
- Visualization: Power BI, Streamlit Dashboards
- Cloud: AWS (Glue, S3, Athena, SageMaker)
- Professional Skills: Advanced English, Microsoft Office