Skip to content
View benitomartin's full-sized avatar
💭
Book a Consultation on my Website ☎️
💭
Book a Consultation on my Website ☎️

Highlights

  • Pro

Organizations

@lewagon @mlops-club @Real-World-ML @end-to-end-mlops-databricks

Block or report benitomartin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
benitomartin/README.md

 

⬇️ You can find me here! ⬇️

LinkedIn Badge Website Badge Website Badge Website Badge Medium Badge

 

 

🎵 While you're here, why not enhance your visit with a melodious twist? Tune into this enchanting Spanish AI Song. A perfect blend of technology and art. Enjoy the vibes! 🎷🎶

🌟 Crafting each piece of content is a journey that demands both time and passion. If you enjoy my work, consider fueling my creativity Buying me a Coffee ☕ or supporting me on GitHub Sponsors 🚀

💰 My website has been created using Hostinger. If you want to create your own one, using this referral code will provide you 20% discount on the selected plan 💶

👉 CONTACT ME! 👉 Book a Consultation, or use this Form 🚀

 

👨‍💻 My Profile

Innovative and dynamic Data Scientist providing a diverse range of services, including project development, teaching, workshops, technical writing, and career coaching. My skill set includes (not limited to):

 

  • Data Science, Analytics & ML: Python, TensorFlow, Scikit-learn
  • AI: Langchain, LlamaIndex, Hugging Face, Transformers, Vector Databases
  • Key Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Neural Networks, Ensemble Methods, Clustering, Dimensionality Reduction
  • Data Engineering: dbt, Terraform, SQL, BigQuery, PySpark, Databricks
  • MLOps: MLflow, Prefect, Comet, Docker, Kubernetes
  • APIs: Flask, FastAPI
  • Apps: Streamlit, Gradio
  • Cloud Platforms: GCP, AWS
  • Version Control: Git

 

📄 Projects Portfolio

💰 My personal end-to-end projects can be found in these repositories. Feel free to click ⭐ if you like them 😎

 

Project Name Main Libraries/Tools Cloud Service App DevOps Best Practices
ML/MLOps
MLOps Credit Default Scikit-learn
LightGBM
MLflow
Databricks
AWS/Databricks Experiment Tracking
Model Registry
Model/Data Monitoring
Data Validation
Linting
Formatting
Testing
Error Handling
Pre-Commit
IaC
CI/CD
Medical Insurance Costs Prediction Scikit-learn
TensorFlow
SageMaker
Comet ML
Flask
AWS Experiment Tracking
Model Registry
Model/Data Monitoring
Model/Data Monitoring
Linting
Formatting
Testing
Error Handling
Coverage
CI/CD
Stroke Prediction Scikit-learn
XGBoost
SageMaker
Comet ML
Flask
Docker
AWS Experiment Tracking
Model Registry
Model Monitoring
Containerization
Testing
Error Handling
Car Price Prediction Scikit-learn
TensorFlow
MLFlow
Prefect
Flask
Docker
Grafana
Terraform
AWS Experiment Tracking
Model Registry
Model Monitoring
Orchestration
Containerization
Linting
Formatting
Testing
Error Handling
IaC
CI/CD
Taxi Rides Prediction Scikit-learn
TensorFlow
MLFlow
Prefect
FastAPI
Docker
GCP Experiment Tracking
Model Registry
Model Monitoring
Orchestration
Containerization
Error Handling
Music Clustering Scikit-learn
FastAPI
Docker
AWS Streamlit Containerization
CI/CD
Birds Classification Pytorch Gradio
Food Prediction Scikit-learn
TensorFlow
OpenCV
FastAPI
Docker
GCP Streamlit Containerization
LLM, RAG and Fine-tuning
RAG Hybrid Search and Semantic Caching Qdrant
FastEmbed
SPLADE
Hugging Face Transformers
Error Handling
Linting
Formatting
Multimodal Bill Scan System AWS Bedrock
AWS DynamoDB
AWS SQS/SNS
AWS CDK
Claude 3 Sonnet
AWS Error Handling
Linting
Formatting
IaC
IaC in RAG Applications with Terraform AWS Bedrock
LangChain
AWS Opensearch
Terraform
Titan
AWS Testing
Error Handling
Linting
Formatting
IaC
Scalable RAG in AWS with Fargate OpenAI
LlamaIndex
Qdrant
AWS CDK/Fargate
FastAPI
AWS Testing
Error Handling
RAG Deployment with Azure Functions OpenAI
LangChain
Qdrant
Azure Functions App
Azure Linting
Formatting
Testing
Error Handling
Scalable RAG with Kubernetes OpenAI
LlamaIndex
Qdrant
Docker
FastAPI
GKE
GCP Streamlit Containerization
Linting
Formatting
Testing
Error Handling
CI/CD
Research Papers Semantic Search OpenAI
LangChain
Qdrant
Docker
AWS API Gateway
AWS Streamlit Containerization
Linting
Formatting
Testing
Error Handling
Video Summarization Hugging Face Transformers
Whisper
Langchain
ChromaDB
Streamlit Error Handling
Multimodal RAG with Video Frames Gemini
LlamaIndex
Qdrant
Books Reranking Semantic Search OpenAI
LlamaIndex
Deep Lake
RAG Evaluation with Ragas OpenAI
Hugging Face Transformers
Faiss
LangChain
Ragas
PII RAG LlamaIndex Milvus OpenAI
Presidio
LlamaIndex
Milvus
Multimodal RAG with PyMuPDF OpenAI
Qdrant
LlamaIndex
PyMuPDF
Agentic RAG LlamaIndex Milvus OpenAI
Claude
LlamaIndex
Milvus
Agentic RAG with LangChain OpenAI
Groq
LangChain
Pinecone
Agentic RAG with CrewAI OpenAI
LangChain
Qdrant
CrewAI Agents
Fine Tuning Gemma 2B Hugging Face Transformers
PEFT (LoRA/QLoRA)
Hugging Face
Data Analysis + Modeling
News Classification Scikit-learn (Multinomial Naive Bayes)
Tensorflow (CNN, RNN, feedforward)
Streamlit
Breast Cancer Classification Scikit-learn
Spark
IBM
Bank Churn Classification Scikit-learn
LightGBM
XGBoost
CatBoost
Data Engineering
Hotel Reviews Prefect
Spark
SQL
BigQuery
dbt
Terraform
Looker
GCP Orchestration
Linting
Formatting
Error Handling
Pre-Commit
IaC
CI/CD
Air Quality Switzerland Mage
dbt
SQL
BigQuery
Docker
Terraform
Looker
GCP Orchestration
IaC
Containerization
CI/CD
Miscellaneous
Justicio Web Scraping Beautiful Soup
MySQL
Error Handling

 

💸 Additionally, you can find my Power BI projects:

:basecamp: Last but not least, I also have a Tableau portfolio using groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced functions

 

🧮 Tech Stack

Visual Studio Code HTML5 CSS3 Jupyter Notebook MySQL SQLite Tableau PowerBI Looker Studio Python Pandas NumPy Plotly Matplotlib Databricks Spark scikit-learn TensorFlow OpenAI FastAPI Flask Docker Kubernetes Anaconda Linux Ubuntu Google Cloud AWS Terraform Prefect dbt MLflow GitHub Actions Git Streamlit

Pinned Loading

  1. crewai-rag-langchain-qdrant crewai-rag-langchain-qdrant Public

    Agentic RAG with Langchain, Qdrant and CrewAI

    Jupyter Notebook 41 8

  2. multimodal-llm-pymupdf4llm multimodal-llm-pymupdf4llm Public

    Multimodal LLM Application with PyMuPDF4LLM

    Jupyter Notebook 31 6

  3. rag-aws-qdrant rag-aws-qdrant Public

    Serverless Application with AWS Lambda and Qdrant for Semantic Search

    Python 15 3

  4. mlops-databricks-credit-default mlops-databricks-credit-default Public

    End-to-end MLOps Credit Default Project using DABs

    Python 11 9

  5. aws-bedrock-opensearch-langchain aws-bedrock-opensearch-langchain Public

    RAG Application with LangChain, Terraform, AWS Opensearch and AWS Bedrock

    Python 7 1

  6. rag-langchain-ragas rag-langchain-ragas Public

    RAG project for QA retrieval using LanChain and Ragas

    Jupyter Notebook 6 2