Skip to content

Analysis of "Mountain Project" which is one of the largest climbing-related websites in the world, by using SQL for data manipulation and Tableau dashboard for presenting insights.

License

Notifications You must be signed in to change notification settings

PoonAthitS/mountain-project-website-analysis

Repository files navigation

Mountain Project - Climbing Community Website Analysis

1. Introduction

As a member in the climbing and mountaineering community (both in Thailand and UK), I have spent a lot of my time on Mountain Project website which is one of the largest websites about the climbing-related topics. It serves as a guidebook or climbing information database with more than 100k climbing routes across the globe. As a data analyst, this piques my curiosity about relationship between variety of climbing-related data from the website such as number of routes, route height, difficulty grades, etc. Therefore, this project aims to process the raw data in SQL Server and present the insights in Tableau Dashboard.

2. data manipulation in SQL

In this project, the raw data contain many issues that are required to be addressed with SQL data manipulation/transformation, as follows.

  • A large amount of null values in multiple fields
  • Improper decimal places and units
  • Categorical data that are too varied to be analysed such as Route_type field
  • Outliers / errors such as impossible length, impossible number of pitches
  • Duplicated records (As users may share the same route as others')
  • Long field with separators (require to use STRING_SPLIT instead of the more user-friendly function: PARSENAME)
  • Special conditions in Location field which its values are different when the route is in USA

3. Tableau dashboard and key findings

The database in SQL Server is then loaded to Tableau to develop an interactive dashboard as depicted in the image below.



The dashboard provides the key findings as follows: (for example)

  • Most climbing routes are from the USA (Almost 80k routes), which is aligned with the fact this website is originated in the US
  • The majority of climbing routes are around 15m with 5.9 Difficulty grade (from mode + trend)
  • The satisfactions of the routes are decribed by voting stars which the trend shows that most common stars are 2.5 while the spikes support the reason that users are allowed to vote only in integer number.
  • The satisfactions of the users to each routes have the possitive relationship to the length of route (longer = more people like); However, from the graph, not a many routes are long.
  • Meanwhile, the more chanllenging routes (higher grade) also have more stars
  • Many routes that specifies their difficulty grade unclearly (such as 5.10 instead of 5.10c) are the longer routes which are likely to be multi-pitch route due to the fact that it is harder to define the exact grade on the climbing with many pitches (climbign sections)

4. About the programming

Data

The Raw data are gathered from Kaggle Mountain Project dataset which is scraped from Mountain Project website in 2020 by Patricia. Please check out her GitHub project which is about analyzing the sentiment of the forums on www.MountainProject.com to determine which climbing gear is thought to be the best

To learn more about Poon Athit S., visit his LinkedIn profile

All rights reserved 2022. All codes are developed and owned by Poon Athit S. or the metioned team member(s). If you use this code, please visit his LinkedIn and give him a skill endorsement in Data analytics and the aforementioned coding technologies.

About

Analysis of "Mountain Project" which is one of the largest climbing-related websites in the world, by using SQL for data manipulation and Tableau dashboard for presenting insights.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published