Skip to content

Latest commit

 

History

History
29 lines (18 loc) · 3.17 KB

about.md

File metadata and controls

29 lines (18 loc) · 3.17 KB

Value 4 Value in Action

The Podcast Index is an independent and open catalog of podcasts feeds serving as the backbone of what is referred to as the Podcasting 2.0 initiative. The data contained in the Podcast Index is available through a robust REST API as well as a SQLite database updated every week.

The PodcastIndex Dashboard is my attempt to give back to the amazing Podcasting 2.0 initiative. A key concept that drives the engagement and enthusiasm in this community is the unique ways each of us can contribute time, talent, and treasure to benefit everyone.

In previous episodes of Podcasting 2.0, Dave Jones lamented that duplicate podcast entries in the Podcast Index can cause annoying issues for many podcast apps and other services relying on the integrity of the index. Seeing an opportunity to help this amazing project, I sent a boost to the show in episode 156 to offer up a new solution powered by the R statistical computing language for identifiying potential duplicates alongside other data quality issues. Hence the objectives of this dashboard are to highlight potential duplicate podcast entries as well as perform quality assessments of the index to highlight potential issues.

rpodcast@getalby.com{width=10%}

Tech Stack

Much like the ethos behind podcasting 2.0, the PodcastIndex Dashboard is proudly built on the foundations of open-source:

  • Quarto technical publishing system with the new capability of dashboards.
  • The R project for statistical computing with the following amazing packages:

Analysis Pipeline

The duplicate records and data quality analysis pipelines are executed weekly (after the Podcast Index SQLite database is refreshed) as scheduled GitHub Action workflows. Visit the GitHub repository at https://github.com/rpodcast/pod-db-checker to find the following scripts: