A space for football analytics projects by Edd Webster, including a curated list of publicly available resources published by the football analytics community.
The README of this repository is a concise list of learning resources, data sources, libraries, papers, blogs, podcasts, etc., created by all those that have made contributions to the football analytics community. This will be a constant work in progress so if you can think of any resources that I've missed, or you yourself have created something that you believe should be added and is currently not available, please feel free to create a pull request or send me a message.
Credits to the Soccer Analytics Handbook
by Devin Pleuler, Awesome Soccer Analytics
by Matias Mascioto, and Jan Van Haaren's Soccer Analytics 2021 Review, Soccer Analytics 2020 Review and soccer-analytics-resources
Github repo, which were all used to plug gaps in the list once it was published. Credit also to Matias Singers for his awesome-readme
repository used to restyle this README.
If you like the repo, please feel free to give it a ⭐ (top right). Cheers!
For an Excel spreadsheet version of this README to keep track of the parts you have already read/worked on, see the following Google Sheets spreadsheet kindly put together by Melanie Loeper [link].
For more information about this repository and the author, I am available through all the following channels:
Table of Contents
- About This Repository and Author
- Table of Contents
- Prerequisites
- Repository Structure
- Notebooks
- Data Visualisation and Tableau
-
Resources
- Getting Started with Football Analytics
- Data
- Data Sources
- Event data
- Tracking data
- Aggregated Player/Team Performance data
- Team Rating data
- Physical data
- Results and Matchsheet data
- Financial, Valuation, and Transfer data
- Odds, Betting, and Predictions data
- Plotting tools
- Reference data
- Miscellaneous data
- Documentation
- Data Companies and Types
- Tutorials
- Libraries
- GitHub Repositories
- Apps
- Data Visualisation Resources and Tools
- Written Pieces
- Video
- YouTube Playlists
- YouTube Channels
- Video Analysis
- Webinars and Lectures
- Ted Talks
- Documentaries
- Match Highlights
- Other
- Podcasts
- Notable Figures and Twitter Accounts
- Events and Conferences
- Competitions
- Courses
- Jobs
- Discord / Slack Groups
- Key Concepts
- History of Football Analytics
- Expected Goals (xG) Modeling
- Web Scraping Football Data
- Tracking Data
- Pitch Control Modeling
- Passing Networks
- Possession Value (PV) Frameworks
- General
- Expected Threat (xT)
- Valuing Actions by Estimating Probabilities (VAEP)
- Goals Added (g+)
- On-Ball Value (OBV)
- Dixon Coles Modeling
- Player Similarity and Style Analysis
- Team Playing Style Analysis
- Reinforcement Learning for Football Simulation
- Set Pieces
- Radars
- Recruitment Analysis
- Player Valuation Modeling
- Quantifying Relative Club and League Strength
- Tactics
- Game Win Probability Modeling
- Goalkeeper Analysis
- Citations
- Contributing
- Star Tracker
- Acknowledgements
The only prerequisites for using this GitHub repo is that you have a computer, internet connection and the desire to learn more about football analytics.
The following open-source Python libraries listed below are some of the most commonly used in Data Science that feature in the the notebooks in this repository. Most of these libraries can be obtained by downloading and installing Anaconda. Step-by-step guides to do this can be found for Windows here and Mac here, as well as in the Anaconda documentation itself here.
- NumPy;
- pandas;
- matplotlib;
- Plotly;
- record linkage;
- scikit-learn;
- SciPy;
- XGBoost;
- Hyperopt;
- MLflow; and
- SHAP.
The contents of this GitHub repository is organised as the following:
football analytics github repository
.
│
├── dashboards
│
├── data
│ │
│ ├── capology
│ │
│ ├── elo
│ │
│ ├── export
│ │
│ ├── fbref
│ │
│ ├── fifa
│ │
│ ├── guardian
│ │
│ ├── metrica-sports
│ │
│ ├── opta
│ │
│ ├── reference
│ │
│ ├── sb
│ │
│ ├── shots
│ │
│ ├── stats-perform
│ │
│ ├── stratabet
│ │
│ ├── tm
│ │
│ ├── touchline-analytics
│ │
│ ├── twenty-first-group
│ │
│ ├── understat
│ │
│ └── wyscout
│
├── docs
│ ├── centre-circle
│ ├── metrica-sports
│ ├── opta
│ ├── sb
│ ├── shots
│ ├── stratabet
│ └── wyscout
│
├── gif
│ └── fig
│
├── img
│ │
│ ├── club_badges
│ │
│ ├── eddwebster
│ │
│ ├── fig
│ │
│ ├── logos
│ │
│ ├── pitches
│ │
│ └── vizpiration
│
├── notebooks
│ │
│ ├── 1_data_scraping
│ │ ├── Capology Player Salary Web Scraping.ipynb
│ │ ├── FBref Player Stats Web Scraping.ipynb
│ │ └── TransferMarkt Player Bio and Status Web Scraping.ipynb
│ │
│ ├── 2_data_parsing
│ │ ├── ELO Team Ratings Data Parsing.ipynb
│ │ ├── StatsBomb Data Parsing.ipynb
│ │ └── Wyscout Data Parsing.ipynb
│ │
│ ├── 3_data_engineering
│ │ ├── Capology Player Salary Data Engineering.ipynb
│ │ ├── Centre Circle Opta CPL Data Engineering.ipynb
│ │ ├── FBref Player Stats Data Engineering.ipynb
│ │ ├── Opta #mcfcanalytics PL 2011-2012.ipynb
│ │ ├── StatsBomb Data Engineering.ipynb
│ │ ├── StrataBet Data Engineering.ipynb
│ │ ├── The Guardian Player Recorded Transfer Fees Data Engineering.ipynb
│ │ ├── TransferMarkt Historical Market Value Data Engineering.ipynb
│ │ ├── TransferMarkt Player Bio and Status Data Engineering.ipynb
│ │ ├── TransferMarkt Player Recorded Transfer Fees Data Engineering.ipynb
│ │ ├── Understat Data Engineering.ipynb
│ │ └── Wyscout Data Engineering.ipynb
│ │
│ ├── 4_data_unification
│ │ └── Unification of Aggregated Seasonal Football Datasets.ipynb
│ │
│ ├── 5_data_analysis_and_projects
│ │ │
│ │ ├── player_similarity_and_clustering
│ │ │ └── PCA and K-Means Clustering of 'Piqué-like' Defenders.ipynb
│ │ │
│ │ ├──tracking_data
│ │ │ │
│ │ │ ├── metrica_sports
│ │ │ │ └── Metrica Tracking Data EDA.ipynb
│ │ │ │
│ │ │ └── signality
│ │ │ ├── Signality Tracking Data Engineering.ipynb
│ │ │ └── Signality Tracking Data EDA.ipynb
│ │ │
│ │ └──xg_modeling
│ │ │ │
│ │ │ ├── shots_dataset
│ │ │ │ │
│ │ │ │ ├── chance_quality_modelling
│ │ │ │ │ ├── 1) Logistic Regression Expected Goals Model.ipynb
│ │ │ │ │ ├── 2) XGBoost Expected Goals Model.ipynb
│ │ │ │ │ └── 3) CatBoost Expected Goals Model.ipynb
│ │ │ │ │
│ │ │ │ └── metrica-sports
│ │ │ │ └── Metrica Sports.ipynb
│ │ │ │
│ │ │ ├── statsbomb_dataset
│ │ │ │ └── Introduction to Building Expected Goals Models Using StatsBomb 360 Data.ipynb
│ │ │ │
│ │ │ └── opta_dataset
│ │ │ └── Training of an Expected Goals Model Using Opta Event Data.ipynb
│ │ │
│ └── 6_data_visualisation
│
├── research
│ ├── papers
│ └── slides
│
├── scripts
│
├── spreadsheets
│
└── video
Nearly all code in this repository is in Jupyter notebooks, organised in the following workflow:
- Webscraping;
- Data Parsing;
- Data Engineering;
- Data Unification; and
- Data Analysis - projects include working with Tracking data, constructing VAEP models (as introduced by SciSports), building xG models using Logistic Regression, Random Forests and Gradient Booested Decision Tree algorithms such as XGBoost and CatBoost, and analysing player similarity using PCA and K-Means clustering).
For Tableau dashboards produced using the data engineered in the notebooks in this repository, please see my Tableau Public profile: public.tableau.com/profile/edd.webster.
Example Tableau dashboards:
- 2018 FIFA Men's World Cup;
- FA WSL;
- ‘Big 5’ European leagues;
- EFL;
- StrataBet Chance creation; and
- Opta #mcfcanalytics (see #mcfcanalytics).
Good resources for those new for the use of data in football:
- Articles and blog posts:
- What do you need to learn to work in football analytics? by David Sumpter for Barca Innovation Hub;
- Getting Into Scouting by Luke Griffin
- You Want to be a Performance Analyst? by Rob Carroll
- An Introduction to Soccer Analytics by John Muller - check out his Newsletter space space space
- Introduction to Analytics in...Soccer by Valentin Stolbunov
- Getting into Sports Analytics and Getting into Sports Analytics 2.0 by Sam Gregory
- Soccer Analytics 101 by Kevin Minkus (using Web Archive)
- A Career in Football Analytics blog posts by Benoit Pimpaud. Check out his Substack newsletter From An Engineer Sight. See also the accompanying Twitter thread by Jan Van Haaren that discusses these posts [link]
- Football Reference 101 — Finding your way through a gold mine by Ninad Barbadikar
- Mikhail Zhilkin: How to hire your first data scientist by Training Ground Guru;
- Gerard Moore on the "challenging but extremely rewarding" life" of a professional football analyst for Twenty3
- How to get started in data and the football industry by Liam Henshaw
- How to get into football analysis by La Notice
- Getting Started with Football Analytics by OddAlerts
- Want to Learn Football Analytics? by Irfan Alghani Khalid
- How to get a job in Sports Analysis... by Chris Gill
- 7 Easy Steps to Get Started in Football Data & Analytics by Jobs in Football
- 11 tips to get started in the Football industry by Jobs in Football
- A Friendly Introduction to FPL Analytics by Sertalp B. Çay
- GitHub repositories:
- Twitter threads:
- Measureables (Brendan Kent)'s Sports Analytics 101 unrolled Twitter thread [link]:
- Tom Worville's Twitter thread
- Will Spearman's Twitter thread
- Jan Van Haaren's Twitter thread for free, open-source software libraries for computing and visualising advanced soccer analytics metrics
- Measureables (Brendan Kent)'s Twitter thread for resources for learning to code in the context of sports analytics [link]
- Sancho Quinn's unrolled Twitter thread for learning more about video/performance analysis [link]
- Ninad Barbadikar's 'big football analytics' Twitter thread for getting started with football analytics [link]
- McKay Johns's Twitter threads for the best resources in football analytics [link] and [link]
- Joe Gallagher's Twitter thread for the best resources to get started [link]
- Sam Goldberg's Twitter thread for "lessons American Soccer Analysis wish we knew prior to working in sports analytics." [link]
- Floris Goes-Smit's Tweet's:
- Mathew Barlowe's Twitter thread for "how to get into the sports analytics industry" [link]
- Aaron Moniz's Tweet and responses [link]
- LinkedIn Posts:
- WHERE TO LEARN FOOTBALL ANALYTICS? by Irfan Alghani Khalid
- The following LinkedIn post by Hadi Sotudeh
- Videos:
- Friends of Tracking videos:
- How to become a football data scientist with Pascal Bauer, Javier Fernández, Sudarshan 'Suds Gopaladesikan, Fran Peralta, and David Sumpter
- Tools for getting started in football analytics. talk for Friends of Tracking with David Sumpter, Laurie Shaw, Pascal Bauer, Sudarshan 'Suds' Gopaladesikan and Fran Peralta
- What do data analysts and data scientists do at a football club? talk for Friends of Tracking with David Sumpter, Ashwin Raman, Hannah Roberts, Sam Gregory, and Rob Suddaby;
- HANIC Panel "How to get into Sports Analytics & Media + Analytics" with Alison Lukan, Sarah Bailey, Harman Dayal, Asmae Toumi Mike Johnson, Alison Lukan;
- Careers in Sports Analytics
- Chris Gill's Sports Analysis YouTube Channel, including videos for Writing the perfect CV, How to get a job in sports analysis, LinkedIn tips, amoungst other videos added regularly
- Friends of Tracking videos:
- Glossaries:
- The Athletic’s football analytics glossary: explaining xG, PPDA, field tilt and how to use them by Mark Carey and Tom Worville (requires subscription)
- Stat Glossary by Ashwin Raman;
- Football Analytics Glossary by Ashwin Raman and Mark Thompson
- Podcasts:
- Fanalytics podcast with Mike Lewis - Getting Your Foot in the Door with Sean Steffen
- What is sports analytics? episode of the Measureables podcast by Measureables (Brendan Kent)
- Some of the useful resources in Football Analytics by @VenkyReddevil;
All publicly available data sources and datasets relating to football, from Tracking data, Event data, aggregated player performance data, detailed match statistics, injury records and transfer values, and more.
Data sources that have been used in the code and analysis in this repository can be found in the data
subfolder of this repository or in Google Drive (due to GitHub's 100mb file limit) [link]. All code however in this repository should enable you to scrape, parse, and engineer the datasets as per the output used for analysis and visualisations featured..
To learn more about the different types of data available, such as Event and Tracking data, please see the "Where can I get data?" section of Devin Pleuler's soccer_analytics_handbook
[link].
For a quick primer of the free football data resources available, see the following Twitter thread by James Nalton [link].
- StatsBomb Open Data - 360 data for UEFA Men's Euro 2020 and event data for the FA Women's Super League (18/19-20/21), The Lionel Messi Data Biography (04/05-19/20), Arsenal Invincibles Season (03/03), UEFA Men's Champions League (99/00-18/19), FIFA Men's 2018 World Cup, FIFA Women's 2019 World Cup, NWSL 2018;
- StrataData Chance shooting data provided by StrataBet (now defunct);
- Soccer Video and Player Position Dataset - dataset of elite soccer player movements and corresponding videos. See the accompanying paper [link];
- Opta Sports match-by-match aggregated player performance data for the 11/12 season and F24 Event data for a 11/12 match of Manchester City vs. Bolton Wanders [link] as part of the #mcfcanalytics initiative
- Understat shooting and meta data including xG values for the 'Big 5' European leagues and Russian Premier League
- This data can be scraped in the following ways:
- Python:
ScraperFC
by Owen Seymourunderstat
by Amos Bastian (see the following for docs [link])scraping-understat-dataset
by Douglas
- R:
understatr
package orworldfootballR
package by Jason Zivkovic (see guide [link]).
- Python:
- This data also also regularly made available by the following contributers
- Abrar via Kaggle - see [link]
- Sagnik Das, using code created by both him and Mark Wilkins - see [link] for shot data, [link] for meta data, and Mark's Tweet [link])
- This data can be scraped in the following ways:
- Opta Event data (available through WhoScored?) for twenty leagues, including the 'Big 5' European leagues, going back since the 09/10 season. This data can can be scraped using:
-
ScraperFC
library by Owen Seymour - The method in the following blog post by Karol Działowski - Football Data Visualizations - Passing Networks. This blog post on how to create passing networks from first principles, using Opta Event data acquired from WhoScored, with the subsequent data visualised using matplotlib. - Wyscout Event data for the 17/18 season for the 'Big 5' European leagues, Euro 2016 Chanpionship, and 2018 World Cup made available by Luca Pappalardo, Alessio Rossi, and Paolo Cintia. See their paper A public data set of spatio-temporal match events in soccer competitions
- Last Row Tracking-like data by Ricardo Tavares. See the Liverpool Analytics Challenge for which this data was used (winners discussed on Friends of Tracking [link])
- Metrica Sports Sample Tracking and corresponding Event data. For code to work with this data, see the
LaurieOnTracking
GitHub repo by Laurie Shaw and the corresponding Friends of Tracking tutorials - Signality Tracking data. The password to download the data is not publicly available, but can be found in the Uppsala Mathematical Modelling of Football Slack group [link]. For access, contact Novosom Salvador Twitter and rsalvadords@gmail.com, or feel free to contact myself. Note, that the 2nd half of the Hammarby-Örebro match is incomplete
- SkillCorner broadcast Tracking data
- DAVIES estimated player evaluation data by Sam Goldberg and Mike Imburgio for American Soccer Analysis
- FBref season-on-season aggregated player performance data provided by StatsBomb. See my FBref Player Stats Web Scraping notebook for Python code to scrape FBref data or access saved CSV files in data subfolder;
- This data can also be scraped in the following ways:
- Python:
- Data can be read as a DataFrame using panda's
read_html
function. There is a direct link to this table under Share & Export --> Embed this Table (see the following StackOverflow answer for instructions [link]) ScraperFC
by Owen SeymourScrape-FBref-data
by Parth Athale, ich in turn was written using code from Christopher Martin's repository, however, this code hasn't worked since around February 2021.
- Data can be read as a DataFrame using panda's
- R: Jason Zivkovic 's
worldfootballR
package (see guide [link]) - Google Sheets: see Rob Carroll's YouTube tutorial [link].
- Python:
- Every FBref metric for every 2020-21 'Big 5' European league player by Ronan, see [link]], [link] and [Tweet]. A 'tidied' version have been made by goaltergeist, see [link]
- 2,823 players in Europe's top 5 leagues on FBref, with their positions as listed on Transfermarkt by Rahul Iyer, see [link] and [Tweet]
- This data can also be scraped in the following ways:
- Stats Perform and Centre Circle Canadian Premiere League Event data. See Google Drive [link]
- ELO club rankings. See their API [link]
- FiveThirtyEight Club Ranking - Global Club Soccer Rankings. How 637 international club teams compare by Soccer Power Index
- 2018 FIFA World Cup Rosters - goals, caps, club, and date of birth for players on 2018 FIFA World Cup rosters. Source: data.world
- engsoccerdata - English and European soccer results 1871-2017
- FIFA World Cup Match Results - matchups and results of FIFA World Cup matches from 1930 - 2014. Source: data.world
- FotMob - dataset including team and play stats including xG and post-shot xG.
- Football Lineups
international_results
- repository of 42,452 results of international football matches starting from the very first official match in 1972 up to 2019- smarterscout - scouting and player rating information
- SofaScore - live scores, lineups, standings, heatmaps, and basic teams, coaches and player data
- Soccerway - matchsheet data
- Capology - a sports salaries platform. See my Capology Player Salary Web Scraping notebook for Python code to scrape Capology data or access saved CSV files in data subfolder;
- KPMG Football Benchmark player valuation data;
- The Price of Football Master Spreadsheet - data from the finance/business aspect of football by Kieran Maguire;
- spotrac - player contracts, salaries, and transfer information for the Premier League, MLS, and NWSL;
- TransferMarket player bio and fiscal data. See my TransferMarkt Player Bio and Status Web Scraping notebook for Python code to scrape TransferMarkt data or access saved CSV files in data subfolder;
- This data can also be scraped in the following ways:
- Python:
Tyrone Mings
webscraper by FCrSTATS - R:
worldfootballR
package by Jason Zivkovic (see guide [link])
- Python:
- An extract of player data for 2010-2021 for the 'Big 5' European leagues has been made available by John Muller, see [link]
- This data can also be scraped in the following ways:
- Transfers data:
transfers
GitHub repo for European football clubs' player transfers from 1992/93-2020/21 (as per TransferMarkt) by ewenme- Player Transfer Data collated by Tom Worville (see Tweet [link]
- BetExplorer - odds data;
- FiveThirtyEight Soccer Predictions database - football prediction data;
- Football-Data.co.uk - free bets and football betting, historical football results and a betting odds archive, live scores, odds comparison, betting advice and betting articles;
- International football results from 1872 to 2020 - an up-to-date dataset of over 40,000 international football results by Mart Jürisoo;
Also see Mark Wilkin's Twitter thread [link]:
- Football (soccer) pitch tracker by John Burn-Murdoch
- Expected Goals Event Logger by Ben Torvaney
- Chalkboard by Neil Charles
- League-wide xT values from the 2017-18 Premier League season (12x8 grid) by Karun Singh [link]
- EPV grid by Laurie Shaw [link]
- Zones on a pitch for Tableau visualisation by Rob Carroll [link]
- Alphabetic country codes [link]
- Awesome Football: A collection of awesome football (national teams, clubs, match schedules, players, stadiums, etc.) datasets;
- Data Hub Football data;
- European Soccer Database - 25k+ matches, players & teams attributes for European Professional Football
- FIFA 15-22 player rating data scraped from SoFIFA by Stefano Leone;
- FIFA 18 Player Ratings - 17k+ players, 70+ attributes extracted from FIFA 18, provided by sofifa
FootballData
- "A hodgepodge of JSON and CSV Football data"footballcsv
- Historical soccer results in CSV format;- football.db - A free and open public domain football database & schema for use in any (programming) language (e.g. uses plain datasets);
- Football Geek by Dinesh Vatvani (site now on hiatus);
- Football xG;
- Guide to Football/Soccer data and APIs by Joe Kampschmidt;
- My Football Facts;
- Physio Room;
- PlusMinusData - play by play data from espn.com;
- Rec.Sport.Soccer Statistics Foundation - Historical league tables and football results;
- RoboCup Soccer Simulator - RoboCup Soccer Simulator Data;
- Squawka;
- Stat Bunker;
- Tableau data resources including sports data;
- Transfer League;
- Twelve Football; and
- wosostats - Data about women's soccer from around the world.
All documentation saved locally in the documentation subfolder, including:
- DataFactory
- InStat
- K-Sport
- Opta Sports
- smarterscout
- Sportlogiq
- Sport radar
- Stats Peform
- StatsBomb
- StrataBet (now defunct)
- TransferMarket
- understat
- WhoScored? (data provided by Opta Sports data)
- Wyscout
- Catapult
- ChyronHego
- Metrica Sports
- Second Spectrum
- Signality
- SkillCorner
- STATS SportVU
- Kinexon
- Oliver
- dataFootball
- ERIC Sports
- Futbolytics
- hudl
- LBi Dynasty
- LongoMatch
- MEDIACOACH
- nacsport
- Olocip
- SICO
- Wise
- Friends of Tracking YouTube channel [link] and Mathematical Modelling of Football course by Uppsala University [link]. The GitHub repo with all code featured can be found at the following [link]. Lectures of note include:
- Laurie Shaw's Metrica Sports Tracking data series for Friends of Tracking - Introduction, Measuring Physical Performance, Pitch Control modelling, and Valuing Actions. See the following for code [link];
- Lotte Bransen and Jan Van Haaren's 'Valuating Actions in Football' series - Valuing Actions in Football: Introduction, Valuing Actions in Football 1: From Wyscout Data to Rating Players, Valuing Actions in Football 2: Generating Features, Valuing Actions in Football 3: Training Machine Learning Models, and Valuing Actions in Football 4: Analyzing Models and Results. See the following for code [link];
- David Sumpter's Expected Goals webinars - How to Build An Expected Goals Model 1: Data and Model, How to Build An Expected Goals Model 2: Statistical fitting, and The Ultimate Guide to Expected Goals. See the following for code 3xGModel, 4LinearRegression, 5xGModelFit.py, and 6MeasuresOfFit;
- Peter McKeever's 'Good practice in data visualisation' webinar. See the following for code [link];
- Serio Llana's step-by-step guide for creating Passing Networks [link];
- Luca Pappalardo and Paolo Cintia's step-by-step guide to exploring the Wyscout Event data - Video 1 and Video 2. See their paper A public data set of spatio-temporal match events in soccer competitions.
- Soccer Analytics Handbook by Devin Pleuler. See tutorial notebooks (also available in Google Colab): 1. Data Extraction & Transformation, 2. Linear Regression, 3. Logistic Regression, 4. Clustering, 5. Database Population & Querying, 7. Data Visualization, 8. Non-Negative Matrix, 9. Pitch Dominance, 10. Convolutional Neural Networks;
- FC Python tutorials [link];
- DataViz, Python, and matplotlib tutorials by Peter McKeever [link] - his website is currently in redevelopment, with many of the old tutorials not currently available (28/02/2021). Check out his revamped How to Draw a Football Pitch tutorial;
- McKay Johns YouTube channel;
- soccer_analytics GitHub repo by CleKraus - a Python project that facilitates the starting point for analytics
- Python for Fantasy Football series by Fantasy Futopia (Thomas Whelan). This series covers the basics of working with data in Python, working with APIs and parsing StatsBomb JSON data, scraping data using Beautifulsoup and Selenium, and Machine Learning with scikit-learn and XGBoost, See GitHub repo for all code [link];
- Learn how to make visualisations in Python's matplotlib with football (soccer) data by Son of a Corner. If you appreciate their work, consider supporting them by joining their membership program [link]
- Football Data Visualizations - Passing Networks by Karol Działowski - a great blog post on how to create passing networks from first principles, using Opta Event data acquired from WhoScored. This data is then visualised using matplotlib.
- Tech how-to: build your own Expected Goals model by Jan Van Haaren and SciSports. See the Bitbucket repository for all code [link]; and
Football-Analytics-With-Python
by Anmol Durgapal.
- FCrSTATS tutorials [link];
- Mark Wilkins's BiscuitChaserFC blog. See his Twitter thread of R tutorials [link]. Tutorials include: Shot Maps In R With StatsBomb Data, Getting Started with StatsBomb Data in R, Understat Meta/Shot Data, FBref - Pressures;
- Sudarshan 'Suds' Golaladesikan's R series for Friends of Tracking - Getting Started with R + StatsBomb | Analyzing Squad Rotation & Clustering Passes and creating interactive shot maps - Part 1/3), Part 2/3, (I believe no part 3 currently). See the following for code [link]; and
- Creating a pass flow graph in R by Abhishek Mishra.
Check out the Tableau for Sports Discord server organised by Ninad Barbadikar, to interact with a community of Tableau developers
For a YouTube playlist of Tableau-football videos and tutorials that I have collated from various sources including the Tableau Football User Group, Rob Carroll, Tom Goodall, and Ninad Barbadikar, see the following [link].
- Tableau Football User Group - featuring Eva Murray, Oscar Hall, James Smith, Rob Carroll, Tom Goodall, Ravi Mistry, Adam Cook, Hannah Roberts, Chris Baker, Rusty Parker, Ruud van Elk, Johannes Riegger, and Sebastien Coustou;
- Tableau for Sport by Rob Carroll - completely free tutorials for using football data in Tableau, including creating shot maps, pass maps, pass matrxces, xG race-chart timelines. See also his YouTube playlist [link];
- Tom Goodall's Tactics, Training & Tableau: Football Tableau User Group. Check out his Football Tableau training courses [link. Check out also as an unrolled Twitter thread, how he uses Tableau to create an opposition report for Burton vs. Gillingham on 9th January 2021 [link];
- Visually Analysing Direct Set Pieces in Football using StatsBomb Data, R and Tableau by James Smith;
- CJ Mayes's Tableau blog, with posts including how to make a Radial Tournament Bracket, Understat data in Tableau, and Player Metrics Pizza plots;
- Tableau Tunnel series by Ninad Barbadikar. His tutorials include: Welcome to the Tableau Tunnel!, Improving on scatter-plots and building basic bar graphs, Player dashboards, Tableau Tunnel #4 — Making Beeswarm plots, Creating Shot Maps, Pass Maps in Football using FC Python Video Coder for Tableau, and Creating xG Trendlines on Tableau using moving averages. Check out his Twitter thread [link] and his YouTube channel [link];
- Medium blog posts by Sagnik Das - Tableau Guide #1: Making Shot Maps, Tableau Guide #2: Making Pass Maps, Tableau Guide #3: Convex Hulls, Tableau Guide #4 : Football Radars;
- Medium blog posts by Rahul Iyer - Guide to Creating Passing Networks in Tableau , Guide to Creating Pass Sonars in Tableau, Guide to Creating Hexagonal Shot Maps in Tableau;
- A Guide to Player Comparison Bar Graphs (And How I Make Them) by Ashwin Raman
- Creating a Shop Map by James Vaughan;
- How to create Football Pitches/Goals as Backgrounds in Tableau by James Smith;
- Creating Waffle Charts in Tableau by Harsh Krishna;
- Exporting your pass flow map to Tableau by Abhishek Mishra;
- Plotting Starting XIs in Tableau by Marton Balla;
- Tableau Public profiles of note (not exhaustive by any means):
- Ashwin Raman
- Brian Prestidge
- Carlon Carpenter
- CJ Mayes
- Eva Murray
- Foot en Stats
- James Smith
- James Vaughan - see his Twitter thread of projects [link]
- Mark Carey
- Matt Trevillion
- Ninad Barbadikar - see his Tableau Tunnel series
- Oscar Hall
- Paul Riley
- Peter McKeever
- Rahul Iyer
- Ravi Mistry
- Rob Carroll
- Rob Suddaby
- Sathish Prasad V.T - see his Canadian Premier League post-match reports [link]
- Sancho Quinn
- Sushruta Nandy
- Tom Worville
For a YouTube playlist of Power BI-football videos and tutorials that I have collated from various sources including Futbol AnalysR and PowerBI for Sports, see the following [link].
- Futbol AnalysR by Josh Trewin - for PowerBI tutorials. Check out his website [link]
- PowerBI for Sports by Roberto D'Onofrio Rondón
- Maram AlBaharna's Medium post - Yes, Powerpoint: xG Trend Line;
- Luke Griffin's pitch graphics - slides. Drop him a donation of PayPal if you're using his work [link]. See original Tweet [link]; and
- Tony Bambrick's short video describing the process of creating an animated tactics board using PowerPoint. See original Tweet [link].
codeball
- data driven tactical and video analysis of soccer games;Football Packing
- a Python package to calculate packing rate for a given pass in football by Samira Kumar. This is a variation of the metric created by Impect;kloppy
- a Python package providing (de)serializers for soccer tracking- and event data, standardized data models, filters, and transformers designed to make working with different tracking- and event data like a breeze. See the YouTube tutorial [link];matplotsoccer
- a Python library for visualising soccer event data by Tom Decroos;mplsoccer
- a Python library for drawing soccer/football pitches in Matplotlib and loading StatsBomb open-data by Andrew Rowlinson;nayra
- API that allows you track soccer player from camera inputs, and evaluate them with an Expected Discounted Goal (EDG) Agent. See the Evaluating Soccer Player paper by Paul Garnier and Théophane Gregoir;northpitch
- a Python football plotting library that sits on top of Matplotlib by Devin Pleuler;PCA_Player_Finder
by Parth Athale;PySport
includingPySport Soccer
- collection of open-source sport packages including many of those mentioned in this section, by Koen Vossen;PyWaffle
- an open source, MIT-licensed Python package for plotting waffle charts by Peter McKeever;ScraperFC
- a Python package by Owen Seymour to scrape FiveThirtyEight data, aggregated StatsBomb data from FBref, Understat shooting and player meta data including values for xG, xA, xGChain, xGBuildup, player salary data from Capology, and WhoScored? Opta Event provided by StatsPerform;Scrape-FBref-data
- Python library to scrape aggregated StatsBomb data via FBref by Parthe Athale, which in turn was updated from Christopher Martin's repository;statsbombapi
- a Python API wrapper and dataclasses for StatsBomb data;statsbombpy
- a Python library written by Francisco Goitia to access StatsBomb data;statsbomb-parser
- Python library to convert StatsBomb's JSON data into easy-to-use CSV format;socceraction
- a Python library for valuing the individual actions performed by soccer players. Includes an Expected Threat (xT) implementation by Tom Decroos et. al.;soccermix
- a soft clustering technique based on mixture models that decomposes event stream data into a number of prototypical actions of a specific type, location, and direction by Tom Deccoos and ML-KULeuven;soccer_xg
- a Python package for training and analyzing expected goals (xG) models in football;soccerplots
- a Python package that can be used for making visualizations for football analytics by Anmol Durgapal;sync.soccer
- a Python package to synchronise football datasets, so that an event in one dataset is matched to the corresponding event or snapshot in the other by Marek Kwiatkowski. This repository contains an implementation that aligns Opta's (now Stat Perform) F24 feeds to ChyronHego's Tracab files. More formats may be added in the future. See the following blog post for methodology [link];tmscrape
- a Python TransferMarkt webscraper by danzn1;Tyrone Mings
- a Python TransferMarkt webscraper by FCrSTATS;understat
- a Python webscraper by Amos Bastian to scrape Understat shooting and player meta data.
ggsoccer
- a soccer visualisation library in R from Ben Torvaney;ggshakeR
- an analysis and visualisation R package that works with publicly available soccer data by Abhishek Mishra. See the library at the following [link]soccerAnimate
- an R package to create 2D animations of soccer tracking data;soccermatics
- an R package for the visualisation and analysis of soccer tracking and event data by Joe Gallagher;- soccer_ggplots by Ryo Nakagawara;
worldfootballR
- a R package to scrape aggregated StatsBomb data via FBref and valuations and transfer data from TransferMarkt by Jason Zivkovic (see guide on how to use this package [link]); andunderstatr
- a R package to scrape Understat shooting and player meta data.
analytics-handbook
by Devin PleulerExploring spatio-temporal soccer events using public event data
by Luca Pappalardo, Alessio Rossi, and Paolo Cintia. See the paper: A public data set of spatio-temporal match events in soccer competitions;expected_goals_deep_dive
by Andrew Puopolo;Expected Goals Thesis
by Andrew RowlinsonFriends-of-Tracking-Data-FoTD
;footballcsv
- Historical soccer results in CSV format;football-crunching
by Ricardo Tavares. Accompanying Medium posts [link];Google Research Football
;LaurieOnTracking
by Laurie Shaw - Python code for working with Metrica tracking data;Metrica-pitch-control
by Will Thompson - a Python implementation of Javier Fernández and Luke Bornn's Pitch Control model from their paper Wide Open Spaces: A statistical technique for measuring space creation in professional soccer (2018) and Will Spearman's Pitch Control model from his paper Beyond Expected Goals (2018). The respectively Google Colab notebooks are available [link] and [link];Pass-Flow
- create animated flow velocity fields using passing data by Open Goal App;passing-networks-in-python
- repository for building customizable passing networks with matplotlib for Friends of Tracking. The code is prepared to use both eventing (StatsBomb) and tracking data (Metrica Sports);penaltyblog
- a package tbat contains code from http://pena.lt/y/blog for working with footbal data by Martin Eastwood;pitchly
- Python Plotly wrapper for simple football plots by Vinay Warrier;SoccermaticsForPython
- repo by David Sumpter dedicated for people getting started with Python using the concepts derived from the book Soccermaticssoccer_analytics
by CleKraus - a Python project trying to facilitate and being a starting point for analytics projects in soccer including EDA of Event data, goal kick analysis, passing analysis, xG modelling, and an introduction to Tracking data; andtracking_tagger
by @danzn1. See app [[link];understat-db
- a project to scrape data from Understat and store it in a Postgres database by Ben Torvaney.Valuing actions in football
by Lotte Bransen and Jan Van Haaren of SciSports.
- StatsBomb R Guide 2.0 by Euan Dewar
FoundationsInR
by Sudarshan Golaladesikan - getting started with R using the StatsBomb dataset.
- ALPHONSO 2.0 by Sam Goldberg and Mike Imburgio for American Soccer Analysis
- Football Slices by Football Slices (DyslexicDdue]) (now offline)
- Player Finishing Overview by Harsh Krishna, an app that generates a dashboard of visualisations that can be useful in getting an overview of a football player's finishing ability. See the accompanying Twitter thread [link]
- Player Replacement Shortlist Generator by Hugh Klein. See the accompanying Twitter thread [link]
- Pizza Vizz App by Johnny Vizz. See the accompanying blog post [link] and Twitter thread [link] to access the Vizz App (subscription only)
- Statsbomb-Json-Parse by Rob Carroll. A small app that lets you input a StatsBomb JSON file and get a CSV file back (you need to create a free account to run it. For a video explainer, see the following [link]
- Scouting Tool by Renzo Cammi - a scouting tool created with Streamlit from StatsBomb data via FBref, that lets you filter players stats from the 'Big 5' European Leagues
- Soccer Analytics Library by Lars Maurath
- Tracking Tagger by @danzn1. See GitHub repo [[link]
- Twelve Football
- YouTubeCoder Event video tagging by FC Python
Resources to aid data visualisation:
- How to create Football Pitches/Goals as Backgrounds in Tableau by James Smith. Download his pitch and goal templates here
Viz-Templates
by McKay Johns- For club badges for the 'Big 5' European leagues and English leagues, see the
club_badges
subfolder of this GitHub repository. See also the Club crests put together by Ninad Barbadikar that is available for download. - Matplotlib for Football - gallery for Typical Football Plots created using matplotlib by Abhishek Sharma. See his Twitter thread [link] and GitHub repository [link];
- PL 21-22 player images by Karan Popli;
- StatsBomb media pack
- URLs of images of all first team players from the Premier League website by Alfred - see CSV
- Team colour codes, for the HEX, RGB, and HSL colours of top flight football teams; and
- Pitch templates, put together by Tony Bambrick (see tweet [link]);
- [Luke Griffin]'s pitch graphics - slides. Drop him a donation of PayPal if you're using his work [link]. See original Tweet [link];
- Peter McKeever's 'Good practice in data visualisation' webinar for Friends of Tracking. See the following for code [link];
- John Burn-Murdoch's Data visualisation is about words webinar for Friends of Tracking; and
- Peter McKeever's Twitter thread about data viz [link].
Many of these blog posts are recommended in Sam Gregory's Best Football Analytics Pieces piece and Tom Worville's “What’s the best Football Analytics piece you’ve ever read?”, both articles now a few years old. This section is very subjective so if I've missed anything obvious, apologies.
- Assessing The Performance of Premier League Goalscorers by Sam Green;
- Counting Across Borders by Ben Torvaney;
- Is Soccer Wrong About Long Shots? by John Muller;
- Where Goals Come From by Jamon Moore and Carlon Carpenter;
- Defending Your Patch by Thom Lawrence;
- The DePO Models: Bringing Moneyball to Professional Soccer by Sam Goldberg and Mike Imburgio;
- Using Data to Analyse Team Formations by Laurie Shaw;
- Structure in football: putting formations into context by Laurie Shaw;
- Inside Arsenal’s Attack: In-Depth Analysis Of Arteta’s Problems & Possible Solutions by Ashwin Raman;
- Premier League Projections and New Expected Goals by Michael Caley;
- Introducing Passing Combinations by Piotr Wawrzynów;
- Pass Footedness in the Premier League by James Yorke;
- Messi Walks Better Than Most Players Run by Bobby Gardiner;
- Introduction Expected Goals on Target (xGoT) by Jonny Whitmore;
- Tools for tiny teams by Ben Torvaney:
- Anatomy of a Shot by Thom Lawrence;
- Soccer Analytics 101 by Kevin Minkus;
- An Introduction to Soccer Analytics by John Muller;
- Valuing On-the-Ball Actions in Soccer: A Critical Comparison of xT and VAEP by Jesse Davis, Tom Decroos, Pieter Robberechts, Maaike Van Roy;
- Game of Throw-Ins by Eliot McKinley;
- Expected Threat by Karun Singh. Check out also as an unrolled Twitter thread [link] Karun's Twitter thread for the many resources out there around this topic, including: Episode 19 of The Football Fanalytics Podcast, Karun's StatsBomb conference presentation [link] and slides [link], Rob Hickman's StatsBomb conference presentation where he extended xT to take defensive risk into account [link], Last Row View (Ricardo Tavares)'s blog post for evaluating off-the-ball player movements by combining xT and tracking data, and Karun's xT values as a 12x8 grid to download as a JSON file [link];
- Lionel Messi’s ten stages of greatness by Michael Cox and Tom Worville;
- Passing Out at the Back by Will Gürpinar-Morgan;
- The 10 Commandments of Football Analytics by Tom Worville;
- Borussia Dortmund - What's gone wrong? by Colin Trainor for StatsBomb;
- Breaking Down Set Pieces: Picks, Packs, Stacks and More by Euan Dewar;
- Data Based Coaching: How to Incorporate Data-Driven Decision into Your Coaching Workflow by Kieran Doyle
- Coaches Reward Goalscorers. But Should They? by Eliot McKinley and John Muller
- Soccer Analysis Summary at Behind the Net by Hawerchuk
The following list contains those blogs that are still maintained, as well as the original blogs from the OGs of football analytics.
For a Twitter thread of the football analytics blogs from 2009 an earlier, see the following Twitter thread from Tiotal Football [link].
- 11tegen11 by 11tegen (Sander IJtsma);
- 21st Club - blog posts available in hard-copy form in their Changing the Conversation series;
- 2+2=11 by Will Gürpinar-Morgan;
- 5 Added Minutes by Omar Chaudhuri (last updated 03/09/2016);
- 8 Yards 8 Feet by Simon Lock;
- Abel Lorincz by Abel Lorincz;
- Abhishek Amol Mishra's Medium blog - check out his Learning Machines With Me. series;
- Absolute Unit;
- All Things Football;
- Alex Rathke by Alex Rathke;
- American Soccer Analysis;
- Analyse Football by Ravi Ramineni (last updated 06//04/2015);
- The Analyst by Stats Perform;
- Analytics FC. For the blog, see [link];
- Attacking Center-back by JP Quinn;
- Barça Innovation Hub
- Benoit Pimpaud's Medium blog
- BiscuitChaserFC by Mark Wilkins. See his Twitter thread of R tutorials [link];
- Bosemessi GitHub blog by Soumyajit Bose;
- Brendan Kent. Check out his Sports Analytics 101 series;
- Brisink by Jerome
- Carey Analytics by Mark Carey;
- Crab Stats by Scott Willis
- Danny Page's Medium blog
- Dato Fútbol by Dato Fútbol (Ismael Gómez Schmidt)
- davidfombella.github.io by David Fombella
- DeepxG by Thom Lawrence (last updated 29/11/2017);
- Differentgame by Paul Riley;
- DTAI Sports Analytics Lab by KU Leuven, featuring posts from Jesse Davis, Pieter Robberechts, Maaike Van Roy, Lotte Bransen, Jan Van Haaren, Tom Decroos, and more;
- The Economics of Sport;
- EddWebster.com by Edd Webster
- EFL Numbers by EFL Numbers;
- EightyFivePoints by Laurie Shaw;
- Experimental 361 by Ben Mayhew;
- FC Python by FC Python;
- FiveThirtyEight Sports;
- Football Crunching by Ricardo Tavares;
- Football Data Science by Dr. Garry Gelade;
- Football Philosophy by Joost van der Leij;
- Football Science by Michael C. Rumpf
- Football Whispers
- Futbol AnalysR by Josh Trewin
- The Futebolist by Ashwin Raman
- Get Goalside! by Mark Thompson
- The Harvard Sports Analysis Collective
- Liam Henshaw's Medium blog
- Hockey Graphs
- Hudl
- James W Grayson by James W Grayson;
- Jan Van Haaren by Jan Van Haaren;
- jogall.github.io by Joe Gallagher;
- Karun Singh by Karun Singh;
- kubamichalczyk.github.io by Kuba Michalczyk
- kwiatkowski.io by Marek Kwiatkowski;
- The Last Man Analytics by The Last Man Anayltics (Ciaran Grant)
- lufcdata by @LUFCDATA;
- LukeBornn.com by Luke Bornn;
- Mackay Analytics by Nils Mackay;
- Mackinaw Stats by Mackinaw Stats;
- Maram AlBaharna's Medium blog;
- Mark's Notebook (Substack) and Mark's Notebook (Ghost) by Mark Thompson;
- Mixed kNuts by Ted Knutson including post pre-StatsBomb blog;
- MRKT Insights with Tim Keech, Ram Srinivas, Matt Lawrence, Kevin Elphick, and Andy McGregor. Formally Jay Socik;
- Modern Fitba (currently archived);
- Nandy47 GitHub blog by Sagnik Das;
- Ninad Barbadikar Medium blog by Ninad Barbadikar;
- North Yard Analytics by Dan Altman;
- openGoal by Charles William;
- Opta Pro - old blogs removed but can be found using Wayback Machine;
- patricklucey.com by Patrick Lucey;
- Penal.lt/y by Martin Eastwood;
- Piotr Wawrzynów – Football Analysis by Piotr Wawrzynów;
- Phil Birnbaum's Blog by Phil Birnbaum
- The Power of Goals by Mark Taylor;
- Proform AFC by Proform Analytics (Mladen Sormaz and Dan Nichol);
- Ravi Mistry's Medium blog;
- robert-hickman.eu;
- R by R(yo) by Ryo Nakagawara;
- SaddlersStats;
- Sam Gregory's Medium blog;
- SciSports;
- Sergi's Blog by Sergi_Lehkyi;
- The Significant Game by Lars Maurath;
- Soccermatics Medium blog by David Sumpter;
- soccerNurds;
- space space space by John Muller;
- StatDNA (last updated 01/06/2011 before Arsenal bought the company);
- StatsBomb;
- Stats Perform;
- Stats and snakeoil by Ben Torvaney;
- Tiago Estêvão's Medium blog by Tiago Estêvão;
- Tony ElHabr's blog by Tony ElHabr;
- Training Ground Guru. Check out their accompanying podcast [link];
- Tom Worville's Medium blog by Tom Worville (last updated 14/08/2017). Tom now writes for The Athletic [link];
- winningwithanalytics.com by Bill Gerrard;
- Wooly Jumpers for Goal Posts by The Woolster;
- Worville Analysis by Tom Worville;
- Wyscout;
- x+football by Niklas Hemmer
- xG per Shot by Parthe Athale; and
- Zonal Marking. by Michael Cox. Michael now writes for The Athletic [link].
Many of the papers included in this list have been included after reading Jan Van Haaren's Soccer Analytics 2021 Review and Soccer Analytics 2020 Review. Props to him for reading a paper a week and making his thoughts publicly available!
The papers included in this list have been
The following Shiny App from Lars Maurath is a great tool for looking up publications [link].
- 6MapNet: Representing Soccer Players from Tracking Data a Triplet Network (2021) by Hyunsung Kim, Jihun Kim, Dongwook Chung, Jonghyun Lee, Jinsung Yoon and Sang-Ki Ko;
- A Bayesian Approach to In-Game Win Probability in Soccer (2021) by Pieter Robberechts, Jan Van Haaren, and Jesse Davis. See the accompanying blog [link];
- A Career in Football: What Is Behind an Outstanding Market Value? (2021) by Balázs Ács and László Toka;
- A Copula-Based Hidden Markov Model for Classification of Tactics in Football (2021) by Marius Oetting. See accompanying NESSIS talk [link];
- A Framework for the Fine-Grained Evaluation of the Instantaneous Expected Value of Soccer Possessions (2021) by Javier Fernández, Luke Bornn and Daniel Cervone;
- A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer) (2021) by Gabriel Anzer and Pascal Bauer;
- A Poisson Betting Model with a Kelly Criterion Element for European Soccer (2021) by Kushal Shah, James Hyman and Dominic Samangy;
- A Risk-Reward Assessment of Passing Decisions: Comparison Between Positional Roles Using Tracking Data from Professional Men’s Soccer (2021) by Floris Goes, Edgar Schwarz, Marije Elferink-Gemser, Koen Lemmink and Michel Brink;
- Analyzing Learned Markov Decision Processes using Model Checking for Providing Tactical Advice in Professional Soccer (2021) by Maaike Van Roy, Wen-Chi Yang, Luc De Raedt and Jesse Davis;
- Anatomy of Receiving and Turning with the Ball (2021) by Soumyajit Bose and Manas Saraswat;
- Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of Their Representations for Action Spotting (2021) by Anthony Cioppa, Adrien Deliege, Floriane Magera, Silvio Giancola, Olivier Barnich, Bernard Ghanem and Marc Van Droogenbroeck;
- Combining Machine Learning and Human Experts to Predict Match Outcomes in Football: A Baseline Model (2021) by Ryan Beal, Stuart Middleton, Timothy Norman, Sarvapali Ramchurn;
- Data-Driven Detection of Counterpressing in Professional Football (2021) by Pascal Bauer and Gabriel Anzer;
- Determining the Phases of Play Using Graph Neural Network Embeddings (2021) by Juan Camilo Campos;
- Evaluating Soccer Player: from Live Camera to Deep Reinforcement Learning (2021) by Paul Garnier and Théophane Gregoir. See the
nayra
library for code. - Extended Model for Expected Threat in Soccer by Jirka Poropudas;
- From Motor Control to Team Play in Simulated Humanoid Football (2021) by Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, Ali Eslami, Daniel Hennes, Wojciech Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan Tracey, Karl Tuyls, Thore Graepel and Nicolas Heess;
- How Soccer Scouts Identify Talented Players (2021) by Tom Bergkamp, Wouter Frencken, Susan Niessen, Rob Meijer and Ruud den Hartigh;
- Identifying and Evaluating the Efficiency of Each Player During the Pressing Phase Against an Opponent’s Controlled Build-Up Play (2021) by Caterina De Bacco;
- Inferring the Strategy of Offensive and Defensive Play in Soccer with Inverse Reinforcement Learning (2021) by Pegah Rahimian and László Toka;
- Learning Football Body-Orientation as a Matter of Classification (2021) by Adrià Arbués-Sangüesa, Adrián Martín, Paulino Granero, Coloma Ballester and Gloria Haro;
- Leaving Goals on the Pitch: Evaluating Decision Making in Soccer (2021) by Maaike Van Roy, Pieter Robberechts, Wen-Chi Yang, Luc De Raedt, and Jesse Davis. See the accompanying blog post [link] and research poster [link];
- Making Offensive Play Predictable - Using a Graph Convolutional Network to Understand Defensive Performance in Soccer (2021) by Paul Power, Michael Stöckl, and Thomas Seidel for Opta Pro Forum 2021. See the accomanpying talk on Vimeo [link];
- Measuring the Effectiveness of Pressing in Soccer by Simon Merckx, Pieter Robberechts, Yannick Euvrard and Jesse Davis;
- Modelling Team Performance in Soccer Using Tactical Features Derived from Position Tracking Data (2021) by Floris Goes, Matthias Kempe, Jan van Norel and Koen Lemmink;
- Optimally Disrupting Opponent Build-Ups (2021) by Maaike Van Roy, Pieter Robberechts and Jesse Davis;
- Optimising Long-Term Outcomes using Real-World Fluent Objectives: An Application to Football (2021) by Ryan Beal, Georgios Chalkiadakis, Timothy Norman and Sarvapali Ramchurn;
- Potential Penetrative Pass (P3) (2021) by Hadi Sotudeh;
- Predicting Player Transfers in the Small World of Football (2021) by Roland Kovács and László Toka;
- Similarity of Football Players Using Passing Sequences (2021) by Alberto Barbosa, Pedro Ribeiro and Inês Dutra;
- SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos (2021) by Adrien Deliege, Anthony Cioppa, Silvio Giancola, Meisam Seikavandi, Jacob Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas Moeslund and Marc Van Droogenbroeck;
- Temporally-Aware Feature Pooling for Action Spotting in Soccer Broadcasts (2021) by Silvio Giancola and Bernard Ghanem;
- The Interpretable Representation of Football Player Roles Based on Passing/Receiving Patterns by Arsalan Sattari, Ulf Johansson, Erik Wilderoth, Jasmin Jakupovic and Peter Larsson-Green;
- The Origins of Goals in the German Bundesliga (2021) by Pascal Bauer, Gabriel Anzer and Ulf Brefeld;
- The Quest for the Right Pass: Quantifying Players’ Decision Making (2021) by Borja Burriel and Javier Buldú;
- What Happened Next? Using Deep Learning to Value Defensive Actions in Football Event-Data (2021) by Charbel Merhej, Ryan Beal, Sarvapali Ramchurn and Tim Matthews;
- “Why Would I Trust Your Numbers?” On the Explainability of Expected Values in Soccer (2021) by Jan Van Haaren; and
- Women's football analyzed: interpretable expected goals models for women (2021) by Lotte Bransen and Jesse Davis.
- Automatic Pass Annotation from Soccer Video Streams based on Object Detection and LSTM (2020) by Danilo Sorano, Fabio Carrara, Paolo Cintia, Fabrizio Falchi and Luca Pappalardo;
- A Framework for the Fine-Grained Evaluation of the Instantaneous Expected Value of Soccer Possessions (2020) by Javier Fernández, Luke Bornn and Daniel Cervone;
- A new look into Off-ball Scoring Opportunity: taking into account the continuous nature of the game (2020) by Hugo M. R. Rios-Neto, Wagner Meira Jr., Pedro O. S. Vaz-de-Melo;
- Cracking the Black Box: Distilling Deep Sports Analytics (2020) by Xiangyu Sun, Jack Davis, Oliver Schulte and Guiliang Liu;
- Deep Soccer Analytics: Learning an Action-Value Function for Evaluating Soccer Players (2020) by Guiliang Liu, Yudong Luo, Oliver Schulte and Tarak Kharrat;
- Game Plan: What AI can do for Football, and What Football can do for AI (2020) by Karl Tuyls, Shayegan Omidshafiei, Paul Muller, Zhe Wang, Jerome Connor, Daniel Hennes, Ian Graham, Will Spearman, Tim Waskett, and Dafydd Steele, Pauline Luc, Adria Recasens, Alexandre Galashov, Gregory Thornton, Romuald Elie, Pablo Sprechmann, Pol Moreno, Kris Cao, Marta Garnelo, Praneet Dutta, Michal Valko, Nicolas Heess, Alex Bridgland, Julien P´erolat, Bart De Vylder, Ali Eslami, Mark Rowland, Andrew Jaegle, Remi Munos, Trevor Back, Razia Ahamed, Simon Bouton, Nathalie Beauguerlange, Jackson Broshear, Thore Graepel, and Demis Hassabis;
- Google Research Football: A Novel Reinforcement Learning Environment (2020) by Karol Kurach, Anton Raichuk, Piotr Stańczyk, Michał Zając, Olivier Bachem, Lasse Espeholt, Carlos Riquelme, Damien Vincent, Marcin Michalski, Olivier Bousquet, Sylvain Gelly. See the GitHub repo [link];
- Group Activity Detection From Trajectory and Video Data in Soccer (2020) by Ryan Sanford, Siavash Gorji, Luiz Hafemann, Bahareh Pourbabaee and Mehrsan Javan;
- Interpretable Prediction of Goals in Soccer (2020) by Tom Decroos and Jesse Davis;
- Inverse Reinforcement Learning for Team Sports: Valuing Actions and Players (2020) by Yudong Luo, Oliver Schulte and Pascal Poupart. See the code [link];
- Learning the Value of Teamwork to Form Efficient Teams (2020) by Ryan Beal, Narayan Changder, Timothy Norman, Sarvapali Ramchurn;
- Player Chemistry: Striving for a Perfectly Balanced Soccer Team (2020) by Lotte Bransen. See the accompanying Friends of Tracking video tutorials [link] and chapter 4 of the Barca Innovation Hub Football Analytics 2021 publication, titled: 'How does context affect player performance in football?' by Lotte Bransen, Pieter Robberechts, Jesse Davis, Tom Decroos, and Jan Van Haaren [link];
- Ready Player Run: Off-ball run identification and classification (2020) by Sam Gregory;
- The Right Place at the Right Time: Advanced Off-Ball Metrics for Exploiting an Opponent’s Spatial Weakenesses in Soccer (2020) by Sergio Llana, Pau Madrero and Javier Fernández;
- Optimising Game Tactics for Football (2020) by Ryan Beal, Georgios Chalkiadakis, Timothy Norman and Sarvapali Ramchurn;
- Routine Inspection: A Playbook for Corner Kicks (2020) by Laurie Shaw and Sudarshan 'Suds' Gopaladesikan. Accompanying talk - 2020 Harvard Sports Analytics Lab];
- Seeing in to the future: using self-propelled particle models to aid player decision-making in soccer (2020) by Fran Peralta, Pablo Piñones Arce, David Sumpter and Javier Fernández;
- SoccerMap: A Deep Learning Architecture for Visually-Interpretable Analysis in Soccer (2020) by Javier Fernández and Luke Bornn;
- SoccerMix: Representing Soccer Actions with Mixture Models (2020) by Tom Decroos, Maaike Van Roy and Jesse Davis;
- Soccer Analytics Meets Artificial Intelligence: Learning Value and Style from Soccer Event Stream Data (2020) by Tom Decroos
- The Tactics of Successful Attacks in Professional Association Football: Large-Scale Spatiotemporal Analysis of Dynamic Subgroups Using Position Tracking Data (2020) by Floris Goes, Michel Brink, Marije Elferink-Gemser, Matthias Kempe and Koen Lemmink
- Using Player’s Body-Orientation to Model Pass Feasibility in Soccer (2020) by Adrià Arbués-Sangüesa, Adrián Martín, Javier Fernández, Coloma Ballester and Gloria Haro;
- Valuing On-the-Ball Actions in Soccer: A Critical Comparison of xT and VAEP (2020) by Maaike Van Roy, Pieter Robberechts, Tom Decroos and Jesse Davis;
- Actions Speak Louder Than Goals: Valuing Player Actions in Soccer (2019) by Tom Decroos, Lotte Bransen, Jan Van Haaren, and Jesse Davis. See accompany presentation at SIGKDD 2019 by Tom Decroos [link];
- Decomposing the Immeasurable Sport: A deep learning expected possession value framework for soccer (2019) by Javier Fernández, Bornn, and Dan Cervone. Accompanying talks - SSAC19, StatsBomb conference;
- Dynamic Analysis of Team Strategy in Professional Football (2019) by Laurie Shaw and Mark Glickman. Accompanying talks - NESSIS 2019, 2020 Google Sports Analytics Meetup;
- Invalid Interpretation of Passing Sequence Data to Assess Team Performance in Football Repairing the Tarnished Legacy of Charles Reep (2019) by Richard Pollard
- Measuring soccer players’ contributions to chance creation by valuing their passes (2019) by Lotte Bransen, Jan Van Haaren, and Michel van de Velden.
- Modelling the Collective Movement of Football Players (2019) by Fran Peralta; and
- Player Vectors: Characterizing Soccer Players’ Playing Style from Match Event Streams (2019) by Tom Decroos and Jesse Davis.
- Beyond Expected Goals (2018) by Will Spearman;
- Chance involvement in goal scoring in football (2018) by Martin Lames
- Predicting football results using machine learning techniques (2018) by Corentin Herbinet
- Replaying the NBA (2018) by Luke Bornn
- Wide Open Spaces: A statistical technique for measuring space creation in professional soccer (2018) by Javier Fernandez and Luke Bornn;
- Spatial analysis of shots in MLS: A model for expected goals and fractal dimensionality (2018) by Alexandera Fairchild, Konstantinos Pelechrinis, Mariosa Kokkodis; and
- High-resolution shot capture reveals systematic biases and an improved method for shooter evaluation (2018) by Rachel Marty.
- Beyond crowd judgments: Data-driven estimation of market value in association football by Oliver Müller, Alexander Simons, and Markus Weinmann
- Data-Driven Ghosting using Deep Imitation Learning (2017) by Hoang M. Le, Peter Carr, Yisong Yue, and Patrick Lucey
- “The Leicester City Fairytale?”: Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons (2017) by Hector Ruiz, Paul Power, Xinyu Wei, and Patrick Lucey
- Physics-Based Modeling of Pass Probabilities in Soccer (2017) by Will Spearman, Austin Basye, Greg Dick, Ryan Hotovy, and Paul Pop
- Valuing passes in football using ball event data (2017) by Lotte Bransen;
- Not all passes are created equal: objectively measuring the risk and reward of passes in soccer from tracking data (2017) by Paul Power, Hector Ruiz, Xinyu Wei, and Patrick Lucey. See Paul Power's talk [link] (downloadable MP4), and the webpage [link];
- Plus-Minus Player Ratings for Soccer (2017) by Tarak Kharrat, Javier Pena, and Ian McHale
- An examination of expected goals and shot efficiency in soccer (2017) by Alex Rathke; and
- Predicting goal probabilities for possessions in football (2017) by Nils Mackay.
- Spatio-Temporal Analysis of Team Sports – A Survey (2016) by Joachim Gudmundsson and Michael Horton;
- Valuing Individual Player Involvements in Norwegian Association Football (2016) by Olav Nørstebø, Vegard Rødseth Bjertnes, and Eirik Vabo; and
- Expected Goals in Soccer (2016) by Harm Eggels.
- “Quality vs Quantity”: Improved Shot Prediction in Soccer using Strategic Features from Spatiotemporal Data (2015) by Patrick Lucey, Alina Bialkowski, Mathew Monfort, Peter Carr, and Iain Matthews;
- Quantifying Shot Quality in the NBA by ; and
- Soccer video and player position dataset (2015) by S. A. Pettersen, D. Johansen, H. Johansen, V. Berg-Johansen, V. R. Gaddam, A. Mortensen, R. Langseth, C. Griwodz, H. K. Stensland, and P. Halvorsen. See the accompanying webpage [link].
- Large-Scale Analysis of Soccer Matches using Spatiotemporal Tracking Data (2014) by Alina Bialkowski, Patrick Lucey, Peter Carr, Yisong Yue, Sridha Sridharan, and Iain Matthews.
- A Framework for Tactical Analysis and Individual Offensive Production Assessment in Soccer Using Markov Chains (2011) by Sarah Rudd. Accompanying NESSIS talk on Metacafe [link]; and
- An Extension of the Pythagorean Expectation for Association Football (2011) by Howard Hamilton.
- Charles Reep (1904-2002) pioneer of notational and performance analysis in football (2002) by Richard Pollard.
- Modelling Association Football Scores and Inefficiencies in the Football Betting Market (1997) by Mark Dixon and Stuart Coles.
- Skill and Chance in Ball Games (1971) by Charles Reep, Bernard Benjamin, and Richard Pollard.
- 21st Club
- Absolute Unit by Tiotal Football
- Analytics FC
- BiscuitchaserFC by Mark Wilkins
- The Chatalytics Newsletter by The Chatalytics Podcast
- Get Goalside! by Mark Thompson
- geom_mark
- GriffinFtbl by Luke Griffin
- Grace on Football by Grace Robertson
- From An Engineer Sight by Benoit Pimpaud
- KPMG Football Benchmark Newsletter - go to the home page and click on 'Registration' in the top-right corner
- Looks Good on Paper by Felix Pate
- Measureables by Brendan Kent
- No Grass in the Clouds by Ryan O'Hanlon
- Soccer Analytics Newsletter
- space space space by John Muller (this newsletter has now finished but catch John's work as a Senior Writer for The Athletic [link]
- Stats Perform
- Professional footballers threaten data firms with GDPR legal action (12/10/2021) for BBC News by Nick Hartley
- Liverpool director of research hints at seven reasons for quiet summer transfer window (11/10/2021) for Liverpool Echo by Josh Williams
- 'We can make some valuable signings' - Liverpool director of research explains how transfer strategy really works (11/10/2021) for Liverpool Echo by Paul Gorst
- Introducing Manchester United's big new signing: a mathematician (08/10/2021) for The Telegraph by James Ducker
- England vs Germany will be settled by spreadsheets (29/06/2021) for Wired by Amit Katwala;
- Now DeepMind is using AI to transform football (06/05/2021) for Wired by Andrew Powell;
- Kevin De Bruyne uses data analysts to broker £83m Man City contract without agent (08/04/2021) by David McDonnell for The Mirror;
- La extraña renovación de De Bruyne: sin agente y usando el 'big data' para calcular su salario (07/04/2021) for Marca;
- From scouting players on sidelines to sofas – Meet the WyScout generation transforming football analytics (07/04/2021) by Pete Hall for iNews;
- Meet Ram Srinivas, The Biggest Wes Hoolahan Fanatic In India (27/03/2021) by Fiachra Gallagher for Balls.ie;
- Soccer-From blogging to the dressing room - the rise of the new analysts (25/03/2021) by Simon Evans for Reuters
- Premier League club Manchester City hire astrophysicists (24/03/2021) by Alfredo Relaño for AS;
- Manchester City will have astrophysicists in their ranks in Marca;
- It IS rocket science! Manchester City hire astrophysicists to their data analysis team in bid to move Premier League leaders further ahead of their rivals by Jack Gaughan (22/03/2021) for The Daily Mail;
- Liverpool sign up for StatsBomb 360: Ted Knutson explains why this stats revolution will change the game (18/03/2021) by Adam Bate for Sky Sports News;
- Data experts are becoming football's best signings (05/03/2021) by Justin Harper for BBC News;
- How a Celtic blogger nurtured by Brendan Rodgers is now lifting Leicester City (27/02/2021) by Tom Roddy for The Times;
- 17-Year-Old Man Lands Dream Job Of Getting Paid To Watch Football All Day by Adnan Riaz for Sport Bible;
- Aged 17 and getting paid to watch football all day (04/02/2021) by Manish Pandey for BBC News;
- Man City’s Big Winter Signing Is a Former Hedge Fund Brain (31/01/2021) by David Dellier and Adam Blenford for Bloomberg;
- How data is pushing Twitter scouts and bloggers into football's big time (27/02/2021) by Paul MacInnes for The Guardian;
- Revealed: expected goals being used in football's war against match-fixing (13/02/2021) by Sean Ingle for The Guardian;
- 'What we do isn't rocket science': how Midtjylland started football's data revolution (25/10/2020) by Sean Ingle for The Guardian;
- How a teenager from Bangalore became a performance analyst for Dundee United (23/12/2020) by Tim Wigmore for The Telegraph;
- How the volunteers of data website Transfermarkt became influential players at European top football clubs (18/12/2020) by Pepihn Keppel and Tom Claessens;
- Colin Trainor: from bigging up Klopp to the little details of the GAA (17/10/2020) by Kenny Archer for The Irish Times;
- REVEALED: The data scientist, astrophysicist, chess champion, and doctor in theoretical physics who are behind Liverpool’s title-winning success… they may look a 'little nerdy' but this Fab Four prove it is rocket science! (27/06/2020) by Rob Draper and Adam Shafiq for The Daily Mail;
- How analysts have used lockdown to unearth football’s next hidden gems (17/07/2020) by Dan Clark in The Times;
- Behind the Badge: The physicist who leads Liverpool's data department (15/06/2020) by Sam Williams for LiverpoolFC.com;
- How Soccer Scouting Has Changed, And Why It’s Never Going Back (15/05/2020) by Robert Kidd for Forbes;
- ‘Expected threat’, ‘width per sequence’ – the statistical metrics you haven’t heard of (13/02/2020) by Dan Clark for The Times;
- How Brentford flipped the script and staged a data revolution to become England’s smartest club (24/01/2020) by Sean Ingle for Talksport;
- 'It's the boffins what won it!': Data experts plus Jurgen Klopp's charisma turn Liverpool into the kings of Europe (02/06/2019) by Joe Bernstein for The Mail on Sunday;
- How Data (and Some Breathtaking Soccer) Brought Liverpool to the Cusp of Glory (22/05/2019) by Bruce Schoenfeld for The New York Times;
- Brexit Could Drastically Change English Soccer (11/12/2018) by Laurie Shaw for FiveThirtyEight;
- Soccer's Moneyball Moment: How Enhanced Analytics Are Changing The Game (19/11/2018) by Robert Kidd for Forbes;
- 2018 World Cup: Prediction Time; Up Against The Machine (13/06/2018) by Bobby McMahon for Forbes;
- Home advantage, unconscious bias and the boisterous crowds who influence referees (23/04/2018) by Tim Wigmore for iNews;
- The Premier League is losing its competitive balance – that should be cause for concern (02/02/2018) by Tim Wigmore for iNews;
- Expected goals and Big Football Data: the statistics revolution that is here to stay (03/03/2017) by Paul MacInnes in The Guardian;
- How computer analysts took over at Britain's top football clubs (09/03/2014) by Tim Lewis for The Observer;
- How data analysis helps football clubs make better signings (01/11/2018) by John Burn-Murdoch for The FT;
- What does 'Expected Goals' mean? Welcome to the new Opta stat you will be hearing a lot about this season (12/08/2017) by Mirror Football for The Irish Mirror;
- A football revolution (17/07/2011) in The FT [pay wall]; and
- A working life: The quantitative analyst (11/06/2011) by Graham Snowdon for The Guardian.
See the Sports Analytics Reading List by Measureables (Brendan Kent), as part of his Sports Analytics 101 series
The following use Amazon UK links where available.
- Moneyball: The Art of Winning an Unfair Game by Michael Lewis;
- The Numbers Game by Chris Anderson and David Sally;
- Football Hackers by Christoph Biermann;
- Soccermatics by David Sumpter;
- Soccernomics by Simon Kuper and Stefan Szymanski;
- Money and Football: A Soccernomics Guide by Simon Kuper and Stefan Szymanski;
- Mathletics: How Gamblers, Managers, and Sports Enthusiasts Use Mathematics in Baseball, Basketball, and Football by Wayne Winston;
- Data Analytics in Football by Daniel Memmert and Dominik Raabe;
- Changing the Conversation series by 21st Club;
- Expected Goals: The story of how data conquered football and changed the game forever by Rory Smith
- Football Decoded: Using Match Analysis & Context to Interpret the Demands by Paul Bradley;
- Sports Analytics: A Guide for Coaches, Managers, and Other Decision Makers by Ben Alamar;
- Outside the Box by Duncan Alexander;
- Opta World Football Infographics: The Beautiful Game in Brilliant Detail by Adrian Besley;
- Zonal Marking: The Making of Modern European Football by Michael Cox;
- The Mixer: The Story of Premier League Tactics, from Route One to False Nines by Michael Cox;
- The Price of Football by Kieran Maguire. Check out The Price of Football Podcast with Kieran Maguire and Kevin Hunter Day;
- Inverting the Pyramid by Jonathan Wilson;
- Sprawlball: A Visual Tour of the New Era of the NBA by Kirk Goldsberry; and
- Numbers Don't Lie: New Adventures in Counting and What Counts in Basketball Analytics by Yago Colás.
The following is a series of playlists that that I have collated originally for my own personal viewing but they may be useful to you:
- All Sports Analytics - a huge playlist of around 800 video that includes anything at all to do with Sports Analytics and Data Science. Any video found related to the topic is here
- Football-specific Data Science lectures and seminars - presentations and seminars from conferences including StatsBomb, Opta, Sloan, and more. For links to recently unlisted Stats Perform (Opta) talks, see Ben Torvaney's Gist list [link]
- Football-specific Tableau tutorials
- Football-specific Power BI tutorials
- Football-specific Machine Learning
- Football-specific Data Viz
- Tracking data - all videos related to the topic of Tracking data include presentations and tutorials
- Expected Goals - all videos on the topic of Expected Goals
Playlists created by others
- Friends of Tracking Playlists:
- The Analytics (formally Opta) Playlists
- McKay Johns Playlists:
- StatsBomb Playlists
- UTSPAN Seminar Series 2020 by UTSPAN
- 2020 Google Sports Analytics Meetup by Alok Pattani for Google Sports Analytics
- Carnegie Mellon Sports Analytics Conference
- Great Lakes Analytics in Sports Conferences:
- Shorts Videos on Soccer Analytics by Dan Altman
- 42 Analytics – for SSAC conferences
- Barça Innovation Hub (English and Spanish)
- Big Data Sports by David Fombella
- The Coaches’ Voice
- CMU Statistics
- Friends of Tracking with David Sumpter, Dr. Catherine Pfaff, Javier Fernández, Laurie Shaw, Sudarshan 'Suds' Gopaladesikan, Pascal Bauer, and Fran Peralta
- Football Player Ratings by Lars Magnus Hvattum
- Football Whispers
- Futbol AnalysR by Josh Trewin - for PowerBI tutorials
- Mark Glickman – for NESSIS talks, uploaded to his personal channel. Old talks are available on his Metacafe channel. See the official website [link]
- McKay Johns's YouTube channel - for Python and Data Science tutorials
- Ninad Barbadikar's YouTube channel - for Tableau tutorials
- Opta - including Opta Pro Forum talks
- Planeta Data Fútbol (en español) con Jesús Lagos y Miguel Ángel García
- SciSports
- StatsBomb - including StatsBomb Conference talks
- STATS Insights
- Tifo Football
- Carlon Carpenter's Football Analytics repository of videos in Google Drive, featuring: Individual Concepts, General Football Tactics, specific clips for Men's football and Women's football, Tactical Camera Footage, and Training Sessions.
- Carlon Carpenter's Performance Analysis] repository of videos in Google Drive, featuring: Analysis Sample Reports and Writing and Sportscode Materials.
- Coaching Video Content Google Drive repository by Michael Loftman
- Laurie Shaw's Metrica Sports Tracking data series for Friends of Tracking - Introduction, Measuring Physical Performance, Pitch Control modelling, and Valuing Actions. See the following for code [link];
- Lotte Bransen and Jan Van Haaren's 'Valuating Actions in Football' series for Friends of Tracking - Valuing Actions in Football: Introduction, Valuing Actions in Football 1: From Wyscout Data to Rating Players, Valuing Actions in Football 2: Generating Features, Valuing Actions in Football 3: Training Machine Learning Models, and Valuing Actions in Football 4: Analyzing Models and Results. See the following for code [link];
- David Sumpter's Expected Goals webinars for #FoT - How to Build An Expected Goals Model 1: Data and Model, How to Build An Expected Goals Model 2: Statistical fitting, and The Ultimate Guide to Expected Goals. See the following for code 3xGModel, 4LinearRegression, 5xGModelFit.py, and 6MeasuresOfFit;
- Peter McKeever's 'Good practice in data visualisation' webinar for Friends of Tracking. See the following for code [link];
- StatsPerform AI in Sport series - Overview, AI in Basketball, AI In Soccer, and AI in Tennis;
- Making Offensive Play Predictable by Paul Power, Michael Stöckl, and Thomas Seidel for Opta Pro Forum 2021;
- Google Research Football by Piotr Stanczyk;
- Will Spearman's masterclass in Pitch Control for Friends of Tracking;
- How Tracking Data is Used in Football and What are the Future Challenges with Javier Fernández, Sudarshan 'Suds' Gopaladesikan, Laurie Shaw, Will Spearman and David Sumpter for Friends of Tracking;
- Why Do Clubs Need to Embrace Analytics to Stay Competitive? with Vosse de Boode, David Sumpter, Adrien Tarascon and Javier Fernández for Barca Innovation Hub;
- Valuing Actions in Football: Introduction with Lotte Bransen and Jan Van Haaren for Friends of Tracking;
- Routine Inspection: Measuring Playbooks for Corner Kicks by Laurie Shaw and Sudarshan 'Suds' Gopaladsikan;
- A Physics Based Measurement of Defensive Contributions (2021) by Aditya Kothari;
- Enriching Event Data: A Semi-Supervised Augmentation Approach Using Location Information by Debangan Dey, Rahul Ghosal and Atanu Mitra;
- Estimating the Change in Soccer… Home Advantage During the COVID-19 Pandemic by Luke Benz and Mike Lopez;
- Identifying and Evaluating Strategies for Successfully Penetrating a High Opposition Press from Short Goal Kicks, Played Inside the Box, to Move the Ball into the Opposition Half by Vignesh Jayanth;
- Pace and Power: Removing Unconscious Bias from Soccer Broadcasts by Sam Gregory;
- Player Masks: Encoding Soccer Decision-Making Tendencies by Devin Pleuler;
- Predictive Value of Off-Target Shots in Soccer by Ethan Baron;
- Quantifying League-Independent Scoring Ability in Soccer by Daniel Daly-Grafstein. New England Symposium on Statistics in Sports. October 2021.
- The Statistics of Spin in Soccer by Jackson Weaver. New England Symposium on Statistics in Sports by October 2021.
- Volatility and Calculation of Risk-Adjusted Return in Football Scouting by Ola Lidmark Eriksson;
- Tactical Insight Through Team Personas by David Perdomo Meza and Daniel Girela. See accompanying blog post [link];
- Training Ground Guru webinairs
- Christmas Lectures 2019: How to Get Lucky with Hannah Fry. Small segment with Tim Waskett @ 27mins;
- I’m in a Wide Open Space: Creating Opportunities at Set Pieces by Dan Barnett;
- Long or Short? How the New Short Goal Kick Rule Is Impacting Football by Tom Worville;
- Identifying and Evaluating Strategies to Break down a Low Block Defence by Vignesh Jayanth. See accompanying blog post [link];
- Seeing in to the Future: Modelling Football Player Movements by David Sumpter;
- Learning Value and Style from Soccer Event Stream Data by Tom Decroo;
- Marcelo Bielsa's infamous 'Spygate PowerPoint presentation of Derby County [link];
- Tom Goodall's Tactics, Training & Tableau: Football Tableau User Group. Check out his Football Tableau training courses [link;
- Data Robot Opening Remarks & Keynote: Making Better Decisions, Faster with Brian Prestidge;
- A Framework for Tactical Analysis and Individual Offensive Production Assessment in Soccer Using Markov Chains by Sarah Rudd. Accompanying slides [link];
- Demystifying Tracking data Sportlogiq webinar by Sam Gregory and Devin Pleuler;
- Data Analytics in Soccer by Dan Fradley;
- How Hammarby create the mathematically perfect pressing game by David Sumpter
- Hudl Presents: Performance Analysis in 2020
- Self-Supervised Representations for Tracking Data by Karun Singh;
- An American Analyst in London at SSAC 2019 with StatsBomb CEO Ted Knutson and Houston Rockets GM Daryl Morey;
- Beyond the Baseline by Marek Kwiatkowski;
- Some Things Aren't Shots by Thom Lawrence;
- Beyond Save Percentage by Derrick Yam
- Expected goals demonstration by Sander Ijtsma
- Goals change games by Garry Gelade
- Expected goals by Dan Altman
- What Football Analytics can Teach Successful Organisation by Rasmus Ankersen;
- Soccermatics: how maths explains football by David Sumpter
- Changing the soccer transfer market with big data by Giels Brouwer
- The Numbers Game: How Data Is Changing Football - FourFourTwo Documentary;
- How Stats Won Football: From Moneyball to FC Midtjylland – COPA90 Stories Documentary;
- Footballia - historical matches and highlights
Below I've tried to include both the Sports/Football Analytics and then notable episodes of all podcasts that have analytical content/interviews. Spotify and YouTube links used where available. All episodes mentioned below that are available on Spotify can be found in the following playlist (updated periodically): [link].
- All Stats Aren't We with Jon Mackenzie and Josh Hobbs (Leeds United Podcast)
- American Soccer Analysis;
- Analytics FC Podcast - originally with Tom Worville and Sam Gregory, next with Jon MacKenzie, and now with Alex Stewart;
- Big Data Sports (en español) con Marcelo Gantman y Agustin Mario Gimenez;
- Chatalytics Podcast by The Chatalytics Podcast
- Corridor of Uncertainty FPL Podcast by Simon and Jamie;
- The Dan & Omar Show with Daniel Geey and Omar Chaudhuri
- Double Pivot Podcast;
- Differentgame - The Football Analytics Podcast by Paul Riley and Richard Shephard;
- Expected Value;
- Fanalytics with Mike Lewis;
- First Time Finish Podcast with Tom Underhill, Bence Bocsak, and Ninad Barbadikar;
- The Football Fanalytics Podacst;
- Football Today;
- Laptop Gurus;
- Looks Good on Paper podcast by Felix Pate;
- Measurables Podcast by Brendan Kent;
- MRKT Insights with Tim Keech, Ram Srinivas, Matt Lawrence, Kevin Elphick, and Andy McGregor. Formally Jay Socik;
- Open Source Sports with Ron Yurko;
- A Podcast About Tactics by Jon Mackenzie
- The Price of Football Podcast with Kieran Maguire and Kevin Hunter Day. Check out the The Price of Football book by Kieran Maguire.
- The Scouted Football Podcast;
- smarterscout: The Why in Analytics by Dan Altman;
- Squawka Talker Football Podcast;
- SSAC by MIT Sloan Sports Analytics Conference;
- StatsBomb;
- The SV Podcast;
- Target Scouting by Luke Griffin;
- Tifo Podcast;
- Training Ground Guru;
- Three At The Back by Opta Pro; and
- xPodcast by Modern Fitba (Scottish football);
- Zonal Marking with Michael Cox, Tom Worville and Ali Maxwell.
- All Stats Aren't We:
- Analytics FC Podcast:
- The Beesotted Brentford Pride of West London Podcast
- Bet The Process
- Big Data Sports (Spanish) by Marcelo Gantman and Agustin Mario Gimenez:
- 87: No es Moneyball: es Brentford
- 66: Tres Libros Sobre Sports Analytics Más Allá De Moneyball
- 65: Métrica Sports: La máquina de entender el juego with Bruno Dagnino
- 56: STATS PERFORM: Cómo es el nuevo gigante de los datos del fútbol
- 47: Wyscout: 550 Mil Futbolistas "concentrados" En Un Software
- 35: Big Data Sports - 35: Analistas: Los nuevos "cracks" del fútbol
- 33: Google + IA = Fútbol en Real Time
- Blood Red: The Liverpool FC Podcast
- Burn It All Down
- Campbell's Footballs by Dr. Grant Campbell
- Challengers Podcast:
- Expected goals (2016)
- The Conor J Show:
- ČT sport podcasty
- The Derby County BlogCast
- January window preview with Ram Srinivas (MRKT Insights)
- Economic Rockstar:
- ESPN Daily
- ExtraTime Radio: The Numbers Game Book Club
- Merritt Paulson names his Timbers Mount Rushmore, plus a little storytime | Exploring future of data in soccer featuring Devin Pleuler
- ExtraTime Radio: The Numbers Game Book Club featuring Devin Pleuler
- Expected Value
- Explore Explain with Andy Kirk:
- Fanalytics with Mike Lewis:
- Getting Your Foot in the Door with Sean Steffen
- Fell In Love With A Girl
- Sommerpause Special - FCSP and DCFC: A Different Kind of Football Panel including Dr. Stefan Szymanski on the panel
- Fluid Football
- Freakonomics by Stephen J. Dubner:
- Can Britain Get Its “Great” Back? (Ep. 393) featuring Dr. Ian Graham @ 41m25s;
- The Football Analytics Shot by The Power Rank and Ed Feng (usually American football):
- Football CFB Podcast:
- The Football Collective Podcast:
- The Football Ramble
- The Football Pod:
- Football Today
- I Prefer not to Speak
- Infinite Football
- Inside The Newsroom
- Life...On Our Terms
- The Lowdown by Conor Walsh:
- The Lowe Post
- Measurables Podcast by Brendan Kent (football specific episodes):
- Stephanie Kovalchik, Senior Data Scientist at Zelus Analytics
- Simon Banoub, CMO at StatsBomb
- Rob Suddaby, First Team Data Analyst at Norwich City FC
- Ross Moses and Tyler Heaps, US Soccer Analytics and Research
- Mike Treacy, Chairman of Dundalk FC
- Sam Gregory, Data Analyst at Sportlogiq
- Resources
- Expected Goal Chain and Penalty Kick Analysis
- Elo and Tournament Projections
- Mariela Nisotaki, Technical Scout at Norwich City FC
- Expected Goals and Expected Assists
- Brendan Kent, Soccer Data Analyst at the Portland Timbers
- What is sports analytics?
- Men in Blazers:
- MLS Assist (a Total Soccer Show podcast):
- MLS Assist: Advanced soccer metrics explained, how MLS teams use data, and more with Eliot McKinley - Spotify
- The Modern Football Group Podcast
- Modern Soccer Coach Podcast with Gary Curneen:
- Motley Fool Money
- Motley Fool Money: 06.06.2014 with Stefan Szymanski
- New Books in Sports:
- Not The Top 20 Podcast:
- The Nutmegged Arena by The Nutmeg Assist:
- Open Source Sports with Ron Yurko;
- Player Chemistry in Soccer with Lotte Bransen
- The Ornstein & Chapman Podcast with David Ornstein and Mark Chapman:
- Latest on the race to sign Erling Haaland and Tuchel's reaction to Chelsea horror show featuring Tom Worville
- Should football scrap transfer fees? with Daniel Geey and Stefan Szymanski
- Football Club Ownership: Data, Decisions & Competitive Edge with Simon Hallett
- Pacey Performance Podcast with Robert Pacey:
- #340 What is data science (and what isn't), data informed decision making with Sudarshan Golaladesikan - Spotify and YouTube;
- The PinkUn Norwich City Podcast:
- Pinnacle Podcast:
- Planet Fútbol with Grant Wahl
- The Pomp Podcast
- The Process with James Allcott:
- Purely Arsenal - Football Purists, an AFC podcast
- Rigo Plascencia Deportes, Entrevistas y más:
- The Scouted Football Podcast:
- Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas
- SempreMilan Podcast:
- Sports Tech Research Podcast:
- Soccer Player Development Podcast:
- Episode 12 with Rasmus Ankersen - YouTube
- Squawka Talker Football Podcast:
- State of the World:
- These Football Times
- The Tao of Sports Podcast – The Definitive Sports, Marketing, Business Industry News Podcast:
- This Football Life:
- Tifo Podcast:
- The Transfer Market & 21st Club with Omar Chaudhuri - Spotify and YouTube
- How Memphis Depay Used Data to Find His Next Club with Giels Brouwer - Spotify and YouTube
- How Do Football Clubs Actually Use Statistics? - YouTube
- JJ Bull: Tactical Analysis & Coaching Badges - Spotify and YouTube
- A Day in the Life Of: A Football Recruitment Analyst - Spotify and YouTube
- Liverpool: Pressing, xG Concerns, and Klopp’s Future - Spotify and YouTube
- Understanding Stats in Football with Nikos Overheul - Spotify and YouTube
- Steve Morison: Tactical Insight & Football Psychology - Spotify and YouTube
- Football Tactics with Michael Cox (Zonal Marking) - Spotify and YouTube
- Football, Tactics & History with Jonathan Wilson - Spotify and YouTube
- The Future of Stats: xG, xA - Spotify and YouTube
- The Totally Football Show with James Richardson
- 03/07/2019: Football Hackers with Christoph Biermann
- Total Soccer Show:
- #32 What is xG and why are advanced stats useful in soccer? - YouTube
- Soccer stats and analytics with Ted Knutson (in which Ted explains Expected Goals to Daryl) - YouTube
- Mike L. Goodman (@TheM_L_G) talks USMNT tactical options, EPL trends, Expected Goals - YouTube
- Everton Premier League preview: Mike L. Goodman talks Silva's style, Moise Kean, and replacing - YouTube
- Trademate Sports:
- UCN/USF Sport Management - Sports Business Podcast:
- Wharton Business Daily
- The Wharton Moneyball Post Game Podcast
- Wharton Moneyball: Soccer Analytics, the Women's World Cup & Cirque Du Soleil featuring Ted Knutson
- 6/13/18 Wharton Moneyball with Stefan Szymanski
- Where Others Won't by Cody Royle:
- View From The Byline
- Training Ground Guru Staff Profiles;
- Female (identifying) sports analysts by Dr. Catherine Pfaff;
- 2020 Analytics Twitter Top 1,000 Power Rankings, calculated by Will Thomson. See the Twitter list created by Luton Town Analytics [link];
- Sports Analytics Twitter list by Jan Van Haaren;
- Soccer People Twitter list by John Muller;
- Football Analysts Twitter list by Colin Trainor;
- Opta Staff Twitter list by Opta;
- Football Analyst Community Rankings dashboard by Neil Charles;
- Football data Analysts spreadsheet by Dan Altman (few years old now but lists the OGs of football analytics); and
- Introduction to Soccer Analytics – The Guys I Follow by Ted Knutson (a old, 2013 bio of the guys that started the football analytics movement. Now out-of-date, but great if you want to know who helped influence football analytics).
- OptaPro Analytics Forum;
- StatsBomb Conference;
- Barça Sports Tomorrow, Sports Analytics Summit, and Sports Technology Symposium;
- MIT Sloan Sports Analytics Conference;
- New England Symposium on Statistics in Sports (NESSIS;
- Carnegie Mellon Sports Analytics Conference;
- CASSIS;
- Tactical Insights 2020 Conference at King Power Stadium;
- Artificial Intelligence in Team Sports (AITS) and [link];
- Machine Learning and Data Mining for Sports Analytics;
- International Workshop on Computer Vision in Sports;
- Google Sports Analytics Meetup.;
- DFB Hackathon;
- (Ohio State Sports Analytics Association Conference)[http://org.osu.edu/sportsanalytics/];
- PSG Sports Analytics Challenge;
- Football Data International Forum;
- Global Training Camp;
- Great Lakes Analytics Conference;
- MathSport International;
- Sports Analytics World Series; and
- Sportdata & Performance Forum.
The following includes non-football competitions.
- NFL Big Data Bowl (American Football) - 2021 - annual;
- Big Data Cup (Hockey) - annual;
- Google Research Football with Manchester City F.C. - October 2020; and
- Liverpool Analytics Challenge (Football) - May 2020. Challenge used Last Row Tracking-like data kindly provided by Ricardo Tavares. Full a full list of entries, see David Sumpter's Medium post [link], featuring the three eventual winners - Surya Kocherlakota, Theophane Gregoir and Paul Garnier's, and Gabin Rolland (discussed on Friends of Tracking [link]).
- Mathematical Modelling of Football by Uppsala University;
- StatsBomb Academy;
- Sport Analytics and Technologies MSc at Loughborough University, taught by Donald Barron;
- Football Analytics short course by StatsPerform with Birkbeck University; and
- Barça Innovation Hub.
For live job postings tracked by the community, check the Jobs channel of the Football in Numbers Discord server
- The Video Analyst - Rob Carroll posts many of the jobs going in football on his own website. Make sure to also follow him on Twitter (@thevideoanalyst);
- Job sites:
- The Football Scouts - looks really good!
- FutbolJobs
- Sports Jobs UK
- Training Ground Guru Job Board
- Data companies:
- Clubs (this list could become quite long but I've included the ones I've got round to seeing, so far:
- City Football Group. See also the City Football Insights Twitter account
- Arsenal
- Consultancies:
- TwentyFirst Group (formally 21st Club)
- Betting companies:
- Others / to sort:
- Football in Numbers Discord server organised by McKay Johns;
- Uppsala Mathematical Modelling of Football Slack group organised by Novosom Salvador;
- Tableau for Sports Discord server organised by Ninad Barbadikar;
- Football Analysts Discord server organised by Carlon Carpenter; and
- Scouted Football Discord server.
Focus on some of the key topics in football analytics. Most of the following resources features above but are instead reorganised by topic. This section is still very much a work in progress as I go along and may be missing resources mentioned above.
- Charles Reep Wiki;
- Analytics is older than you think: (re)introducing Charles Reep by Mark Thompson for his newsletter Get Goalside!;
- The evolution of football data by Mark Thompson;
- Goal Scoring in Association Football: Charles Reep by Keith Lyons;
- The Charles Reep and Bernard Benjamin Paper 50 Years On (1) by Keith Lyons;
- Bernard Benjamin profile by Keith Lyons;
- Charles, Richard, Neil and Simon: the stories we craft by Keith Lyons;
- Football’s Pioneer – The Charles Reep story by Rob Carroll;
- Grim Reep by Barnay Ronay
- History of Performance Analysis: The Controversial Pioneer Charles Reep by Guillermo Martinez Arastey;
- The Soccer Analytics Revolution by Nathan Luzum and Michael Model;
- How One Man’s Bad Math Helped Ruin Decades Of English Soccer by Joe Sykes and Neil Paine for FiveThirtyEight;
- The History of Sports Analysis: The Man Who Ruined English Football by Duncan Ritchie;
- No, seriously: what the heck is expected goals (xG)? by James Maw;
- Don't Shoot the Messenger. The First Football Analyst Was a Pioneer 50 Years Ahead of His Time by Alan Campbell;
- Papers by and about Charles Reep:
- Skill and Chance in Ball Games by Charles Reep, Bernard Benjamin and Richard Pollard
- Charles Reep (1904-2002): pioneer of notational and performance analysis in football by Richard Pollard
- Invalid Interpretation of Passing Sequence Data to Assess Team Performance in Football: Repairing the Tarnished Legacy of Charles Reep by Richard Pollard
- A Twitter thread of the original football analytics blogs from 2009 by Tiotal Football [link].
For a playlist of Expected Goals related videos available on YouTube, see the following playlist I have created [link].
- What is xG? by Tifo Football;
- Opta Expected Goals by The Analyst (formally Opta);
- What are Expected Goals? by David Sumpter and Axel Pershagen;
- Anatomy of a Goal by Numberphile Brady Haran);
- How Did These Goals Go In? - We Explain How Goal Probability Works by the Bundesliga;
- Soccer Analytics: Expected Goals by Dan Altman; and
- Anatomy of an Expected Goal by 11tegen (Sander IJtsma);
- David Sumpter's Expected Goals webinars for Friends of Tracking (see the following for code 3xGModel, 4LinearRegression, 5xGModelFit.py, and 6MeasuresOfFit):
- "Is Our Model Learning What We Think It Is?" Estimating the xG Impact of Actions in Football by Tom Decroos from the 2019 StatsBomb Innovation in Football Conference;
- Statsbomb Data Launch - Beyond Naive xG by Ted Knutson.
- Tech how-to: build your own Expected Goals model by Jan Van Haaren and SciSports.
- Fitting your own football xG model by Dato Fútbol (Ismael Gómez Schmidt). See GitHub repo [link];
- Python for Fantasy Football series by Fantasy Futopia (Thomas Whelan). See the following posts:
- Expected Goals & Player Analysis by Gabriel Manfredi
- Building an Expected Goals Model in Python by Peter McKeever (using WayBackMachine);
- An xG Model for Everyone in 20 minutes (ish) by Football Fact Man (Paul Riley).
- Sam Green's xG model;
- Michael Caley's xG model; and
- 11tegen (Sander IJtsma)'s xG model (using WayBackMachine).
For a collated list of Expected Goals literature collated by Keith Lyons, see the following [link]
- xG explained by FBref;
- What are expected Goals? by American Soccer Analysis;
- David Sumpter's Expected Goals pieces:
- Michael Caley's Expected Goals pieces:
- Jesse Davis and Pieter Robberechts' Expected Goals pieces for KU Leuven;
- Does xG really tell us everything about team performance? by Ben Torvanay;
- Unexpected goals Will Gürpinar-Morgan;
- Great Expectations by Will Gürpinar-Morgan;
- On single match expected goal totals by 2+2=11 (Will Gürpinar-Morgan]);
- Martin Eastwoood (Pena.lt/y)'s Expected Goals pieces [link];
- Expected Goals For All.
- Actual Goals Versus Expected Goals;
- Expected Goals Updated;
- Expected Goals: The Y Axis;
- Expected Goals And Exponential Decay;
- Expected Goals: Foot Shots Versus Headers;
- Expected Goals And Support Vector Machines;
- Expected Goals and Uncertainty; and
- Sharing xG Using Multi-touch Attribution Modelling.
- Garry Gelade's Expected Goals pieces:
- Expected Goals and Unexpected Goals (using WayBackMachine);
- Assessing Expected Goals Models. Part 1: Shots (using WayBackMachine);
- Assessing Expected Goals Models. Part 2: Anatomy of a Big Chance (using WayBackMachine);
- 11tegen (Sander IJtsma)'s Expected Goals pieces:
- A close look at my new Expected Goals Model (using WayBackMachine);
- The best predictor for future performance is Expected Goals (using WayBackMachine);
- Ted Knutson's Expected Goals pieces:
- Anatomy of a Shot by Thom Lawrence;
- Modern Fitba's Expected Goal Guides Part 1 and Part 2 by Christian Wulff;
- How StatsBomb Data Helps Measure Counter-Pressing by Will Gürpinar-Morgan;
- A Shooting Model – An Exp(G)lanation and Application by Paul Riley;
- Introducing xGChain and xGBuildup by Thom Lawrence;
- Introduction Expected Goals on Target (xGoT) by Jonny Whitmore;
- Quantifying finishing skill by Marek Kwiatkowski;
- The Dual Life of Expected Goals (Part 1) by Mike L. Goodman;
- Many bad shots or one good shot? by Luis Husier;
- Expected Goals Just Don’t Add Up — They Also Multiply. by Danny Page;
- An analysis of different expected goals models by Benjamin Cronin;
- Expected Goals 3.0 Methodology by Matthias Kullowatz;
- A simple Expected Goals model by Cricket Savant;
- How we calculate Expected Goals (xG) by Fantasy Football Fix; and
- Una mirada al Soccer Analytics usando R — Parte III by Dato Fútbol (Ismael Gómez Schmidt).
soccer-xg
by Jesse Davis and Pieter Robberechts at KU Leuven.
Expected Goals Thesis
by Andrew Rowlinson. See both his thesis [link] and the following notebooks:expected_goals_deep_dive
by Andrew Puopolo. See the following notebooks:soccer_analytics
by Kraus Clemens. See the following notebooks:xg-model
] by Dato Fútbol (Ismael Gómez Schmidt)xG_Model_Workflow
by Ian Dragulet
- Expected Goals Extravaganza by The Double Pivot podcast;
- Extreme nerding out over expected goals by The Double Pivot podcast;
- Explaing xGChain, evaluating defensive midfielders and more - it's the Mailbag by The Double Pivot podcast;
- Understanding why Burnley don't break expected goals by The Double Pivot podcast;
- #1: What Did You Expect? - Spotify by The Football Fanalytics Podcast
- Expected Goals and Expected Assists by Measureables;
- Advanced soccer metrics explained, how MLS teams use data, and more with Eliot McKinley by MLS Assist
- Rating players with expected goals from shot creation by smarterscout;
- Expected goals from ball progression and tactical applications by smarterscout;
- AVFC Extra #1 - xG, xA & PPG - The abbreviations of modern football explained by Claret & Blue podcast;
- Episode 3 - xG 101, West Ham in Trouble? Norwich Doomed? by Differentgame;
- Episode 5 - Shot Stoppers, xG at Corners, Building a Passing Model by Differentgame;
- What is xG by For the Love of Paul McGrath podcast;
- The Future of Stats: xG, xA - Spotify and YouTube by Tifo Podcast; and
- #56: Dominic Calvert-Lewin & Explaining Expected Goals - Spotify and YouTube by The Scouted Football Podcast.
- The benefits of including fake data in an Expected Goals model [link]; and
- Twitter thread by Jernej Flisar to build an Expected Goals model trained with Logistic Regression on StatsBomb Event data and using the model to predict Liverpool goals from Tracking-like data provided by Ricardo Tavares (Last Row View) for Friends of Tracking [link]. The model uses the SHAP library for feature interpretation.
- Scraping Premier League Football Data with Python by FC Python
- Football Data Visualizations - Passing Networks by Karol Działowski - a great blog post on how to create passing networks from first principles, specifically to webscraping, how to get Opta Event data from WhoScored. The second part of the blog shows how to create a passing network from this data using matplotlib.
- Python for Fantasy Football – Getting and Cleaning Data by Fantasy Futopia
- Intro to {polite} Web Scraping of Soccer Data with R! by Ryo Nakagawara
- Web Scraping Advanced Football Statistics by Sergi Lehkyi
- Web Scraping Football Data — Serverless Edition by Sergi Lehkyi
- How to Build a Football Dataset with Web Scraping by Otávio Simões Silveira
- How to scrape Understat for football data in Python with requests and BeautifulSoup by McKay Johns
- How to scrape SPORTS STATS websites with Python by John Watson Rooney
ScraperFC
- a Python package to scrape data from FBRef, Understat and FiveThirtyEight by Owen Seymour;Scrape-FBref-data
- Python library to scrape StatsBomb data via FBref by Parthe Athale, which in turn was updated from Christopher Martin's repository;tmscrape
- a Python TransferMarkt webscraper by danzn1;Tyrone Mings
- a Python TransferMarkt webscraper by FCrSTATS;worldfootballR
- a R package to allow users to extract various world football results and player statistics data from FBref and valuations and transfer data from TransferMarkt.com by Jason Zivkovic (see guide on how to use this package [link]);understat
- a Python webscraper by Amos Bastian; andunderstatr
- a R package to scrape data from Understat.
- Laurie Shaw's Metrica Sports Tracking data series for Friends of Tracking - Introduction, Measuring Physical Performance, Pitch Control modelling, and Valuing Actions. See the following for code [link];
- How Tracking Data is Used in Football and What are the Future Challenges with Javier Fernández, Sudarshan 'Suds' Gopaladesikan, Laurie Shaw, Will Spearman and David Sumpter for Friends of Tracking;
- Introduction to tracking data in football. by David Sumpter for Friends of Tracking;
- Learning to Watch Football: Self-Supervised Representations for Tracking Data by Karun Singh. See accompanying blog post [link];
- On Tracking Data, the Nature of Soccer, and Allocation by Tiotal Football, as part of their Absolute Unit newsletter;
- How Hoffenheim are helping to democratise tracking data by Training Ground Guru.
Pitch Control modelling and Valuing Actions tutorials by Laurie Shaw as part of his Metrica Sports Tracking data series for Friends of Tracking. See the following for code [link];
Metrica-pitch-control
by Will Thompson - a Python implementation of Javier Fernández and Luke Bornn's Pitch Control model from their paper Wide Open Spaces: A statistical technique for measuring space creation in professional soccer (2018) and Will Spearman's Pitch Control model from his paper Beyond Expected Goals (2018). The respectively Google Colab notebooks are available [link] and [link];
- Everything you need to know about 'pitch control' by Mark Thompson;
- A Framework for the Fine-Grained Evaluation of the Instantaneous Expected Value of Soccer Possessions (2020) by Javier Fernández, Luke Bornn and Daniel Cervone;
- Decomposing the Immeasurable Sport: A deep learning expected possession value framework for soccer (2019) by Javier Fernández, Bornn, and Dan Cervone. Accompanying talks - SSAC19, StatsBomb conference;
- Beyond Expected Goals (2018) by Will Spearman;
- Replaying the NBA (2018) by Luke Bornn
- Wide Open Spaces: A statistical technique for measuring space creation in professional soccer (2018) by Javier Fernandez and Luke Bornn;
- Physics-Based Modeling of Pass Probabilities in Soccer (2017) by Will Spearman, Austin Basye, Greg Dick, Ryan Hotovy, and Paul Pop;
- Will Spearman's masterclass in Pitch Control for Friends of Tracking;
- How to create the mathematically perfect press using pitch control. by David Sumpter for Friends of Tracking.
- How each Premier League team pass by John Muller for The Athletic
- Interactive Passing Networks by Karun Singh
- Explaining xGChain Passing Networks by Ted Knutson
- Using Network Science to Quantify the Identifiability of Football Teams by Javier M. Buldú, David Garrido
- The role of passing network indicators in modeling football outcomes: an application using Bayesian hierarchical models by Riccardo Ievoli, Aldo Gardini & Lucio Palazzo
- Football Data Visualizations - Passing Networks by Karol Działowski - a great blog post on how to create passing networks from first principles, using Opta Event data acquired from WhoScored. This data is then visualised using matplotlib
- Creating a Passmap in Python by Abhishek Sharma
- Football passing networks using R by Dato Fútbol (Ismael Gómez Schmidt)
- How to Render 3D Football Pass Network by Daniel Linke
- Plotting a passing network on a football pitch by Alberto Rodríguez Martín
- Medium blog post by Rahul Iyer - Guide to Creating Passing Networks in Tableau
- How to Create Football Pass Networks in Python by McKay Johns
- Creating passing networks with Barcelona data scientist by Serio Llana, to build customizable passing networks with matplotlib for Friends of Tracking. The code is prepared to use both eventing (StatsBomb) and tracking data (Metrica Sports). See the corresponding GitHub repository -
passing-networks-in-python
- Introduction to Soccer Pass Network Analysis with Python
- Premier League Passing Networks by John Muller for The Athletic
- Pass network EPL 2019/20 by Cheuk Hei Ho
- Pass Network now with xT by Matt Trevillion
- A Framework for Tactical Analysis and Individual Offensive Production Assessment in Soccer Using Markov Chains (2011) by Sarah Rudd;
- Attacking Contributions: Markov Models for Football by Derrick Yam for StatsBomb;
- Introducing a Possession Value Framework by Nils Mackay for Stats Perform;
- Expected Potential (xPo) by Aditya Kothari;
- Deep Soccer Analytics: Learning an Action-Value Function for Evaluating Soccer Players by Guiliang Liu, Yudong Luo, Oliver Schulte, Tarak Kharrat;
- Evolving Our Possession Value Framework by Jonny Whitmore for Stats Perform; and
- Why Possession Value Is Bollocks by Paul Riley.
- Introducing Expected Threat (xT) by Karun Singh. Check out also as an unrolled Twitter thread [link] Karun's Twitter thread for the many resources out there around this topic, including: Episode 19 of The Football Fanalytics Podcast, Karun's StatsBomb conference presentation [link] and slides [link], Rob Hickman's StatsBomb conference presentation where he extended xT to take defensive risk into account [link], Last Row View (Ricardo Tavares)'s blog post for evaluating off-the-ball player movements by combining xT and tracking data, and Karun's xT values as a 12x8 grid to download as a JSON file [link].
- Introducing ‘expected threat’ (or xT), the new metric on the block by Tom Worville
- Explaining Expected Threat by David Sumpter
- Football's New Stat - What is Expected Threat? by Tifo
- How to Calculate Expected Threat (xT) in Python by McKay Johns. See the corresponding GitHub repository [link] and Jupyter notebook [link]
- Implementing Expected Threat (xT) in Julia by Abhishek Sharma
- Lotte Bransen and Jan Van Haaren's 'Valuing Actions in Football' series for Friends of Tracking - Valuing Actions in Football: Introduction, Valuing Actions in Football 1: From Wyscout Data to Rating Players, Valuing Actions in Football 2: Generating Features, Valuing Actions in Football 3: Training Machine Learning Models, and Valuing Actions in Football 4: Analyzing Models and Results. See the following for code [link];
- STARSS: A Spatio-Temporal Action Rating System for Soccer by Tom Decroos, Jan Van Haaren, Vladimir Dzyuba, Jesse Davis;
- Actions Speak Louder Than Goals: Valuing Player Actions in Soccer (V1) by Tom Decroos, Lotte Bransen, Jan Van Haaren, Jesse Davis; and
- Actions Speak Louder Than Goals: Valuing Player Actions in Soccer (V2) by Tom Decroos, Lotte Bransen, Jan Van Haaren, Jesse Davis.
- Goals Added: Introducing a New Way to Measure Soccer by John Muller for American Soccer Analysis;
- The future of possession value models with David Sumpter, Catherine Pfaff, Matthias Kullowatz and Jernej Flisar for Friends of Tracking. The Goals Added (g+) model is focussed on in minutes 9-45 of the lecture.
- Modelling Association Football Scores and Inefficiencies in the Football Betting Market (1997) by Mark Dixon and Stuart Coles
- Analysis of football prediction methods by William Brojanigo
- Predicting Football Results Using Python and the Dixon and Coles Model by Martin Eastwood
- Dixon Coles and xG: together at last by Ben Torvaney
- A generic Dixon-Coles model for estimating team strengths by Ben Torvaney
- Dixon Coles by Mathematical Football Predictions
- Dixon Coles Model by Philip Winchester
- Predicting Football Results With Statistical Modelling: Dixon-Coles and Time-Weighting by David Sheehan
- The Seven Styles of Soccer by John Muller
- You Down With t-SNE? by Eliot McKinley and Cheuk Hei Ho for American Soccer Analysis
- Tweet to Clustering European Teams by Behaviors by Cheuk Hei Ho
- Defining Player Roles: How Every Player Contributes to Goals by Michael Imburgio for American Soccer Analysis
- The Relentless, the Chaotic, and the Bus Conductors by James McMahon
- Similar Player Tool by Niklas Hemmer
- The Bargain Bin Bielsa Machine by James McMahon
- Using Machine Learning To Find Players In Similar Roles In Scotland by Matt Rhein
- Comparing Players: Clustering and Style of Play by American Soccer Analysis
- Clustering Playing Stles in the Modern Day Fullback by Mark Carey and Mladen Sormaz
- Finding a replacement for Gerard Pique using Machine Learning by Malhar B.
- Grouping Soccer Players with Similar Skillsets in FIFA 20 by Jaemin Lee
- Clustering Football Players by Using FIFA 19 Data by Oğuz Can Yurteri
- Google Research Football: A Novel Reinforcement Learning Environment (2020) by Karol Kurach, Anton Raichuk, Piotr Stańczyk, Michał Zając, Olivier Bachem, Lasse Espeholt, Carlos Riquelme, Damien Vincent, Marcin Michalski, Olivier Bousquet, Sylvain Gelly;
Google Research Football
GitHub repo;- Google Research Football with Manchester City F.C. Kaggle Competition (ended October 2020)
- Karol Kurach - Google Research Football
- Karol Kurach (Google Brain) "Google Research Football: Learning to Play Football with Deep RL
- Google Research Football by Piotr Stanczyk;
- Google's AI Plays Football…For Science! by Two Minute Papers
- Identifying Play Styles of Football Players Based on Match Event Data (2021) by Mark Riezebos
- Soccer Analytics Meets Artificial Intelligence: Learning Value and Style from Soccer Event Stream Data (2020) by Tom Decroos
- SoccerMix: Representing Soccer Actions with Mixture Models (2020) by Tom Decroos, Maaike Van Roy, and Jesse Davis
- Actions Speak Louder Than Goals: Valuing Player Actions in Soccer (2019) by Tom Decroos, Lotte Bransen, Jan Van Haaren, and Jesse Davis
- Player Vectors: Characterizing Soccer Players’ Playing Style from Match Event Streams (2019) by Tom Decroos and Jesse Davis (discussed in the DeepMind blog: Advancing sports analytics through AI research)
- Automatic Discovery of Tactics in Spatio-Temporal Soccer Match Data (2018) by Tom Decroos, Jan Van Haaren, and Jesse Davis
- Distinguishing Between Roles of Football Players in Play-by-play Match Event Data by Bart Aalbers and Jan Van Haaren
- Analysis of association football playing styles: An innovative method to cluster networks (2018) by Jacopo Diquigiovanni and Bruno Scrapa
- Predicting Soccer Highlights from Spatio-Temporal Match Event Streams (2017) by Tom Decroos, Vladimir Dzyuba, Jan Van Haaren, and Jesse Davis
- Game style in soccer: what is it and can we quantify it? by Adam Hewitt, Grace Greenham, and Kevin Norton
- Stats Perform Playing Styles - An Introduction by Andy Cooper for Stats Perform
- Introducing Role Discovery: Generating Data-Driven Roles in Elite Professional Football
- How Does the Context of the Game Impact the Style of Play in Football Teams? by Carlos Lago Peñas for Barça Innovation Hub
- Comparing Players: Clustering and Style of Play by Sam Goldberg
- Player Roles: How to find the right type of player for your team? by SciSports
- SciSports 22 Player Roles
- Clustering Playing Styles in the Modern Day Full-Back by Mark Carey and Mladen Sormaz
- Learning Value and Style from Soccer Event Stream Data by Tom Decroos, as part of the Sports Analytics Lab at Harvard University and American Statistical Association Section on Statistics in Sports seminar series
- Presentation at SIGKDD 2019 | Actions Speak Louder than Goals: Valuing Player Actions in Soccer by Tom Decroos
- Making Offensive Play Predictable by Paul Power for the Opta Pro Forum 2021
- How to properly compare players by Paul Power for Training Ground Guru
- Measuring Style of Play in Football Using Statistics and Machine Learning by Xiaoyi Ji (Sia)
- Stats Perform Playing Styles playlist
Section created after seeing the following tweets and threads by Ashwin Raman ([link]) and Stuart Reid ([link])
- Dynamic Analysis of Team Strategy in Professional Football (2019) by Laurie Shaw and Mark Glickman. Accompanying talks - NESSIS 2019, 2020 Google Sports Analytics Meetup;
- Breaking Down Set Pieces: Picks, Packs, Stacks and More by Euan Dewar;
- Tactical Theory: Set-Pieces by István Beregi;
- Set-Piece Analysis: A comprehensive guide to zonal marking from corners by Cameron Meighan. See all his pieces [link];
- Changing How the World Thinks About Set Pieces by Ted Knutson;
- Set Pieces and Market Efficiency by Ted Knutson;
- The Blades’ Sharpest Edge: A look at Sheffield United’s 17/18 Set Pieces. by Oli Walker;
- Pieces by Marc Lamberts [link]; and
- Pieces by Stuart Reid [link].
- StatsBomb radar articles. For all articles, see the following: [link]
- Understanding StatsBomb Radars by StatsBomb (16/07/2021)
- New Team, Same Numbers: How Transfers Do (And Don't) Change Player Output by Tim Keech (06/03/2019)
- Introducing Goalkeeper Radars by Ted Knutson (11/12/2018)
- Radar Wars
- New Data, New StatsBomb Radars by Ted Knutson (03/08/2018)
- Revisiting Radars by Ted Knutson (18/05/2017)
- Understanding Football Radars For Mugs and Muggles by Ted Knutson (25/04/2016)
- Radar Wars - CASSIS Presentation Summer 2018
- Models for evaluating players part 2: Player radars by David Sumpter for Friends of Tracking
- Introducing Twenty3’s Dynamic Radars
- Radar Charts in mplsoccer
soccerplots
- a Python package that can be used for making visualizations for football analytics by Anmol Durgapal- Building a Radar Plot in ggplot2 by FC rSTATS
- Gerard Moore uses the Event Lab to analyse centre-backs for recruitment by Gerard Moore for Twenty3
- Using StatsBomb IQ For Player Recruitment: Centre Backs by StatsBomb (05/06/2021)
- Recruitment & Analysis at Melborne City: Optimising Key Processes using Data and Technology by Andy Cooper for Stats Perform
- Season Analysis & Summer Recruitment pieces
- Blackburn Rovers Season Analysis & Summer 2020 Recruitment by Andy Watson (see tweet [link]
- Motherwell Summer 2021 Recruitment Plan by Greg Marshall (see tweet [link]
- Nottingham Forest Recruitment Plan Summer 2021 by Liam Henshaw (see tweet [link]
- Sheffield Wednesday Recruitment Plan for the 21/22 season by Owls Analytics (see tweet [link]
- Celtic F.C. Opposition Report by Liam Bailey
- TransferMarkt values of leagues - Europe, Asia
- Deloitte Football Money League (DFML)
- FiveThirtyEight
- Stats Perform’s Power Rankings - ranks 19,008 clubs across 391 domestic leagues, based on a system that scores the current strongest side 100 and the weakest zero. The ratings evolve each day based on the results of both an individual club and those of other teams within its own domestic league. Domestic and European fixtures are taken into account, as well as the strength of each league. Access is not available online but there is theoccasional The Analyst article referring to the rankings - November 2021, October 2021 update
- Global Football Rankings by Ken Ackerson (Pear Apps), powered by FiveThirtyEight's Global Club Rankings
- UEFA Club Coefficients (official rankings). See also on the European Club Association (ECA) [link]
- UEFA European Cup Football Results and Qualification by Bert Kassies
- World Football Elo Ratings
- Club Elo Ratings - Methodology. See Episode 27: A Closer Looking into "Elo Ratings" by The Football Fanalytics Podcast
- Glicko Rating System by Mark Glickman
- Euro Club Index - Methodology
- World Football / Soccer Clubs Ranking by FootballDatabase
- the KA Club Rating - What are the KA Ratings? and Procedure
- Ben Torvaney's 2018 Opta Pro Forum talk 'Counting Across Borders' [link]. For slides, see Ben's blog post [link] or by from the Stats Perform blog using Waybackmachine [link] and for Ben's original submission, see [link]
- Tony El Habr's Soccer league Strength post (see also [link]
- Aditya Kothari (The Come On Man) uses differences in VAEP values of players who transferred between different leagues [link]
- Which is the most physically demanding league? by Training Ground Guru
- Soccer Power Index explained by ESPN staff
- Examining FiveThirtyEight’s Soccer Power Index Ratings
- Man Utd, Barcelona, Liverpool: Data analysts rank the top 30 clubs in world football by Kobe Tong
- PlayeRank: data-driven performance evaluation and player ranking in soccer via a machine learning approach by Luca Pappalardo, Paolo Cintia, Paolo Ferragina, Emanuele Massucco, Dino Pedreschi, Fosca Giannotti.
- Ranking soccer teams on the basis of their current strength: A comparison of maximum likelihood approaches by Christophe Ley, Tom Van de Wiele, and Hans Van Eetvelde.
- Mladen Sormaz's StatsBomb 2021 talk Practical tools for ‘Bridging the gap (see @ 20m42s)
- Tweets by AI Abucus [link] and [link]. They use a simple Dickson-Coles method focusing on historic results going back 15 years to build an order of hierarchy amongst teams in leagues that might have never played each other.
- On the anatomy of a counter-attack by Will Morgan. Also available at the following [link]
- Quantifying Player Contribution to Counter Attacks by Laurynas Raudonius. See his poster [link]
- Spotlight on: counter-attacks by The FA
- Evolution of Counterattacking by Adin Osmanbašić
- The Various Forms of Restdefences Part 2: Counterattacking
- Counter- or Gegenpressing
- Pressing, counterpressing, and counterattacking by Adin Osmanbašić. Also available at the following [link]
- Tactical Analysis: Defending Against the Counter Attack
- Stats Perform Playing Styles - An Introduction - see the 'Counter Attack' subsection
- Analysis: Leicester City Counter Attack
- The Importance of Counter-Attacking in Football by Max Bergmann
- Premier League Club Stats - Goals From Counter Attack
- InStat Sport Facebook post on counter attacking
- Quantifying the Value of Transitions in Soccer via Spatiotemporal Trajectory Clustering by Jennifer Hobbs, Paul Power, Long Sha, Hector Ruiz, Patrick Lucey
- Evaluating Football Player Actions During Counterattacks
- Counter attack detection with machine learning from log files of RoboCup simulation
- Why More Teams Should Counter-Attack | By The Numbers by Tifo
- Carlos Carvalhal • Fast attacks: counter-attacking to organised possession • CV Academy Session
- Dean Wright • Norwich City under-15; Counter-attacking • CV Academy coaching course
- Sit deep and play on the counter-attack | Football tactics | Nike Academy
- How to hit a team on the counter-attack | Soccer drill | Tactics | Nike Academy
- Tactics Explained: Tottenham's counter-attack
- How To Score The Perfect Counter-attack Goal? by Bundesliga
- Course - Counter Attacking Masterclass - Part 1 (HD)
- Course - Counter Attacking Masterclass - Part 2 (HD)
- Wenger's tips: Counter-attacking
- Fast As Lightning Counter-attacks | Premier League | Salah, Aguero, Martial
- Top 10 Counter Attack Goals RB Leipzig - Werner & Co. with Superfast Transitions by Bundesliga
- Top 10 Counter-Attacking Goals 2020/21 so far – Haaland, Gnabry & More by Bundesliga
- Alcacer, Reus & Co. - Top 10 Counter-Attack Goals 2018/19 So Far by Bundesliga
- Top 10 Counter-Attacking Goals - 2015/16 by Bundesliga
- Top 10 counter attack goals - including Lionel Messi v Arsenal
- The best counter-attacking team in Europe according to Pep Guardiola | Oh My Goal
- What is Ball-Oriented Defending: How to defend, press and actively win the ball feat. Rangnick, Klopp & Nagelsmann
- Pressing, counterpressing, and counterattacking by Adin Osmanbašić. Also available at the following [link]
- Counterpressing variations
- Pressing, counterpressing, and counterattacking by Adin Osmanbašić. Also available at the following [link]
- The Question: is the counter-counter more crucial than the counterattack? by Jonathan Wilson
- Data-driven detection of counterpressing in professional football [2021] by Pascal Bauer and Gabriel Anzer.
- The DePO Models: Bringing Moneyball to Professional Soccer by Sam Goldberg and Mike Imburgio
- TransferRoom Expected Transfer Value (xTV) Webinar - webinar by Daniel Blades of TransferRoom, providing insight into the workings of xTV, a metric created by TransferRoom to enable a data-led measure of a player’s value in the transfer market. The webinar covers: What is xTV? How is it calculated? Does it differ from other valuation methods? How does xTV benchmark players? What is the reliability of xTV? How does xTV help clubs and agents? What are some real-life examples of xTV?
- TransferMarkt: Transfermarkt Market Value explained - How is it determined?. See also the paper The Wisdom of Crowds and Transfer Market Values by Dennis Coates and Petr Parshakov
- Football Benchmark by KPMG: KPMG Methodology and limitations of published information
- SciSports: SciSports’ transfer fee prediction model
- CIES Football Observatory: Scientific evaluation of the transfer value of football players by Drs Raffaele Poli, Loïc Ravenel and Roger Besson
- sportskeeda](https://www.sportskeeda.com/): Transfer Values - Calculation explained
- Football Transfer: Data & Algorithms
- The Transfer List - UEFA Market Value Calculator, CONMEBOL Market Value Calculator, CONCACAF Market Value Calculator, and Intra-MLS Market Value Calculator
- The Wisdom of the Crowd: Soccer takes the player valuations posted on the website Transfermarkt extremely seriously. It has never really stopped to ask where they come from by Rory Smith for The New York Times
- Soccer’s Confounding Calculation: What’s a Player Worth? by Rory Smith for The New York Times
- The DePO Models: Bringing Moneyball to Professional Soccer by Sam Goldberg and Mike Imburgio
- Transfer window analysed: Less spent, young players targeted and free agents have defined key moves by Tom Worville
- How do you value a player? by Stuart James
- How to value the modern footballer: Algorithms, cheat codes and the scientification of the transfer market by Lawrence Ostlere
- How the volunteers of data website Transfermarkt became influential players at European top football clubs by Pepijn Keppel and Tom Claessens
- From Sessegnon to Sanchez: How to calculate the correct market salary for EPL players by Laurie Shaw
- Money Madness: The secrets of a how football player’s transfer value is calculated by Ash for Football Whispers
- Player Valuation: Putting Data to Work on Transfer Market AnalysisFootball Benchmark
- How much do you value your favorite football star? by Johannes Post
- Knutson's Transfer Model Review by StatsBomb
- Analyzing and Identifying Markets for Transfers by Steven Marc Scott
- Determining Player Contracts Based on Player Values Over Time by Steven Marc Scott
- Analyse af Superliga tranfervinduet 2021 by Christian Rønsholt. See the following GitHub [link] for his project analysing the transfer windows in the Danish Superliga from 2010 to 2021
- FIFA considering using AI technology to calculate transfer fees by Paul Macdonald
- Market value of football players: what is important? by Callum Williams
- How Players are Valued
- Is It Possible to Predict Football Players’ Value by Burak Arslan
- Using machine learning to identify high-value football transfer targets by Jack Tattersall
- Predicting Transfer Fee of a Football Player by Yahya Yavuz
- Beyond crowd judgments: Data-driven estimation of market value in association football by Oliver Müller, Alexander Simons, and Markus Weinmann
- Football Player’s Performance and Market Value by Miao He, Ricardo Cachucho, and Arno Knobbe
- A novel machine learning method for estimating football players’ value in the transfer market by Iman Behravan andSeyed Mohammad Razavi
- Predicting player transfers in the small world of football by Roland Kovas and Laszlo Toka
- Predicting Market Value of Football Players using Machine Learning Algorithms by Sidharrth Mahadevan
- Estimating transfer fees of professional footballers using advanced performance metrics and machine learning by Ian G.McHale, and Benjamin olmes
- Predict the Value of Football Players Using FIFA Video Game Data and Machine Learning Techniques by Mustafa A. Al-Asadi and Sakir Tasdemır
- Econometric Approach to Assessing the Transfer Fees and Values of Professional Football Players by Raffaele Poli, Roger Besson, and Loïc Ravenel
- Identification of Factors Determining Market Value of the Most Valuable Football Players by Sebastian Majewski
- Predicting Market Value of Soccer Players Using Linear Modeling Techniques by Yuan He
- A Machine Learning Ensembling Approachm to Predicting Transfer Values by Ayse Elvan Aydemir, Tugba Taskaya Temizel, and Alptekin Temizel
- The Determinants of Football Transfer Market Value: An Age of Financial Restraint by Thomas Preston
- Modelling the transfer prices of football players by Ivo Hendriks
- A study of Prediction models for football player valuations by quantifying statistical and economic attributes for the global transfer market by Dibyanshu Patnaik, Harsh Praharaj, Kartikeya Prakash, and Krishna Samdani
- A Modelling Analysis of Transfer Fees from the 2019/20 Premier League Season by Callum Littler
football_scout_ml
R scripts by Jack Tattersall - 01_load_data.R, 02_train_xgboost.R, 03_xgboost_predict.R, randomforest.R. See the accompanying Medium post: Using machine learning to identify high-value football transfer targetsPlayerValueEstimator
by Burak Arslan. See the accompanying GitHub repo [link] and Medium Blog: Is It Possible to Predict Football Players’ Value
- Capology - a sports salaries platform
- KPMG Football Benchmark player valuation data
- spotrac - player contracts, salaries, and transfer information for the Premier League, MLS, and NWSL
- TransferMarket player bio and fiscal data
- This data can be scraped in the following ways:
- Python:
Tyrone Mings
webscraper by FCrSTATS (I've currently submitted a pull request to fix issues with this library to scrape bio-status data, see my TransferMarkt scraping notebook for code with minor fixes to enable code to run). - R:
worldfootballR
package by Jason Zivkovic (see guide [link])
- Python:
- An extract of player data for 2010-2021 for the 'Big 5' European leagues has been made available by John Muller, see [link]
- This data can be scraped in the following ways:
- Soccerway (transfer values)
transfers
GitHub repo for European football clubs' player transfers from 1992/93-2020/21 (as per TransferMarkt) by ewenme- Player Transfer Data collated by Tom Worville (see Tweet [link]
- The Price of Football Master Spreadsheet - data from the finance/business aspect of football by Kieran Maguire
Tyrone Mings
TransferMarkt webscraper by FCrSTATStmscrape
TransferMarkt.de webscraper by znstriderworldfootballR
package by Jason Zivkovic that include TransferMarkt webscraping (see guide [link])football_scout_ml
- a repo that uses machine learning to identify high-value targets in the football transfer market by James Tattersall. See the accompanying Medium Post [link]- Predicting Football Player Transfer Values by Sanjit Varma
DSbootcamp-Project2
by Burak Arslan. See the accompanying Medium Blog: Is It Possible to Predict Football Players’ ValueFootball_Transfers
by . GitHub repo for understanding trends in football transfers through an attempt to build a prediction model to predict the market value of players using Python
- Predicting Soccer Player Transfer Values by Sanjit Varma
- A Bayesian Approach to In-Game Win Probability by Jesse Davies, Pieter Robberechts, and Jan Van Haaren for DTAI Sports Analytics Lab. See paper [link]
- Who Will Win It? An In-game Win Probability Model for Football by Pieter Robberechts, Jan Van Haaren, and Jesse Davies for DTAI Sports Analytics Lab
- Explaining Live Win Probability (LWP) by Jonny Whitmore for The Analyst
- Opta's Live Win Probability Model on Amazon Prime Video by Alex Jennings for Stats Perform
- We Have a New Win Probability Model by Tyler Richardett for American Soccer Analysis
- Explained: What is ‘win probability’ and how does it work? by Alex Jennings for FourFourTwo
- Win probability Wiki
- FiveThirtyEight’s 2018 World Cup Predictions featuring win probability visualisations
- Anything by Dr. John Harrison. See his pieces on Goalkeeper.com. See his analysis being used on Sky Sport's Monday Night Football [link]
- Profiling keepers with data by Victor Renaud
- Intro to Goalkeeper Analysis by Ted Knutson (08/11/2018)
- Articles by Paul Riley
- What’s a Goalkeeper Worth to His Team? (06/07/2012)
- Things to watch differently in the Premier League this season #2 – Goalkeepers (06/10/2012)
- Safe hands? Is your keeper performing as well as expected? (11/06/2014)
- Courtois and Pickford: The Tall and Short of Keeper Styles (13/08/2018)
- Turning Theory Into Practice: Paul Riley Meets Swedish National Goalkeeping Coach Maths Elfvendal (04/12/2018)
- The Unique (and Not so Unique) Challenges of Goalkeeping in Women's Soccer (16/11/2018)
Thanks to all those that have kindly wrote about or promoted this GitHub repository. See:
- Articles:
- Social Media:
- WHERE TO LEARN FOOTBALL ANALYTICS? by Irfan Alghani Khalid
- The following LinkedIn post by Hadi Sotudeh
- The following LinkedIn post and Kaggle post by Ekrem Bayar
- The following Tweet by Tom Worville. Check out his Twitter thread on getting started in football analytics
- The following Tweet by Jan Van Haaren
- The following Tweet by Joe Gallagher
- The following Tweet by Hadi Sotudeh
- The following Tweet by Ninad Barbadikar
- The following Tweet by Tim Keller
- The following Tweet by The Devil's DNA
- The following Tweet by Brian McDonnell
- The following Tweet by Panda_9097
- Blogs/Reddit:
- Moneyball - Section Football Analytics by Alberto Riccardi
- Pra quem gosta de análise de dados no futebol by Wagner Andrade
- datasets on football or sports in general by nichtgefunden
- Ask Anything Thread by MatchAnalyst
- How do the top 5 leagues in Europe differ in overall style?
- New York City FC vs New York Red Bulls Data Analysis Report by MatchAnalyst
- Weekly Open Thread - General Discussion by messimisses
- GitHub Repositories:
This GitHub repository and resources list will be a constant work in progress so if you can think of any resources that I've missed, feel free to create a pull request or send me a message @ edd.j.webster@gmail.com or @eddwebster.
If you're new to creating a pull request, please follow these steps (based on this)
-
Create an account on GitHub if you do not already have one.
-
Fork the project repository: click on the ‘Fork’ button near the top of the page. This creates a copy of the code under your account on the GitHub user account. For more details on how to fork a repository see this guide.
-
Clone your fork of the football_analytics repo from your GitHub account to your local disk:
git clone https://github.com/<github username>/football_analytics.git cd football_analytics
-
Create environment with:
$ python3 -m venv my_env
or$ python -m venv my_env
or with conda:
$ conda create -n my_env python=3
-
Activate the environment:
$ source my_env/bin/activate
or with conda:
$ conda activate my_env
-
Add the upstream remote. This saves a reference to the main hyperopt repository, which you can use to keep your repository synchronised with the latest changes:
$ git remote add upstream https://github.com/eddwebster/footbal_analytics.git
You should now have a copy of the football analytics repository, and your git repository properly configured. The next steps now describe the process of modifying code and submitting a pull request:
-
Synchronize your master branch with the upstream master branch:
git checkout master git pull upstream master
-
Create a feature branch to hold your development changes:
$ git checkout -b my_change
and start making changes. Always use a feature branch. It’s good practice to never work on the master branch!
-
Then, once you commit ensure that git hooks are activated (Pycharm for example has the option to omit them). This can be done using pre-commit, as follows:
pre-commit install
-
Develop the feature on your feature branch on your computer, using Git to do the version control. When you’re done editing, add changed files using git add and then git commit:
git add modified_files git commit -m "my first football_analyitcs commit"
-
Record your changes in Git, then push the changes to your GitHub account with:
git push -u origin my_change
Star history for the football_analytics
repository.
- Soccer Analytics Handbook by Devin Pleuler;
- Awesome Soccer Analytics by Matias Mascioto;
- Jan Van Haaren's Soccer Analytics 2021 Review;
- Jan Van Haaren's Soccer Analytics 2020 Review;
- Jan Van Haaren's
soccer-analytics-resources
Github repo; awesome-readme
repository by Matias Singers used to restyle this README; and- Excel spreadsheet version of this README by Melanie Loeper link.