AUTHOR=Anzer Gabriel , Bauer Pascal TITLE=A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer) JOURNAL=Frontiers in Sports and Active Living VOLUME=3 YEAR=2021 URL=https://www.frontiersin.org/journals/sports-and-active-living/articles/10.3389/fspor.2021.624475 DOI=10.3389/fspor.2021.624475 ISSN=2624-9367 ABSTRACT=

Due to the low scoring nature of football (soccer), shots are often used as a proxy to evaluate team and player performances. However, not all shots are created equally and their quality differs significantly depending on the situation. The aim of this study is to objectively quantify the quality of any given shot by introducing a so-called expected goals (xG) model. This model is validated statistically and with professional match analysts. The best performing model uses an extreme gradient boosting algorithm and is based on hand-crafted features from synchronized positional and event data of 105, 627 shots in the German Bundesliga. With a ranked probability score (RPS) of 0.197, it is more accurate than any previously published expected goals model. This approach allows us to assess team and player performances far more accurately than is possible with traditional metrics by focusing on process rather than results.