Data-driven team ranking and match performance analysis in Chinese Football Super League

2020 
Abstract Recent years have seen an increasing body of research into the evaluation of the team-level technical-tactical performance in association football using match events data. However, most studies used mono-dimensional approach and modeled the influence of each performance aspects on match result in isolation, which limited the interpretability of the results. The study was aimed to apply a state-of-the-art algorithm to the ranking of team performance and exploitation of key performance features in relation to match outcome based on massive match dataset. Data of all 1200 matches from 2014 to 2018 Chinese Football Super League (CSL) were used. From the original 164 match events, we extracted 22 features that were related to attacking, passing, and defending performance and most. A Linear Support Vector Classifier (LSVC) model was subsequently built with these 22 input features and trained in order to rank the teams by their performance and analyze the features that influence most match outcome (win/not win), with the dataset being divided into a ratio of 4:1 to train and validate the model. The results have shown that the data-driven LSVC model displayed a prediction accuracy of 0.83 and the ranking of teams’ match performance and prediction of teams’ league standings were highly correlated with their actual ranking. Saves, pass success and shot on target in penalty area were demonstrated as top positive features for winning whereas shots on target during open play, pass and bad shot% were three negative features most influential for the match result. The team ranks of all teams were highly correlated with their real final league rankings. In general, CSL winning teams build their success based on defensive ability and shooting accuracy, and high-ranked teams could always maintain better performance than their counterparts. The team-rank framework could provide a consolidated and complex approach to evaluate the match performance quality of the teams, refining decisions-making during match preparation and player transfer at different periods of the season.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    50
    References
    4
    Citations
    NaN
    KQI
    []