Prediction of Football Match Score and Decision Making Process
Prediction of Football Match Score and Decision Making Process
Prediction of Football Match Score and Decision Making Process
Pramila Shinde
Assistant Professor Lalit Kumar Teli
IT-SAKEC IT-SAKEC
Mumbai, India. Mumbai, India.
[email protected] [email protected]
Abstract— Football isone of the most popular sport internationally. With that amount of popularity and fame, it has gained a lot of attention
from the technical field. Prediction of match results has been in demand for various purposes including team management, analysis and betting.
We have come up with a solution using machine learning algorithms that can fulfil all the current needs in football match prediction. The
implementation only includes teams from Spanish La Liga over the last 5 seasons. We have predicted the outcome of matches between Home
Team and Away Team which would include the final score, the starting 11 players, the substitutes and the names of probable goal scorers. For
the purpose of analysis, we have provided the stats of players and teams referring to the FIFA 18 game database as well as their actual career
stats. We have also provided the analysis of strength, weakness and tactics of players and teams. Finally, for decision making purposes, we
would make a system that can analyse the Home and Away team and then suggest the tactics to the user for their team that can maximize their
winning chances.
_________________________________________________*****_________________________________________________
B. Implementation
The system has been implemented using several machine
learning algorithms (Logistic Regression, Random Forest,
Artificial Neural Network, Linear SVM and Naïve Bayes)
using Python Sci-Kit Learn library. The training and testing is
done on the complete dataset using Cross-Validation function
with cv = 10. The main part of our implementation was
concentrated on database pre-processing and attribute
selection.
The Match History Database includes 12 attributes: Home
Team ID, Home Shots, Home Shots on Target, Home
Corners, Home Yellow Cards, Home Red Cards and similar 6
attributes for Away team. A total of 29 teams have
participated in La Liga between 2012-2017. These teams have
been assigned an ID from 1-29 based on their performance in
the 5 seasons where the best team ID starts from 1.
As the 13th attribute we used the Home Team’s win
percentage against the given Away Team, which was obtained
from the Team Vs Team Database. It proved to be successful
by increasing the accuracy of the model by almost 8%. This
idea was taken from the fact that every team has a playing Figure1. Project Flow Diagram
style and also has rivals. So, even though a team is better than
the other team in everything, but due to the rivalry or due to
C. Results Obtained
their respective playing styles, the result can be a loss. Hence,
TABLE 1. ALGORITHM ACCURACY TABLE
the inclusion of Team ID and Head to Head win percentage
LR RF ANN Linear NB
was successful. SVM
Further, we used the Goals History database to predict the Match 63.94% 61.53% 63.1% 58.25% 58.63%
exact score of both the teams at the end of the match using History
Logistic Regression and the accuracy was found out to be Database
69.87%. After this, we used the statistical and random Match 71.63% 69.9% 69.2% 66.95% 63.57%
simulation approach to find the goal scorers. The team and History
player analysis were displayed directly from the databases and Database
represented graphically as in [5]. +
Later we would make the player evaluation system to find Team Vs
whether a new player can fit in a squad and also their transfer Team
Database
value. Finally, the decision support system will be
164
IJRITCC | February 2018, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 6 Issue: 2 162 - 165
______________________________________________________________________________________
LR = Logistic Regression VI. REFERENCES
RF = Random Forest [1] Igiri,Chinwe Peace, Nwachukwu, Enoch Okechukwu,“An
ANN = Artificial Neural Network Improved Prediction Systemfor Football a Match Result”
IOSR Journal of Engineering (IOSRJEN), Vol. 04, PP 12-
NB = Naïve Bayes
20.
SVM=Support Vector Machine [2] FarzinOwramipur, ParinazEskandarian,and
FaezehSadatMozneb,“Football ResultPrediction with
The Accuracy specified in the table is the average of the Bayesian Network in Spanish LeagueBarcelona Team”
accuracy for Home Win, Away Win and Draw classifications. InternationalJournal of Computer Theory and Engineering,
Vol. 5, No. 5, October 2013.
[3] Darwin Prasetio, Dr.Harlili,“Predicting Football
D. Regression Coefficients MatchResults with LogisticRegression”, Institute Teknologi
Bandung, 2016.
TABLE 2. REGRESSION COEFFICIENTS TABLE [4] Josip Hucaljuk, AlenRakipović,“Predicting football scores
Home Win Away Win Draw using machine learningtechniques”, University of Zagreb,
HT ID -0.027 -0.008 0.020 Faculty of Electrical Engineering and Computing Unska 3,
May 23-27, 2011.
AT ID 0.009 -0.015 -0.004 [5] Jongho Shin, Robert Gasparyan, “A novel way to Soccer
HT Win 4.613 -4.836 -0.630 Match Prediction”.
percentage
vs AT
HT Shots -0.112 0.072 0.030
AT Shots 0.053 -0.089 0.024
HT Shots on 0.449 -0.275 -0.193
Target
AT Shots on -0.347 0.560 -0.193
Target
HT Corners -0.155 0.083 0.060
AT Corners 0.089 -0.117 0.016
HT Yellow -0.036 0.027 0.034
Cards
AT Yellow -0.087 0.017 0.104
Cards
HT Red -0.650 0.435 0.128
Cards
AT Red 0.489 -0.722 0.093
Cards
V. CONCLUSION
We implemented the model using different machine learning
algorithms and were able to reach the accuracy of 71.63%
with Logistic Regression on the Match History Database of 5
seasons along with the Team Vs Team Database.Referring to
related works, there are a lot of improvements that can be
made to our system for improving the accuracy of both, match
result prediction as well as goal prediction. Also, the two
predictions can be later combined to further improve the
model. We can also use the Machine Learning approach for
the Goal Scorer prediction.
165
IJRITCC | February 2018, Available @ http://www.ijritcc.org
_______________________________________________________________________________________