In this project, I built a machine learning model that predicts the total number of goal scorers at a Copa America tournament from the year it was held. I used a fairly small dataset going back to 1916, when the first tournament took place. Afterwards, I used GraphViz to render the decision tree model so that users can see how it arrives at its predictions.
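To make the idea concrete, here is a minimal sketch of the simplest possible decision tree for this task: a single split on the year, written in plain Python, with the result exported as GraphViz DOT text. It is not the project's actual code. The data values are made-up placeholders (not real Copa America figures), and the names `SAMPLE`, `fit_stump`, `predict`, and `to_dot` are hypothetical helpers introduced only for illustration.

```python
# Placeholder (year, scorer_count) pairs: illustrative values only,
# NOT real Copa America data.
SAMPLE = [(1990, 30), (1994, 34), (1998, 38), (2002, 50), (2006, 54), (2010, 58)]


def mean(values):
    return sum(values) / len(values)


def sse(values):
    """Sum of squared errors around the mean of the values."""
    m = mean(values)
    return sum((v - m) ** 2 for v in values)


def fit_stump(rows):
    """Fit a one-split regression tree: pick the year threshold that
    minimizes the total squared error of the two resulting leaves."""
    rows = sorted(rows)
    best = None
    for i in range(1, len(rows)):
        # Candidate threshold: midpoint between two neighboring years.
        threshold = (rows[i - 1][0] + rows[i][0]) / 2
        left = [y for x, y in rows if x <= threshold]
        right = [y for x, y in rows if x > threshold]
        error = sse(left) + sse(right)
        if best is None or error < best[0]:
            best = (error, threshold, mean(left), mean(right))
    _, threshold, left_value, right_value = best
    return {"threshold": threshold, "left": left_value, "right": right_value}


def predict(stump, year):
    """Return the mean of the leaf that the given year falls into."""
    return stump["left"] if year <= stump["threshold"] else stump["right"]


def to_dot(stump):
    """Describe the fitted tree in GraphViz DOT syntax."""
    lines = [
        "digraph Tree {",
        f'  root [label="year <= {stump["threshold"]}"];',
        f'  left [label="predict {stump["left"]}"];',
        f'  right [label="predict {stump["right"]}"];',
        '  root -> left [label="yes"];',
        '  root -> right [label="no"];',
        "}",
    ]
    return "\n".join(lines)


stump = fit_stump(SAMPLE)
print(predict(stump, 1995))
print(to_dot(stump))
```

Saving the printed DOT text to a file and running it through GraphViz's `dot` command (for example `dot -Tpng tree.dot -o tree.png`) would produce a picture of the tree. A full decision tree repeats the same split search inside each leaf, which a library implementation would normally handle.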
Lessons Learned
Although this project was fairly small, the lessons it taught me were not. The biggest was the importance of large datasets. Because the Copa America is held only once every few years, the dataset was far too small, which led to less-than-stellar accuracy. In the future, I will gather as much data as possible for my models to improve their predictive ability.