Prediction Basketball Games

Final project of the KTH ID2221 - Data-Intensive Computing course.

The aim of this project can be split into the following points:

Data preprocessing in order to develop a dataset of NBA rosters from 1990 to 2018.
Test different ML-based approaches in order to predict the final results of NBA matches along a season.

The first bulletpoint has been performed using Scala and Spark, using the API Ball Don't Lie. By means of the obtained historical data we have built our own dataset of rosters in base to the seasonal averages of the team players in each recorded metric.

The second point has been developed using Python, specifically some already provided methods of the library SciKit.

The results obtained with our approach are similar to other state-of-the-art methods.

In order to get more information about the method and its implementation, please review NBA Outcomes Predictor.

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
data		data
plots		plots
roster		roster
season_averages		season_averages
season_games		season_games
.gitignore		.gitignore
ApiRequest.py		ApiRequest.py
ApiRequest_players.py		ApiRequest_players.py
CleanRosterAvg.py		CleanRosterAvg.py
Features.py		Features.py
NBA_Outcomes_Predictor.pdf		NBA_Outcomes_Predictor.pdf
NbaPredictor.py		NbaPredictor.py
Pre_process_games.ipynb		Pre_process_games.ipynb
Pre_process_rosters.ipynb		Pre_process_rosters.ipynb
README.md		README.md
RosterGenerator.ipynb		RosterGenerator.ipynb
RosterLoader.py		RosterLoader.py
cleaner.sh		cleaner.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prediction Basketball Games

About

Releases

Packages

Contributors 2

Languages

adrian-camp/DataIntensive-Project-Prediction-BasketballGames

Folders and files

Latest commit

History

Repository files navigation

Prediction Basketball Games

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages