probability_from_postgres

Kaggle competition where the goal is to create an algorithm that detects the probability that an app was downloaded given an add was clicked

You can download the data here https://www.kaggle.com/c/talkingdata-adtracking-fraud-detection/data

As of right now, this algorithm counts the words in a given text, and than calculates the probability of each individual word in the given text. We have a couple million rows of testing data. Lets go ahaed and print our 20 rows to give an idea of what it is composed of:

Our training data has 186 million rows!!! Lets go ahead and print out a few rows to get a glimpse at what our traingin data looks like:

is_attributed cooresponds to whether an app has been downloaded or not. 0 means no downlad, 1 means app has been downloaded. This algorithm looks up an ip adress from the testing data in our postgres DB, gets the rows from the training data with the same IP adress, which is also in our postgres DB, then calcualtes the probability of that app being downloaded. The probability function is: P(APP DOWNLOADED | GIVEN IP ADRESS) * P(APP WAS DOWNLOADED). Our output should be a csv file. Lets take a look at what the values look like:

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md
Screen Shot 2018-05-07 at 11.15.08 PM.png		Screen Shot 2018-05-07 at 11.15.08 PM.png
Screen Shot 2018-05-07 at 11.27.22 PM.png		Screen Shot 2018-05-07 at 11.27.22 PM.png
Screen Shot 2018-05-07 at 11.37.58 PM.png		Screen Shot 2018-05-07 at 11.37.58 PM.png
click_fraud_from_postgres.py		click_fraud_from_postgres.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

probability_from_postgres

About

Uh oh!

Releases

Packages

Languages

Uh oh!

Uh oh!

bnicholl/probability_from_postgres

Folders and files

Latest commit

History

Repository files navigation

probability_from_postgres

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages