Resampling Unbalanced Datasets using the Credit Card Fraud Dataset

In this project I will show an example of how resampling can be useful for unbalanced datasets in binary classification problems. I'll be using a logistic regression model to demonstrate this. I'm aware that there are a vast amount of tools and libraries to deal with resampling and I strongly recommend that you use a combination of these methods to deal with unbalanced datasets. Here I would like to simply demonstrate the pros and cons of resampling using Sklearn's resample().

dataset: https://www.kaggle.com/datasets/yashpaloswal/fraud-detection-credit-card

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
README.md		README.md
project_notebook.ipynb		project_notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Resampling Unbalanced Datasets using the Credit Card Fraud Dataset

About

Releases

Packages

Languages

samarakoon-ryan/resampling-unbalanced-datasets

Folders and files

Latest commit

History

Repository files navigation

Resampling Unbalanced Datasets using the Credit Card Fraud Dataset

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages