Automated Off-Policy Estimator Selection via Supervised Learning

Installation

First of all, clone this repository, then enter in the repository folder
Create the conda environment with all the dependencies needed:
- if cuda-based NVIDIA GPUs are available, run
```
conda env create -f environment_nvidia.yml
```
- otherwise run the following command (no GPUs support)
```
conda env create -f environment.yml
```
Download the datasets used in the experiments reported in the paper, that are
- the Open Bandit Dataset, available here
- the CIFAR10 Dataset, available here
Create the directories that will contain the datasets used in the experiments:
- Open Bandit Dataset:
```
mkdir -p real_datasets/open_bandit_dataset
```
Copy the downloaded dataset in the created directory.
- CIFAR-10
```
mkdir -p real_datasets/cifar10 
```
Unzip and copy the downloaded dataset in the created directory. Be sure that the data path is real_datasets/cifar10/cifar-10-batches-py/
Finally, run the code to generate the data, optimise and train the model, and execute the experiments.

In the following the commands for all the paper experiments are reported. For a more in depth explanation of each parameter you can take a look to the code.

Data Generation

To generate the synthetic data used to train the AutoOPE model, run this command:

cd src/black_box/data/
PYTHONPATH=../../ python3 binary_rwd_dataset_generation.py

Model Optimization

To optimize, train and test the AutoOPE model on the generated synthetic dataset, run this command:

cd src/black_box/estimator_selection/
PYTHONPATH=../../ python3 optimization.py --val_perc 0.25

Experiments

Below are reported the command line arguments used for each experiment:

Open Bandit Dataset experiment

cd src/black_box/evaluation/
PYTHONPATH=../../ python3 real_data_evaluation_run.py \
--dataset obd \
--subsampling_ratio 0.05 \
--n_data_generation 20  \
--outer_n_jobs 20

UCI Datasets experiments

The parameter <NAME> is set to a different value based on the dataset. The complete list of all the values is the following: 'letter', 'optdigits', 'page-blocks', 'pendigits', 'satimage', 'vehicle', 'yeast', 'breast-cancer'.

cd src/black_box/evaluation/
PYTHONPATH=../../ python3 real_data_evaluation_run.py \
--dataset <NAME> \
--subsampling_ratio 0.9 \
--n_data_generation 50  \
--outer_n_jobs 50

Cifar-10 experiment

cd src/common/evaluation/
PYTHONPATH=../../ python3 real_data_evaluation_run.py \
--dataset cifar10 \
--subsampling_ratio 0.66666 \
--n_data_generation 20  \
--outer_n_jobs 20

Synthetic experiments

$(\beta_1, \beta_2) = (2, -2)$

cd src/black_box/evaluation/
PYTHONPATH=../../ python3 synthetic_data_evaluation_run.py \
--beta_1 -2 \
--beta_2 2 \
--outer_n_jobs_gt 100 \
--inner_n_jobs 10 \
--outer_n_jobs 100

$(\beta_1, \beta_2) = (3, 7)$

cd src/black_box/evaluation/
PYTHONPATH=../../ python3 synthetic_data_evaluation_run.py \
--beta_1 3 \
--beta_2 7 \
--outer_n_jobs_gt 100 \
--inner_n_jobs 10 \
--outer_n_jobs 100

Additional Scaling and Ablation Experiments

To reproduce the scaling experiments reported in the paper, run this command:

cd src/black_box/estimator_selection/
PYTHONPATH=../../ python3 incremental_train_size_optimization.py \

To reproduce the ablation experiments on the features types reported in the paper, run this command with <FEATURE_TYPE> chosen from 'policy_dep', 'policy_indep', 'estimator_dep':

cd src/black_box/estimator_selection/
PYTHONPATH=../../ python3 optimization.py \
--features_subset <FEATURE_TYPE>

To reproduce the ablation experiments on the dataset diversity reported in the paper, run this command with <DATA_TYPE> chosen from 'KL', 'actions':

cd src/black_box/estimator_selection/
PYTHONPATH=../../ python3 optimization.py \
--features_subset <DATA_TYPE>

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
environment_nvidia.yml		environment_nvidia.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Automated Off-Policy Estimator Selection via Supervised Learning

Installation

Data Generation

Model Optimization

Experiments

Additional Scaling and Ablation Experiments

About

Uh oh!

Releases

Packages

Languages

License

remaplab/AutoOPE

Folders and files

Latest commit

History

Repository files navigation

Automated Off-Policy Estimator Selection via Supervised Learning

Installation

Data Generation

Model Optimization

Experiments

Additional Scaling and Ablation Experiments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages