add script to split into specified train/val splits given a data directory
add script to split into specified train/val splits given a data directory