Right now, it looks like we check the directory the files are downloaded in to see if the file has already been scraped. This means if you move the file, you end up re-downloading it.
I suggest we instead store the downloaded files in a file a read that instead.
cc @khaliqgant