For dataset 41081, I get: ``` 1.1G dataset.arff 2.3G dataset.pkl.py3 ``` Maybe we should be using joblib instead? My hard drive is getting full because I'm trying to run CC-18 stuff, which is a bit annoying.