The paper "Data Whitening Improves Sparse Autoencoder Learning" shows that whitening the input data during training results in better SAEs. This seems like an intuitive, almost obvious thing to do, so it's surprising nobody has done it before. We already support normalizing activations using normalize_activations="expected_average_only_in"; we could add an additional option, "covariance_whitening", implementing the technique from this paper.
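
To make the proposal concrete, here is a rough sketch of what a covariance whitening step could look like, using NumPy. It uses ZCA-style whitening (rotate into the eigenbasis of the covariance, rescale to unit variance, rotate back), which is an assumption on my part and may not match the exact variant used in the paper. The function names and the eps parameter are hypothetical and not part of any existing API.

```python
import numpy as np


def fit_whitening(activations, eps=1e-5):
    """Estimate a ZCA whitening transform from a batch of activations.

    activations: array of shape (n_samples, d_in).
    eps: small constant added to the eigenvalues for numerical stability
         (hypothetical default, would need tuning).
    Returns (mean, whiten, unwhiten), where whiten and unwhiten are
    (d_in, d_in) matrices that are inverses of each other.
    """
    mean = activations.mean(axis=0)
    centered = activations - mean
    # Unbiased sample covariance of the centered activations.
    cov = centered.T @ centered / (activations.shape[0] - 1)
    # The covariance is symmetric, so eigh applies.
    eigvals, eigvecs = np.linalg.eigh(cov)
    scale = 1.0 / np.sqrt(np.clip(eigvals, 0.0, None) + eps)
    # ZCA: rotate into the eigenbasis, rescale, rotate back.
    whiten = (eigvecs * scale) @ eigvecs.T
    unwhiten = (eigvecs / scale) @ eigvecs.T
    return mean, whiten, unwhiten


def apply_whitening(x, mean, whiten):
    """Map raw activations into the whitened space (SAE input)."""
    return (x - mean) @ whiten


def undo_whitening(z, mean, unwhiten):
    """Map whitened vectors (e.g. SAE reconstructions) back to the raw space."""
    return z @ unwhiten + mean
```

The idea would be to estimate the statistics once on a sample of activations, whiten the SAE inputs during training, and apply the inverse transform to reconstructions so they can still be compared against (or spliced back into) the original activations.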