I implemented it in NumPy first and the result was extremely slow, especially since I had to compute derivatives via finite differences. The performance gain from switching to Theano on the CPU was at least a hundredfold. I am not sure how much of that comes from Theano's general optimizations and how much from having an analytical gradient. I suspect it runs slower on the GPU because the batch size is relatively small (for a 30-step color gradient, only 90 parameters!), but I haven't checked the relative performance at absurd numbers of steps yet.
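For concreteness, here is a minimal sketch of the comparison being described, with a toy quadratic loss standing in for the project's actual color-gradient objective (`loss_np`, `grad_finite_diff`, and the 90-parameter example are illustrative only, not code from this repo):

```python
import numpy as np
import theano
import theano.tensor as T

# Toy quadratic loss standing in for the real color-gradient objective.
def loss_np(params):
    return np.sum(params ** 2)

def grad_finite_diff(params, eps=1e-6):
    # Finite differences: one extra loss evaluation per parameter.
    base = loss_np(params)
    grad = np.empty_like(params)
    for i in range(params.size):
        bumped = params.copy()
        bumped[i] += eps
        grad[i] = (loss_np(bumped) - base) / eps
    return grad

# Theano: the gradient is derived symbolically once, then compiled,
# so each later call is a single optimized pass instead of N+1 passes.
p = T.dvector('p')
loss_t = T.sum(p ** 2)
grad_t = T.grad(loss_t, p)
loss_and_grad = theano.function([p], [loss_t, grad_t])

params = np.random.randn(90)  # e.g. 30 color steps x 3 channels
print(grad_finite_diff(params)[:3])
print(loss_and_grad(params)[1][:3])
```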
At 16384 steps the GPU starts to close the gap: the default gradient took 23 seconds to converge on the GPU vs. 20 seconds on the CPU. I am inclined to think batch size is the answer here; GPUs are massively parallel and quite inefficient on small amounts of data.
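A rough way to check that hypothesis would be a timing harness along these lines, run once with `THEANO_FLAGS=device=cpu` and once with the GPU device flag (`device=gpu`, or `device=cuda` on the newer gpuarray backend). The smoothness-penalty loss and step counts below are placeholders, not the project's actual objective:

```python
import time

import numpy as np
import theano
import theano.tensor as T

# Toy smoothness penalty in place of the project's actual objective.
p = T.vector('p')
loss = T.sum(T.sqr(p[1:] - p[:-1]))
grad_fn = theano.function([p], T.grad(loss, p))

for n_steps in (30, 1024, 16384):
    params = np.random.randn(n_steps * 3).astype(theano.config.floatX)
    grad_fn(params)  # warm-up: compilation and (on GPU) host-to-device transfer
    start = time.time()
    for _ in range(100):
        grad_fn(params)
    print('%d steps: %.3f s for 100 gradient evaluations'
          % (n_steps, time.time() - start))
```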
Hi @crowsonkb/Katherine,
I wanted to discuss performance here: what are the gains compared to a classic NumPy implementation, especially in regard to the comments above?
Cheers,
Thomas