Hi @crowsonkb/Katherine, I wanted to discuss performance here, what are the gain compared to a classic Numpy implementation, especially in regard to the following comment: > Theano can compile it to use a GPU but this was found to run slower. Cheers, Thomas