
FFT for convolution #43

Open
ViliamVadocz opened this issue Apr 17, 2021 · 8 comments

Comments

@ViliamVadocz

I wanted to use this library for a CNN. After a quick look at the source, it doesn't look like you use the FFT algorithm for fast convolution. It is probably worth implementing.

@xd009642
Member

Not currently; I went for a simpler implementation to start with. Also, I'm not sure how well FFT implementations perform for different-sized inputs, or whether it will really lead to gains for common use cases, i.e. the default filters implemented in this crate 🤔.

I've got a lot on my plate right now, so I probably won't have time to work on this. However, I'm always open to PRs and can offer reviews and guidance. Any PR would also need benchmarks included to help measure the performance change. Alternatively, maybe someone on the cv discord would be willing to help. I'll post a link to this issue there https://discord.gg/N82kexYM

@xd009642
Member

A comment on the discord:

I believe it reduces the complexity mainly when matrix sizes get bigger. The price of the FFT is N log(N), and kernel convolution then becomes element-wise multiplication, so N. Meanwhile, basic convolution with a kernel containing K elements (say K = 5x5 = 25) involves N * K multiplications. So as soon as K reaches log(N) it's worth it. Imagine we have a full HD image, so about 2 million pixels. log(2,000,000) ≈ 14.5, which is roughly a 4x4 kernel. So basically, as soon as your convolution kernel gets bigger than 3x3 it's worth it (roughly).
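To make the N log(N) path concrete, here is a minimal 1-D sketch of FFT-based convolution in Rust using the rustfft crate. It's illustrative only (an image crate would want a 2-D version and real-valued transforms for speed); it is not ndarray-vision's API:

```rust
// Assumes rustfft = "6" as a dependency.
use rustfft::{num_complex::Complex, FftPlanner};

/// Linear convolution of `signal` with `kernel` via the FFT: O(N log N)
/// instead of the O(N * K) of the direct approach.
fn fft_convolve(signal: &[f32], kernel: &[f32]) -> Vec<f32> {
    let out_len = signal.len() + kernel.len() - 1;
    // Zero-pad both inputs to the full output length so the circular
    // convolution the FFT computes matches the linear convolution we want.
    let pad = |data: &[f32]| -> Vec<Complex<f32>> {
        data.iter()
            .map(|&x| Complex::new(x, 0.0))
            .chain(std::iter::repeat(Complex::new(0.0, 0.0)))
            .take(out_len)
            .collect()
    };
    let (mut a, mut b) = (pad(signal), pad(kernel));

    let mut planner = FftPlanner::<f32>::new();
    planner.plan_fft_forward(out_len).process(&mut a);
    planner.plan_fft_forward(out_len).process(&mut b);

    // Convolution in the spatial domain is element-wise multiplication
    // in the frequency domain.
    for (x, y) in a.iter_mut().zip(&b) {
        *x *= *y;
    }

    planner.plan_fft_inverse(out_len).process(&mut a);
    // rustfft does not normalise, so divide by the transform length.
    a.iter().map(|c| c.re / out_len as f32).collect()
}
```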

So the original implementation should stay, but maybe be renamed to something like SpatialConvolutionExt; then add an FftConvolutionExt, and have ConvolutionExt pick the appropriate one based on input sizes etc. A rough sketch of that split is below.
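The trait names here follow the suggestion above, but the method signatures and the `num_pixels` helper are made up for illustration; this is not ndarray-vision's actual API:

```rust
use ndarray::{Array2, ArrayView2};

// Hypothetical trait split; names come from the suggestion above,
// signatures are purely illustrative.
trait SpatialConvolutionExt {
    fn spatial_conv(&self, kernel: ArrayView2<f32>) -> Array2<f32>;
}

trait FftConvolutionExt {
    fn fft_conv(&self, kernel: ArrayView2<f32>) -> Array2<f32>;
}

trait ConvolutionExt: SpatialConvolutionExt + FftConvolutionExt {
    /// Number of pixels in the image (illustrative helper).
    fn num_pixels(&self) -> usize;

    /// Dispatch using the rule of thumb above: the FFT path wins roughly
    /// once the kernel has more elements than log(N), N being the pixel count.
    fn conv(&self, kernel: ArrayView2<f32>) -> Array2<f32> {
        if (kernel.len() as f64) > (self.num_pixels() as f64).ln() {
            self.fft_conv(kernel)
        } else {
            self.spatial_conv(kernel)
        }
    }
}
```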

@TYPEmber

TYPEmber commented May 2, 2024

Hey, I've made a crate that provides N-dimensional FFT-accelerated convolution for ndarray.
Maybe ndarray-vision could depend on my crate.
https://crates.io/crates/ndarray-conv
And here are the benchmark results:
[benchmark image]

@TYPEmber

TYPEmber commented May 2, 2024

ndarray-conv also provides normal (spatial) convolution and a variety of padding and convolution modes.

@xd009642
Member

xd009642 commented May 2, 2024

@TYPEmber I've got no issues with depending on your crate; would you be willing to help out with a PR? 👀

@TYPEmber

TYPEmber commented May 2, 2024

Of course, but would you like to expose ndarray-conv's interface directly?

@xd009642
Member

xd009642 commented May 2, 2024

I've had a brief look and yeah, I'm happy to expose the interface directly. It'll be a minor version bump when it's released, but that's fine.

@TYPEmber

TYPEmber commented May 2, 2024

OK, I will try to make a PR soon.
