-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is libdnn support Mali gpu? #17
Comments
@zhenghuitian Other than that, you mainly have to find out the required FLOPS per global memory read/write to fully occupy the chip, as well as memory reading/writing strides for the individual threads. Now, GEMM and convolution are quite difficult to get exactly right, for element-wise operations it's much easier. LibDNN is mainly developed for desktop class GPUs (AMD RX480, W9100 and nVidia GTX980, 1080) at the moment. |
@naibaf7 thank you. |
@naibaf7 but I do not understand the meaning of what you say "you mainly have to find out the required FLOPS per global memory read/write to fully occupy the chip, as well as memory reading/writing strides for the individual threads." and how to do that? |
I've opened a specific issue at #18 |
I use caffe-opencl with Mali gpu but I seems that libdnn can not support Mali.
Actually, I want to make some optimization in opencl kernel for some operate such as element-wise multiplication. You have do some memory optimization in libdnn of opencl kernel. But as I know, the memory of opencl in mali just use CL_MEM_ALLOC_HOST_PTR .. for cpu data.
would you tell me the method libdnn use for memory optimization or show me some resources about this.
thank you
The text was updated successfully, but these errors were encountered: