Skip to content

CUDA matrix multiplication, reduction, and softmax kernels optimized for my RTX 4070 in C++17

Notifications You must be signed in to change notification settings

ajagtapdev/kernel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CUDA Kernels

Build with cmake in cuda/ directory.

About

CUDA matrix multiplication, reduction, and softmax kernels optimized for my RTX 4070 in C++17

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published