Hardware engineer exploring GPUs, CUDA and ML systems
I’m a hardware engineer who likes understanding systems from the ground up.
My background sits in CPU design, verification and memory systems.
Lately I’ve been diving into GPU compute and deep learning workloads to connect the dots between hardware and software performance.
I’m running a personal challenge to sharpen my understanding of GPU kernels and performance behavior.
Repo here:
https://github.com/theRTLmaker/CUDA_in_100_days
- CUDA programming
- GPU memory behavior and profiling
- Deep learning systems and training loops
- How ML frameworks interact with hardware
- CPU design and verification
- SystemVerilog, C++ and Python
- Cache behavior, coherency and memory systems
- ML workflow fundamentals and performance intuition
- GPU programming
- ML systems
- Hardware architecture
- Low level debugging
- Performance tuning
Happy to talk about GPUs, CPUs, ML systems or hardware architecture.
