-
Building efficient algorithms for training Machine Learning models [1]
-
Developing a cross-platform 128-bit floating-point dtype for NumPy
- ML Systems and Compilation
- GPU programming and optimizations
- Distributed Systems & large-scale compute
For discussions, ping me on X/swayaminsync or email at [email protected]




