
Except that CUDA is low level, so it's not hard to shim above it and write interoperable code. There are too many players who don't want to pay the Nvidia tax; this will play out like OpenGL vs Direct3D in reverse.


> this will play out like OpenGL vs Direct3D in reverse

Is that also like OpenCL vs CUDA in reverse?


I feel like abstractions don't work if we want maximum performance. AFAIK Tensor Cores are not usable from OpenCL, and even within the CUDA universe, cuBLAS (hand-optimized) seems to outperform CUTLASS (built on abstractions).



