CUDA|Triton|TensorRT
Learn CUDA Programming, published by Packt
Learn CUDA Programming, published by Packt
Sample codes for my CUDA programming book
我的教程专栏,你将绝对能实现CUDA工程化,完全从环境安装到CUDA核函数编程,从核函数到使用相关内存优化,从内存优化到深度学习算子开发(如:nms),从算子优化到模型(以yolo系列为基准)部署。最重要的是,我的教程将简单明了直切主题,CUDA理论与实战实例应用,并附相关代码,可直接上手实战。我的想法是掌握必要CUDA相关理论,去除非必须繁杂理论,实现CUDA算法应用开发,待进一步提高,将…
An x64 assembly implementation of essential data structures and algorithms for DSA.
Hands-On GPU Accelerated Computer Vision with OpenCV and CUDA, published by Packt
Hands-On GPU Accelerated Computer Vision with OpenCV and CUDA, published by Packt
Hands-On GPU Programming with Python and CUDA, published by Packt
Hands-On GPU Programming with Python and CUDA, published by Packt
关于书籍CUDA Programming使用了pycuda模块的Python版本的示例代码
关于书籍CUDA Programming使用了pycuda模块的Python版本的示例代码
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework.
A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework.
bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码
bilibili视频【CUDA 12.x 并行编程入门(Rust版)】配套代码
bilibili视频【CUDA 12.x 并行编程入门(Python版)】配套代码
A CUDA tutorial to make people learn CUDA program from 0