Stars

C++

616 repositories

A C++ header-only Eigen-based Library for Lie group operations

C++ 278 16 Updated Jan 12, 2025

A C++ header-only Eigen-based Library for Lie group operations

C++ 1 Updated Jan 12, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,489 132 Updated Aug 4, 2025

Python bindings for llama.cpp

Python 9,405 1,185 Updated Jul 18, 2025

Python bindings for llama.cpp

Python 1 Updated Jan 29, 2025

GPU Kernels

Cuda 191 16 Updated Apr 27, 2025

100 days of building GPU kernels!

Cuda 479 49 Updated Apr 27, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,782 1,058 Updated Aug 2, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 1 Updated Feb 13, 2025

Large-scale LLM inference engine

C++ 1,495 160 Updated Aug 4, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 1 Updated May 28, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 5,863 618 Updated Aug 1, 2025

Examples from Programming in Parallel with CUDA

Cuda 158 57 Updated Mar 17, 2023

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

Cuda 287 58 Updated Jun 13, 2025

HPC tutorial covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more

Cuda 27 5 Updated Jul 24, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1 Updated Jul 10, 2025