Productive, portable, and performant GPU programming in Python.
-
Updated
Mar 8, 2025 - C++
Productive, portable, and performant GPU programming in Python.
Open3D: A Modern Library for 3D Data Processing
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
CUDA Templates for Linear Algebra Subroutines
Mesh optimization library that makes meshes smaller and faster to render
a language for fast, portable data-parallel computation
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架
ArrayFire: a general purpose GPU library.
cuML - RAPIDS Machine Learning Library
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
Lightning fast C++/CUDA neural network framework
HeavyDB (formerly OmniSciDB)
On-device AI across mobile, embedded and edge for PyTorch
Deep Learning API and Server in C++14 support for PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Add a description, image, and links to the gpu topic page so that developers can more easily learn about it.
To associate your repository with the gpu topic, visit your repo's landing page and select "manage topics."