-
cudarc
Safe and minimal CUDA bindings
-
nvml-wrapper
A safe and ergonomic Rust wrapper for the NVIDIA Management Library
-
lru-slab
Pre-allocated storage with constant-time LRU tracking
-
vulkano
Safe wrapper for the Vulkan graphics API
-
femtovg
Antialiased 2D vector drawing library
-
whisper-rs
Rust bindings for whisper.cpp
-
opencl3
Khronos OpenCL 3.0 API and extensions
-
neptune
Poseidon hashing over BLS12-381 for Filecoin
-
cl3
Khronos OpenCL 3.0 API and extensions
-
pixels
A tiny library providing a GPU-powered pixel frame buffer
-
ocl
OpenCL bindings and interfaces for Rust
-
trueno
High-performance SIMD compute library with GPU support for matrix operations
-
cubecl
Multi-platform high-performance compute language extension for Rust
-
gpu-descriptor
agnostic descriptor allocator for Vulkan like APIs
-
xilem
A next-generation cross-platform Rust UI framework
-
trueno-viz
SIMD/GPU/WASM-accelerated visualization library for data science and ML
-
vhost-device-gpu
A virtio-gpu device using the vhost-user protocol
-
gpu-alloc
agnostic memory allocator for Vulkan like APIs
-
quant-iron
high-performance, hardware-accelerated modular quantum computing library with a focus on physical applications. Quant-Iron provides tools to represent quantum states, apply standard quantum gates…
-
envy-tui
TUI manager for EnvyControl - GPU switching for Nvidia Optimus laptops
-
diffusionx
A multi-threaded crate for random number generation and stochastic process simulation, with optional GPU acceleration
-
qmassa
Terminal-based tool for displaying GPUs usage stats on Linux
-
GORBIE
GORBIE! Is a minimalist notebook library for Rust
-
inlyne
Introducing Inlyne, a GPU powered yet browserless tool to help you quickly view markdown files in the blink of an eye
-
tenflowers-autograd
Automatic differentiation engine for TenfloweRS
-
mosec
Model Serving made Efficient in the Cloud
-
ash_renderer
Vulkan renderer in Rust using ASH - ECS-free, pure rendering engine
-
beamterm-renderer
High-performance WebGL2 terminal renderer for beamterm, targeting sub-millisecond render times in web browsers
-
tenflowers-neural
Neural network layers, models and training APIs for TenfloweRS
-
gameboy
emulator written in Rust and WebAssembly
-
neofetch
-
jolt-platform
Cross-platform battery and power monitoring for jolt
-
mtl-gpu
Rust bindings to Apple's Metal framework
-
kubectl-view-allocations
kubectl plugin to list allocations (cpu, memory, gpu,...) X (utilization, requested, limit, allocatable,...)
-
repartir
Sovereign AI-grade distributed computing primitives for Rust (CPU, GPU, HPC)
-
oxifetch
program that displays key system information, such as OS details, uptime, CPU specs, memory usage, and more. The output includes an ASCII art logo and a quick overview of your machine's current status.
-
gpu-trace-perf
Plays a collection of GPU traces under different environments to evaluate driver changes on performance
-
glcore-rs
The OpenGL core functions for Rust, also supports OpenGL ES
-
par-term
Cross-platform GPU-accelerated terminal emulator with inline graphics support (Sixel, iTerm2, Kitty)
-
cubecl-cpp
CPP transpiler for CubeCL
-
easy-async-opencl3
A declarative, multi-device asynchronous executor for OpenCL based on cl3
-
turboplot
A blazingly fast waveform renderer made for visualizing huge traces
-
beamterm-atlas
Font atlas generator for beamterm WebGL terminal renderer, creating GPU-optimized texture arrays from TTF/OTF fonts
-
pathfinder_geometry
Basic SIMD-accelerated geometry/linear algebra
-
ringkernel-cuda
CUDA backend for RingKernel - NVIDIA GPU support via cudarc
-
tenflowers-ffi
Python bindings for TenfloweRS
-
tenflowers-dataset
Data pipeline and dataset utilities for TenfloweRS
-
spirv-std
Standard functions and types for SPIR-V
-
mtl-sys
Low-level Objective-C runtime bindings for Metal
-
numr
High-performance numerical computing with multi-backend GPU acceleration (CPU/CUDA/WebGPU)
-
gllm-kernels
Low-level attention kernels for gllm with CUDA/ROCm support
-
tracy_full
Fully featured bindings for the Tracy profiler
-
runmat-gc-api
Public API types for the RunMat garbage collector
-
mtl-foundation
Foundation framework bindings (NSObject, NSString, NSArray, etc.)
-
runmat-kernel
Jupyter kernel implementation for RunMat using ZeroMQ and the Jupyter protocol
-
nvml-wrapper-sys
Generated bindings to the NVIDIA Management Library
-
hybrid-predict-trainer-rs
Hybridized predictive training framework with warmup, full-train, predict, and residual correction phases for accelerated deep learning
-
vkcore-rs
The Vulkan core functions for Rust
-
keplemon
Expanded functionality for the Standardized Astrodynamics Algorithms Library (SAAL)
-
runpod
client for the RunPod API
-
oversee
A modern system monitor for macOS with Apple Silicon GPU support
-
rtop
A system monitor implemented in Rust, Monitors both system activity and GPU activity for NVIDIA GPUs
-
ann-search-rs
Various approximate nearest neighbour searches in Rust with emphasis for single cell applications
-
kronos-compute
A high-performance compute-only Vulkan implementation with cutting-edge GPU optimizations
-
burn-vision
Vision processing operations for burn tensors
-
async-cuda
Async CUDA for Rust
-
wa
Cross-platform window assistant made primarily for Rio terminal
-
mtl-fx
MetalFX bindings for AI upscaling and frame interpolation
-
ghostflow-cuda
CUDA backend for GhostFlow ML framework
-
trueno-db
GPU-first embedded analytics database with SIMD fallback and SQL query interface
-
ringkernel-ecosystem
Ecosystem integrations for RingKernel - actors, web frameworks, data processing, ML
-
archx
High-performance CPU/GPU adaptive optimization library with SIMD and Multithreading
-
halldyll_starter_runpod
managing RunPod GPU pods - Provisioning, orchestration & state management
-
runmat-turbine
Cranelift-based JIT compiler for RunMat - the optimizing tier of our V8-inspired execution model
-
silicon-monitor
Silicon Monitor: Comprehensive hardware monitoring for CPUs, GPUs, NPUs, memory, I/O, and network silicon across all platforms
-
hw_dcmi_wrapper
A safe and ergonomic Rust wrapper for the Huawei DCMI API
-
runmat-builtins
RunMat built-in functions and standard library components
-
ec-gpu
Traits for field and eliptic curve operations on GPUs
-
par-fractal
Cross-platform GPU-accelerated fractal renderer with 2D and 3D support
-
image-colorizer
Never settle for images outside your colorscheme again!
-
archetype_asset
Fast, modular asset system with spatial preloading
-
cubemoma
A multi-word modular arithmetic library based on CubeCL
-
ringkernel-wavesim
Interactive 2D wave propagation showcase for RingKernel
-
nnl
A high-performance neural network library for Rust with CPU and GPU support
-
rocm_smi_lib
easy to use crate for using rocm-smi from rust
-
opencl-sys
OpenCL C FFI bindings for the Rust programming language
-
rust-ai-core
Unified AI engineering toolkit: orchestrates peft-rs, qlora-rs, unsloth-rs, axolotl-rs, bitnet-quantize, trit-vsa, vsa-optim-rs, and tritter-accel
-
async-tensorrt
Async TensorRT for Rust
-
ringkernel-metal
Metal backend for RingKernel - Apple GPU support
-
trtx
Safe Rust bindings to NVIDIA TensorRT-RTX (EXPERIMENTAL - NOT FOR PRODUCTION)
-
wgpu_render_manager
Cached Render/Compute Manager for wgpu (pipelines + bind groups + procedural textures automated)
-
pylate-rs
WebAssembly library for late interaction models
-
horizon-lattice-render
Graphics rendering backend for Horizon Lattice using wgpu
-
vkfetch-rs
fetch-program that displays basic information about your vulkan-compatible graphic card(s)!
-
xdl-amp
Multi-backend GPU/ML acceleration for XDL
-
ha-ndarray
A hardware-accelerated n-dimensional array
-
nviwatch
A blazingly fast Rust-based TUI for managing and monitoring NVIDIA GPU processes
-
optirs
Advanced ML optimization and hardware acceleration library (main integration crate)
-
hanzo-pqc
Post-quantum cryptography primitives (ML-KEM, ML-DSA, SLH-DSA) for Hanzo ecosystem
-
cuvs
RAPIDS vector search library
-
ringkernel-wavesim3d
3D acoustic wave simulation with realistic physics, binaural audio, and GPU acceleration
-
bitnet-metal
Metal GPU acceleration for BitNet on Apple Silicon
-
unmtx-gpu
Micro matrix library for neural networks that uses GPU
-
burn-wgpu
WGPU backend for the Burn framework
-
images_and_words
GPU middleware and abstraction layer for high-performance graphics applications and games
-
trueno-graph
GPU-first embedded graph database for code analysis (call graphs, dependencies, AST traversals)
-
trueno-zram-adaptive
ML-driven compression algorithm selection for trueno-zram
-
cudaforge
Advanced CUDA kernel builder for Rust with incremental builds, auto-detection, and external dependency support
-
vkobject-rs
The Vulkan object wrappers for Rust
-
sevensense-embedding
Embedding bounded context for 7sense bioacoustics - Perch 2.0 ONNX integration
-
host_discovery
host discovery
-
nam-ec-gpu-gen
Code generator for field and elliptic curve operations on the GPUs
-
opentelemetry-system-metrics
System metric export through Opentelemetry
-
runmat-accelerate
Pluggable GPU acceleration layer for RunMat (CUDA, ROCm, Metal, Vulkan/Spir-V)
-
ec-gpu-gen
Code generator for field and eliptic curve operations on the GPUs
-
ringkernel-ir
Intermediate Representation for RingKernel GPU code generation
-
burn-cubecl
Generic backend that can be compiled just-in-time to any shader language target
-
memkit-gpu
Backend-agnostic GPU memory management for memkit
-
cuda-rust-wasm
CUDA to Rust transpiler with WebGPU/WASM support
-
gatenative
execute natively Gate circuits
-
skia-graphics-rs
High-performance 2D graphics library built on Skia with GPU acceleration
-
hardware-query
Cross-platform Rust library for comprehensive hardware detection, real-time monitoring, power management, and AI/ML optimization
-
flashlight_tensor
gpu/cpu tensor library focused around matrix and neural network operations
-
ringkernel-cuda-codegen
CUDA code generation from Rust DSL for RingKernel stencil kernels
-
feagi-npu-burst-engine
High-performance burst engine for FEAGI neural processing
-
memkit-co
CPU-GPU memory coordination for the memkit ecosystem
-
ringkernel-cpu
CPU backend for RingKernel - testing and fallback implementation
-
with-gpu
Intelligent GPU selection wrapper for CUDA commands
-
rdpe
Reaction Diffusion Particle Engine - GPU particle simulations made easy
-
cuneus
A WGPU-based shader development tool
-
mtl-quartz-core
QuartzCore/CoreAnimation bindings (CAMetalLayer, CAMetalDrawable)
-
tensor_frame
A PyTorch-like tensor library for Rust with CPU, WGPU, and CUDA backends
-
kitsune-stt
Speech-to-Text tool using Candle and Voxtral
-
ringkernel-wgpu
WebGPU backend for RingKernel - cross-platform GPU support
-
autograph
A machine learning library for Rust
-
goldy
Modern Graphics Library
-
optirs-gpu
OptiRS GPU acceleration and multi-GPU optimization
-
nvglances
A TUI system monitor with support for NVIDIA GPUs (CUDA/NVML) and Apple Silicon GPUs (Metal)
-
feather-ui
Feather UI library
-
chromapath
GPU-accelerated path tracer implementing 'Ray Tracing in One Weekend' with CPU, Vulkan compute, and hardware ray tracing backends
-
dora-ros2-bridge
ROS2 bridge for dora-rs
-
runpod-sdk
Unofficial Rust SDK for RunPod: deploy and scale GPU workloads with serverless endpoints and on-demand pods
-
burn-mlx
MLX backend for Burn deep learning framework - native Apple Silicon GPU acceleration
-
rocm-rs
Rust bindings for AMD ROCm libraries
-
tensorlogic-scirs-backend
SciRS2-powered tensor execution backend for TensorLogic
-
kn-cuda-eval
A CUDA executor for neural network graphs
-
ringkernel
GPU-native persistent actor model framework - Rust port of DotCompute Ring Kernel
-
burn-cubecl-fusion
Provide optimizations that can be used with cubecl based backends
-
sass-assembler
SASS (NVIDIA GPU) assembler for Gaia project
-
haagenti-cuda
CUDA GPU decompression kernels for Haagenti tensor compression
-
rustkernels
GPU-accelerated kernel library for financial services, analytics, and compliance workloads
-
cubecl-cuda
CUDA runtime for CubeCL
-
hive-gpu
High-performance GPU acceleration for vector operations with Device Info API (Metal, CUDA, ROCm)
-
trueno-gpu
Pure Rust PTX generation for NVIDIA CUDA - no LLVM, no nvcc
-
gpuinfo
A minimal command-line utility for querying GPU status
-
krnl
Safe, portable, high performance compute (GPGPU) kernels
-
obrah
High-level Rust wrapper around OpenCL using OBWIO
-
candle-cuda-vmm
CUDA Virtual Memory Management bindings for elastic KV cache allocation in Candle
-
nvidia-gpu-exporter
Prometheus exporter for NVIDIA GPUs using NVML
-
vulkano-util
functionality to make usage of vulkano easier
-
cuda_setup
Assists with CUDA setup when using the CUDARC lib
-
burn-cuda
CUDA backend for the Burn framework
-
halldyll_deploy_pods
Declarative, idempotent, and reconcilable deployment system for RunPod GPU pods
-
job_config
Configuration parser for Silva workflow jobs with TOML support
-
glitcher
Real-time visual effects engine for VJ performance
-
tenflowers
Pure Rust implementation of TensorFlow - A comprehensive deep learning framework
-
pixstage
A canvas for drawing pixels
-
crystal-api
Crystal API is a unified GPU API's wrapper
-
daedalus-gpu
GPU backends and WGSL dispatch helpers for the Daedalus runtime
-
libinfer
Rust interface to TensorRT for high-performance GPU inference
-
crystal-vk
Graphics wrapper for Vulkan
-
basis-universal
Bindings for the basis-universal Supercompressed GPU Texture Codec by Binomial
-
canvas-renderer
Custom minimal renderer for Saorsa Canvas built on wgpu. Provides GPU rendering with WebGL2/2D fallbacks.
-
cubecl-hip
AMD ROCm HIP runtime for CubeCL
-
cubek-reduce
CubeK: Reduce Kernels
-
vulkano-shaders
Macro for generating Rust code from shaders
-
beamterm-data
Core data structures and binary serialization for the beamterm WebGL terminal renderer
-
virglrenderer
Safe and idiomatic Rust wrapper for virglrenderer
-
multi-agent-engine
A concurrent library for building multi-agent simulators with lock-free CPU/GPU hybrid execution
-
cubecl-common
Common crate for CubeCL
-
bevy_open_vat
OpenVAT (Vertex Animation Texture) plugin for Bevy
-
est-render
Easy to use winit, softbuffer & wgpu abstractions
-
torsh-distributed
Distributed training and inference for ToRSh
-
sysutil
Linux system utils library
-
rustkernel-core
Core abstractions, traits, and registry for RustKernels GPU kernel library
-
qook
Pure Rust unitaryfund/qrack Wrapper
-
vk-sys
Bindings for the Vulkan graphics API
-
cellcast
A recast of cell segmentation models built on the Burn deep learning framework
-
mltop
Resource monitor for ML engineers written in Rust
-
bevy_app_compute
An easy way to run compute shaders for Bevy
-
runmat-repl
Interactive RunMat REPL with line editing and Jupyter kernel integration
-
mushin
Computational graphs with reverse automatic differentation in the GPU
-
slosh3d
Cross-platform GPU 3D Material Point Method implementation
-
orb8
eBPF-powered observability toolkit for Kubernetes with GPU telemetry
-
cubecl-wgpu
WGPU runtime for the CubeCL
-
runmat-runtime
Core runtime for RunMat with builtins, BLAS/LAPACK integration, and execution APIs
-
gni
create a GPU Node ID
-
slosh2d
Cross-platform GPU 2D Material Point Method implementation
-
kn-runtime
Dynamic wrapper around CPU and GPU inference
-
tsai_compute
Heterogeneous compute abstraction layer for tsai-rs
-
chalamet_pir
Stateful, Single-Server Private Information Retrieval for Key-Value Databases
-
opencl-heads
Actively Maintained with Long Term Support, Pure Rust Implementation, Low Level bindings for OpenCL
-
supasim
A compute-oriented GPU usage library
-
supasim-dev-utils
A set of utilities used internally by supasim, such as for testing
-
socktop_agent
Socktop agent daemon. Serves host metrics over WebSocket.
-
ringkernel-wgpu-codegen
WGSL code generation from Rust DSL for RingKernel
-
cubecl-opt
Compiler optimizations for CubeCL
-
yoinky
TUI tool for monitoring system resources like CPU, RAM, and GPU
-
burn-rocm
ROCm HIP backend for the Burn framework
-
tsai_cli
Command-line interface for tsai-rs time series deep learning
-
wgpu_struct
A wgsl data encoding and decoding library
-
obwio
OpenCL bindings and runtime utilities for Rust
-
cubecl-ir
Intermediate representation for CubeCL
-
nvidiagpu_top
A TUI for monitoring NVIDIA GPU metrics
-
astrelis-render
Astrelis Core Rendering Module
-
trueno-ublk
Pure Rust ublk block device for compressed RAM storage (educational - use kernel zram for production)
-
rustc_codegen_spirv
SPIR-V code generator backend for rustc
-
below-gpu-stats
GPU stats crate for below
-
gpu-mumu
GPU/Vulkan matrix and tensor operations for the mumu/lava language
-
lambdaworks-gpu
Modular math library for cryptography - GPU implementation
-
luminal_cudarc
Safe wrappers around CUDA apis
-
hodu_metal_kernels
hodu metal kernels
-
gpu-alloc-vulkanalia
vulkanaliabackend forgpu-alloc -
sonr-daemon
Background daemon for sonr, providing semantic search capabilities using local LLMs
-
cl-sys
OpenCL FFI bindings
-
globject-rs
OpenGL Object Wrapper for Rust
-
pathfinder_content
Vector path utilities for the Pathfinder rendering library
-
ringkernel-cli
CLI tool for RingKernel project scaffolding, kernel code generation, and profiling
-
servo/pathfinder_swf
A fast, practical GPU rasterizer for fonts and vector graphics
-
femto-gpt
Minimal Rust library for training GPT language-models
-
paraxis
A maths library for doing calculations on spatial data
-
gpu-alloc-ash
ashbackend forgpu-alloc -
daedalus-wgsl-infer
WGSL binding/workgroup inference helpers for Daedalus GPU tooling
-
tract-gpu
Tiny, no-nonsense, self contained, TensorFlow and ONNX inference
-
wgcore
abstractions for composable WGSL shaders
-
cubecl-spirv
SPIR-V compiler for CubeCL
-
haagenti-serverless
Serverless deployment with cold start optimization and GPU memory management
-
xnn
A lightweight ML framework with GPU-first architecture
-
nexus3d
Cross-platform 3D rigid-body physics
-
socktop
Remote system monitor over WebSocket, TUI like top
-
ranim-render
The renderer of Ranim
-
fly-sdk
Unofficial Fly.io SDK for Rust
-
raybnn
RayBNN
Try searching with DuckDuckGo.