Performance-portable, length-agnostic SIMD with runtime dispatch
-
Updated
Nov 22, 2024 - C++
Performance-portable, length-agnostic SIMD with runtime dispatch
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
TensorFlow binaries supporting AVX, FMA, SSE
The Vector Optimized Library of Kernels
SIMD Vector Classes for C++
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
A simple C library for compressing lists of integers using binary packing
TensorFlow binaries supporting AVX, FMA, SSE
High performance algorithms in C#: SIMD/SSE, multi-core and faster
Agenium Scale vectorization library for CPUs and GPUs
Fast decoder for VByte-compressed integers
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
DSP library for signal processing
UME::SIMD A library for explicit simd vectorization.
A fast implementation of single-pattern substring search using SIMD acceleration.
Fast random number generators: Vectorized (SIMD) version of xorshift128+
High-performance dictionary coding
Fast differential coding functions (using SIMD instructions)
Fast C header-only library for popcnt, pospopcnt, and set algebraic operations
Add a description, image, and links to the simd-instructions topic page so that developers can more easily learn about it.
To associate your repository with the simd-instructions topic, visit your repo's landing page and select "manage topics."