Performance-portable, length-agnostic SIMD with runtime dispatch
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
A header only library implementing common mathematical functions using SIMD intrinsics
x86-64, ARM, and RVV intrinsics viewer
DR3 enables users to write vectorised code using generic lambdas and filters. Switch instruction set just by changing enclosing namespace
miniRT is the final C project of the 42 Common Core: our very first ray-tracer. Our miniRT focused on optimising CPU-rendered graphics, to achieve a real-time renderer with movement controls and extra...
🎹 Unified implementation of SIMD intrinsic functions and fallback implementations on hardware that does not natively support them, as an extension to Swift Numeric types.
C++ template for generating small sorting networks compatible with SIMD intrinsics
A High Performance C# wrapper that allows you to get the benefits of SIMD Intrinsics on List<T>.
C++ interface for SIMD instruction sets
Vectroized String Helper Functions
Winning submission for StartHack 2024: HPC optimized multi-GPU/CPU inference
Simple neural network microkernels in C accelerated with ARMv8.2-a Neon vector intrinsics.
PrySec - Privacy & Security framework for your .NET applications