A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Google Bing GitHub
Automatically exported from code.google.com/p/sse2neon
A simple and fast linear algebra library for games and graphics
SSE2 Optimized GLSL-like math library
Unusual uses of SSE2 registers
SSE2 implementations of sin, cos, exp, log, tan, cot, atan, atan2
Porting the SSE instructions to ARM NEON instrctions.
Go Rabin Fingerprinting (SSE2)
WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Clickhouse, Redpanda, Kong, Telegram, Datadog and Cloudflare Workers.
MMX/SSE/SSE2/SSE4/AVX/AVX2/AVX512 optimization
3D Haar Descrete Wavelet Transform C++11 library (using OpenMP and SSE/SSE2/SSE3/AVX).
x100 faster implementation of GOST 34.12-2015 Kuznyechik optimized for high throughput and low latency on SSE2-capable CPUs
Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64, POWER. Part of Node.js, WebKit/Safari, Ladybi...