Search results for "avx-512"

kfr

@kfrlib

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

C++ 1.67k

8 days ago

sse2 avx2 information-retrieval sse avx ndjson sorting-algorithms string substring string-manipulation

Simd

Ihar Yermalayeu@ermig1979

C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.

sse avx altivec vsx sse2

C++ 2.07k

2 days ago

MIPP

@aff3ct

Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).

C++ 484

18 days ago

xbyak

MITSUNARI Shigeo@herumi

A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2

C++ 2.06k

18 days ago

avx-turbo

Travis Downs@travisdowns

Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts

C++ 191

1 month ago

CRoaring

@RoaringBitmap

Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, and StarRocks

C 1.58k

7 days ago

Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F

@yzhaiustc

Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.

C 116

3 years ago

blake3

Luke Champine@lukechampine

An AVX-512 accelerated implementation of the BLAKE3 cryptographic hash function

Assembly 363

7 months ago

base64-avx512

@WojciechMula

Code for paper "Base64 encoding and decoding at almost the speed of a memory copy"

C 198

5 years ago

Intel-AVX512-Brief-Introduction

@zenny-chen

A brief introduction to Intel AVX-512

C 36

1 year ago

simdutf

@simdutf

Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64. Part of Node.js, WebKit/Safari, Ladybird, Clo...

C++ 1.18k

17 hours ago

tensorflow-build-archived

@lakshayg

TensorFlow binaries supporting AVX, FMA, SSE

Shell 1.91k

5 years ago

js-sha512

@emn178

Simple SHA-512, SHA-384, SHA-512/224, and SHA-512/256 hash functions for JavaScript, with UTF-8 encoding support.

JavaScript 217

10 months ago

StringZilla

@ashvardanian

Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖

simd CSV dataset ndjson string

C++ 2.23k

17 days ago

tensorflow-build

@lakshayg

TensorFlow binaries supporting AVX, FMA, SSE

237

1 month ago