The mouse and trackpad utility for Mac.
A BVH implementation to speed up raycasting and enable spatial queries against three.js meshes.
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
翻译 - [ICCV 2019] TSM:高效视频理解的时移模块。
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
翻译 - [ICLR 2019] ProxylessNAS:直接在目标任务和硬件上进行神经体系结构搜索。
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
A developer friendly approach for sensors in React Native
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Reduce CPU usage by non-blocking async loop and psychologically speed up in JavaScript
The CDN for developers.
Display-agnostic acceleration of macOS applications using external GPUs.
The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Caffe for Sparse and Low-rank Deep Neural Networks
A high speed stepper library for Atmega 168/328p (nano), Atmega32u4, Atmega 2560, ESP32, ESP32S2, ESP32S3, ESP32C3, ESP32C6 and Atmel SAM Due
volksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.
[NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
A .NET library for hardware-accelerated, high performance, immediate mode rendering via Direct2D.
Pytorch implementation of our paper accepted by CVPR 2020 (Oral) -- HRank: Filter Pruning using High-Rank Feature Map