#计算机科学#Tensor library for machine learning
ggml implementation of BERT
#大语言模型#Python bindings for the Transformer models implemented in C/C++ using GGML library.
Python bindings for ggml
#大语言模型#Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Run GGML models with Kubernetes.
Binding to transformers in ggml
Inference Vision Transformer (ViT) in plain C/C++ with ggml
ggml implementation of the baichuan13b model (adapted from llama.cpp)
Deploy your GGML models to HuggingFace Spaces with Docker and gradio
C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3