#Computer Science# Tensor library for machine learning
ggml implementation of BERT
#Large Language Models# Python bindings for the Transformer models implemented in C/C++ using the GGML library.
Python bindings for ggml
Suno AI's Bark model in C/C++ for fast text-to-speech generation
Run GGML models with Kubernetes.
Binding to transformers in ggml
ggml implementation of the baichuan13b model (adapted from llama.cpp)
#Large Language Models# Calculate tokens/s and GPU memory requirements for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Deploy your GGML models to HuggingFace Spaces with Docker and gradio
Call RWKV v4 Raven 1B5-14B ONNX and ggml models from C# on CPU/GPU (supports INT4, INT8, Float16, Float32)
The repository contains scripts and merge scripts modified to adapt an Alpaca-Lora adapter for LoRA tuning, assuming the use of the "rinna/japanese-gpt-neox..." [gpt-neox] model con...
A PowerShell script that automates setting up and running Vicuna on a CPU (without a graphics card) using the llama.cpp library and a pre-trained ggml-vicuna-13b-4bit.bin model....