GGUF Quantization support for native ComfyUI models
For GGUF support, see KoboldCPP: https://github.com/LostRuins/koboldcpp
GGUF implementation in C as a library and a tools CLI program
automatically quant GGUF models
Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.
GGUF Quantization of any LLM.
一个基于 Gradio 的 Web UI,用于运行像 LLaMA、llama.cpp、GPT-J、Pythia、OPT 和 GALACTICA 这样的大型语言模型。
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.