GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub

编程语言

”vlm“ 的搜索结果

vlmcsd存档
@Wind4

vlmcsd 就是一个仿真KMS管理工具,可以部署在内网或者公网可以运行在Linux包括Android、FreeBSD、Solaris、Minix、以及Mac OS、IOS、Windows等系统平台上。

C8.65 k
1 年前

相关主题

vlm大语言模型vision-language-modelqwenllamadeepseekllavamultimodal人工智能clip

Google   Bing   GitHub

Tutorial
@InternLM

LLM&VLM Tutorial

Python1.83 k
2 个月前
VLMEvalKit
@open-compass

#大语言模型#Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

gpt-4vlarge-language-modelsllavamulti-modalopenai
Python2.62 k
15 小时前
hiyouga/LLaMA-Factory
LLaMA-Factory
@hiyouga

#自然语言处理#Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

fine-tuningllama大语言模型pefttransformers
Python53.22 k
1 天前
THUDM
CogVLM
THUDM@THUDM

a state-of-the-art-level open visual language model | 多模态预训练模型

cross-modalitylanguage-modelmulti-modalpretrained-modelsvisual-language-models
Python6.6 k
1 年前
huggingface/nanoVLM
Hugging Face
nanoVLM
Hugging Face@huggingface

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python3.6 k
16 小时前
vlmcsd
@kkkgo

🔑Portable open-source KMS Emulator in C

kmsvlmcsdemulated-kms-servers
C1.09 k
1 年前
prismatic-vlms
@TRI-ML

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python718
1 年前
VLM-R1
@om-ai-lab

#大语言模型#Solve Visual Understanding with Reinforced VLMs

deepseek-r1grpo大语言模型multimodalvlm
Python5.23 k
5 天前
VLM_survey
@jingyi0000

#计算机科学#Collection of AWESOME vision-language models for vision tasks

机器视觉深度学习knowledge-distillationsurveytransfer-learning
2.8 k
1 个月前
ComfyUI-Florence2
@kijai

Inference Microsoft Florence2 VLM

Python1.27 k
1 个月前
asn1c
@vlm

The ASN.1 Compiler

C1.09 k
2 年前
R1-V
@Deep-Agent

Witness the aha moment of VLM with less than $3.

Python3.53 k
4 个月前
THUDM
CogAgent
THUDM@THUDM

An open-sourced end-to-end VLM-based GUI Agent

gui-agentcomputer-usevlmagentglm
Python978
3 个月前
Flame-Code-VLM
@Flame-Code-VLM

#前端开发#Flame is an open-source multimodal AI system designed to translate UI design mockups into high-quality React code. It leverages vision-language modeling, automated data synthesis, and structured train...

code-generationfrontend-developmentvision-language-model人工智能
Python525
3 个月前
X-VLM
@zengyan-97

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

multimodalityvision-and-language
Python478
3 年前
mlx-vlm
@Blaizzy

#大语言模型#MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

llava大语言模型MLXvision-transformerapple-silicon
Python1.41 k
7 小时前
minimind-v
@jingyaogong

#大语言模型#🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

人工智能ChatGPTvision-language-model
Python3.93 k
2 个月前
oumi
@oumi-ai

Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!

dpoevaluationfine-tuninginferencellama
Python8.23 k
12 小时前
xlite-dev/Awesome-LLM-Inference
Awesome-LLM-Inference
@xlite-dev

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

flash-attentiontensorrt-llmvllm
Python4.18 k
16 小时前
awesome-vlm-architectures
@gokayfem

#Awesome#Famous Vision Language Models and Their Architectures

clipllavavlm
Markdown897
4 个月前
阿里巴巴
Pai-Megatron-Patch
阿里巴巴@alibaba

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python942
3 个月前
nexa-sdk
@NexaAI

#大语言模型#Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...

asredge-computing大语言模型on-device-aion-device-ml
Python4.59 k
8 天前
awesome-llm-and-aigc
@coderonion

#数据仓库#🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applic...

gpt大语言模型Awesome Listsllamaaigc
712
2 个月前
joycaption
@fpgaminer

JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.

captioningvlm
Python667
2 个月前
Surveillance_Video_Summarizer
@Ravi-Teja-konda

#大语言模型#VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for que...

人工智能ChatGPTflorence-2gpt-4gradio
Python116
24 天前
comfyui_LLM_party
@heshengtao

LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces,...

comfyuiopenaiworkflowagentdify
Python1.76 k
9 天前