A state-of-the-art open visual language model | multimodal pretrained model
GPT-4V-level open-source multimodal model based on Llama3-8B
Simple CogVLM client script
Famous Vision Language Models and Their Architectures
Using CogVLM and CogAgent for image captioning (a minimal captioning-client sketch follows this list)
CogVLM2 Autocaptioning Tools
Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL (a LoRA fine-tuning sketch follows this list)
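
To make the captioning-client entry above concrete, here is a minimal sketch of what a simple CogVLM/CogVLM2 captioning client might look like, assuming the model is served behind an OpenAI-compatible chat-completions endpoint (for example a locally hosted demo server). The URL, port, and model identifier below are placeholders, not confirmed defaults of any of the linked projects.

```python
# Minimal captioning-client sketch, assuming CogVLM/CogVLM2 is served behind an
# OpenAI-compatible chat-completions endpoint on localhost. The URL, port, and
# model name are placeholders -- adjust them to your deployment.
import base64
import requests

API_URL = "http://localhost:8000/v1/chat/completions"  # assumed local demo server
MODEL_NAME = "cogvlm2-llama3-chat-19B"                  # placeholder model id

def caption_image(image_path: str, prompt: str = "Describe this image in detail.") -> str:
    # Encode the local image as a base64 data URL, the format the OpenAI
    # vision-style message schema expects for inline images.
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("utf-8")

    payload = {
        "model": MODEL_NAME,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"},
                    },
                ],
            }
        ],
        "max_tokens": 256,
        "temperature": 0.2,
    }
    response = requests.post(API_URL, json=payload, timeout=120)
    response.raise_for_status()
    # Standard chat-completions response layout: first choice, message content.
    return response.json()["choices"][0]["message"]["content"]

if __name__ == "__main__":
    print(caption_image("example.jpg"))
```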
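
The fine-tuning entry above points to a tool that wraps this kind of setup; as a rough illustration of what such a pipeline configures under the hood, the sketch below attaches LoRA adapters to Qwen2-VL with transformers and peft. This is not the linked tool's API, and the hyperparameters and target modules are illustrative assumptions rather than recommended defaults.

```python
# Rough sketch of the kind of LoRA setup that multimodal fine-tuning tools
# automate. Assumes transformers >= 4.45 (Qwen2-VL support) and peft installed;
# hyperparameters and target modules are illustrative, not tuned defaults.
import torch
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
from peft import LoraConfig, get_peft_model

MODEL_ID = "Qwen/Qwen2-VL-7B-Instruct"

processor = AutoProcessor.from_pretrained(MODEL_ID)
model = Qwen2VLForConditionalGeneration.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Attach low-rank adapters to the language model's attention projections so that
# only a small fraction of parameters is updated during fine-tuning.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# From here, a standard training loop (or the Trainer API) over image-text pairs
# prepared with `processor` would update only the adapter weights.
```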