”grounding“ 的搜索结果 | GitHub 中文社区

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

object-detection open-world open-world-detection vision-language vision-language-transformer

Python6.87 k

4 个月前

Google Bing GitHub

image-segmentation open-world yolov5 object-detection yolov7 data-generation vision-language automatic-labeling-system computer-vision vision-language-transformer

Open-GroundingDino

@longzw1997

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python454

5 个月前

groundingLMM

@mbzuai-oryx

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python788

6 天前

awesome-grounding

@TheShadow29

awesome grounding: A curated list of research papers in visual grounding

1.03 k

2 年前

grounded-video-description

@facebookresearch • Meta

Video Grounding and Captioning

Python323

3 年前

GroundingGPT

@lzw-lzw

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Python305

25 天前

mast3r

NAVER@naver

Grounding Image Matching in 3D with MASt3R

Python1.36 k

2 个月前

Grounded-Segment-Anything

@IDEA-Research

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

open-vocabulary-detection open-vocabulary-segmentation data-generation automatic-labeling-system caption

Jupyter Notebook15.27 k

3 个月前

DeepRL-Grounding

@devendrachaplot

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Python237

7 年前

onestage_grounding

@zyang-ur

A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)

Python144

4 年前

grounded-segment-anything-colab

@camenduru

Grounding DINO with Segment Anything & Stable Diffusion colab

Jupyter Notebook191

1 年前

vRGV

@doc-doc

Visual Relation Grounding in Videos (ECCV'20, Spotlight)

Python57

2 年前

Awesome-Temporally-Language-Grounding

@WuJie1010

A curated list of “Temporally Language Grounding” and related area

111

5 年前

UnifiedSKG

@xlang-ai

[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models

Python549

1 年前

xlang-paper-reading

@xlang-ai

Paper collection on building and evaluating language model agents via executable language grounding

335

7 个月前

zoe

@CogComp

Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.

Python43

5 年前

DRFT

@wenz116

End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021

3 年前

TVQAplus

@jayleicn

[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering

Python124

2 年前

SAM_gDINO_AutoLabeling

@mhyeonsoo

Auto Segmentation label generation with SAM (Segment Anything) + Grounding DINO

Jupyter Notebook8

2 年前

LCMCG-PyTorch

@youngfly11

AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"

Jupyter Notebook54

3 年前

MultiModal-DeepFake

@rshaojimmy

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Python366

7 个月前

graph-coloring

@amirdeljouyi

Graph grounding for graph coloring algorithms such as Welsh Powell and Evolution algorithms like Harmony Search and Genetic

TypeScript36

6 年前

NAFAE

@jshi31

Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Losses"

Python29

4 年前

notebooks

@roboflow

#计算机科学#Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...

机器视觉深度学习深度神经网络 image-classification image-segmentation

Jupyter Notebook5.61 k

2 天前

cvevals

@roboflow

Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, models hosted on Roboflow)

Python18

1 年前

编程语音

Python
Jupyter Notebook
TypeScript