captioning-videos · GitHub Topics

#大语言模型#[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

captioning-videos ChatGPT gradio langchain video-question-answering video-understanding stablelm chat Video big-model foundation-models large-language-models

Python 3.21 k

3 个月前

TheShadow29 / awesome-grounding

#自然语言处理#awesome grounding: A curated list of research papers in visual grounding

机器视觉自然语言处理 grounding Awesome Lists papers arxiv video-understanding captioning-videos embodied-agent multimodal-deep-learning language-grounding Bukkit

1.07 k

2 年前

TheShadow29 / vognet-pytorch

#自然语言处理#[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)

grounding Video pytorch-implementation vision vision-and-language 自然语言处理 captioning-videos

Python 67

5 年前

aimagelab / pacscore

[CVPR 2023] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation

captioning captioning-videos 机器视觉 cvpr cvpr2023 vision-and-language

Python 61

1 个月前

TheShadow29 / VidSitu

#自然语言处理#[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

vision vision-and-language grounding 自然语言处理 Video srl captioning-videos captioning

Python 59

4 年前

mynlp / cst_captioning

PyTorch Implementation of Consensus-based Sequence Training for Video Captioning

captioning-videos policy-gradient

Python 59

7 年前

chihyaoma / cyclical-visual-captioning

PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision

PyTorch vision-and-language captioning-videos

Python 44

5 年前

x-CK-x / Dataset-Curation-Tool

A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well a...

captioning-videos data-curation 下载器 tagging

Python 37

3 个月前

linto-ai / linto-studio

Transcription and annotation interface for recorded audio or video files

asr stt video-transcription caption captioning-videos subtitle subtitles

JavaScript 31

2 天前

szq0214 / MSR-VTT-Challenge

Video to Language Challenge (MSR-VTT Challenge 2016)

captioning-videos

Jupyter Notebook 31

7 年前

sanjifr3 / Narrator

#自然语言处理#An image and video description generator using an CNN-RNN based architecture.

PyTorch Python Flask rnn 自然语言处理机器视觉 tts captioning-videos

Jupyter Notebook 23

9 个月前

aimagelab / mvad-names-dataset

M-VAD Names Dataset. Multimedia Tools and Applications (2019)

captioning-videos video-captioning

Python 20

6 年前

preritj / show_attend_tell

Caption generator for live camera feed

show-attend-and-tell Tensorflow lstm live-streaming stream-processing captioning-videos convolutional-neural-networks

Python 13

8 年前

apivideo / caption.new

Sample app to add captions to an uploaded video. From api.video (https://api.video)

Node.js captioning-videos

JavaScript 11

2 年前

cd2bit / awesome-list-of-captioned-courses

Online professional courses that are captioned and/or subtitled

Web Accessibility (a11y)captions captioning captioning-videos subtitles courses online-course airtable

6 年前

BKHMSI / forget-me-not

Video Search using Natural Language

PyTorch sst vision attention-mechanism captioning-videos

Python 3

7 年前

Hyeongkeun / LAVCap

Official Pytorch Implementation of 'LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport' (ICASSP2025)

audio captioning captioning-videos large-language-models multimodal-large-language-models multimodal-learning Video visual

Python 3

3 个月前

tjoab / captionaize

#大语言模型#Generate TikTok— and Instagram—tailored captions and hashtags for your videos using the power of some super creative robots up in the clouds ☁️ 🤖 💬 ☁️

captioning-videos genai google-gemini 大语言模型

Python 2

10 个月前

marquesafonso / multilang-asr-captioner

A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.

automatic-speech-recognition captioning-videos faster-whisper Whisper

HTML 2

5 个月前

Edrolo / wistitler

Automated Wistia video captioning tool

自动化 captioning-videos

Python 0

2 年前