pytorch implementation of video captioning
Captions for my video courses
Video Grounding and Captioning
A simple iOS photo and video browser with grid view, captions and selections.
This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing the ...
Download pictures (or videos) along with their captions and other metadata from Instagram.
翻译 - 从Instagram下载图片(或视频)以及其标题和其他元数据。
Video to Text: Natural language description generator for some given video. [Video Captioning]
transcripts and captions for 3blue1brown videos
Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2018, with code, model and prediction results.
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
Using Semantic Compositional Networks for Video Captioning
A collection of tools made to help you create and edit subtitles in different formats (Subrip, WebVTT, Substation Alpha...)
Video Captioning is an encoder decoder mode based on sequence to sequence learning
这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍者欣赏网络视频、感知周围环境,促进“无障碍视频”的发展。
Simple image captioning model
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense r...
翻译 - X-modaler 是用于跨模态分析的多功能高性能代码库。
Dense image captioning in Torch
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
BERT + Image Captioning
Image Captioning Using Transformer
Code for Unsupervised Image Captioning
Image Captioning with Keras
Efficient Image Captioning code in Torch, runs on GPU