Visual Question Answering in Pytorch
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
Visual Question Answering Demo on pretrained model
❔ Visual Question Answering in Torch
Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.
Visual Q&A reading list
Strong baseline for visual question answering
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Tensorflow implementations of Relational Networks and a VQA dataset named Sort-of-CLEVR proposed by DeepMind.
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
Visual Question Answering in Tensorflow.
Codes for the paper "Power of Tempospatially Unified Spectral Density for Perceptual Video Quality Assessment", ICME2017 (Finalist of the World's FIRST 10K Best Paper Award)
图像/视频质量评价(I/VQA)算法代码汇总。持续更新。
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-bas...
[CVPR 2021] Multi-Modal-CelebA-HQ: A Large-Scale Text-Driven Face Generation and Understanding Dataset
Visual Question Answering task written in Keras that answers questions about images
Softwares:进行视频质量评价必须常用的软件;Papers: 视频质量评价领域重要论文