GitHub Chinese Community



Search results for "textvqa"

mmgnn_textvqa
@likenneth

A PyTorch implementation of the CVPR 2020 paper: Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

gnn, vqa, PyTorch
Python · 50 stars
2 years ago

Related topics

vqa, textvqa, PyTorch, deep learning, language, dialog, multi-tasking, pretrained-models, multimodal, eccv


sam-textvqa
@yashkant

Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.

eccv, textvqa, vision, language
Python · 64 stars
4 years ago

Meta Research
TextVQA (archived)
@facebookresearch • Meta

Website for TextVQA dataset.

JavaScript · 28 stars
2 years ago

Meta Research
mmf
@facebookresearch • Meta

[Computer Science] A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

PyTorch, vqa, pretrained-models, multimodal, deep learning
Python · 5.58k stars
3 months ago

textvqa_grounding_task_qwen2.5-vl-ft
@828Tina

Jupyter Notebook · 29 stars
2 months ago

ssbaseline
@ZephyrZhuQi

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps [AAAI 2021]

Python · 57 stars
3 years ago

SMA
@ChenyuGAO-CS
[Description blocked by the site for a content violation]
Python · 11 stars
4 years ago

Awesome-MLLM-TextVQA
@zhousheng97

✨✨Latest Research on Multimodal Large Language Models on Scene-Text VQA Tasks

9 stars
3 months ago

mlci
@zhangshengHust

mlci model for textvqa

Jupyter Notebook · 4 stars
4 years ago

stvqa_amazon_ocr
@furkanbiten

STVQA and TextVQA OCR results from Amazon Text in Image pipeline

Jupyter Notebook · 11 stars
3 years ago