#Computer Science# pix2tex: Using a ViT to convert images of equations into LaTeX code.
TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and better generalization, enabling it to cover most usage scenarios.
📋 Python wrapper to grab text from images and save as text files using Tesseract Engine
#Natural Language Processing# Some computer vision papers I have read, covering image-to-text generation, weakly supervised segmentation, and more.
CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image captioning (image-to-text) on the MS-COCO dataset.
#Computer Science# Deep Extreme Cut (http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr): a tool for automatic object segmentation from extreme points.
A collection of scripts to "help" you with your programming exams and assignments.
An AutoIT 3 wrapper library around the OCRSpace API.
Civitai Stable Diffusion 337k Dataset: a dataset of AI-generated images.
#Large Language Models# A Large Language Model (LLM) based app to generate stories from pictures.
[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rat...
TAO71 I4.0 is an AI created by TAO71 in Python.
Python tool that takes 1..n images, tries to rotate them based on the text, extracts the text, and stores the 1..n images in a PDF.
#Computer Science# [AINL 2023] IMAD: IMage Augmented multi-modal Dialogue
🎞 Video editor with description generation for MTS TrueTech Hack