image2text · GitHub Topics

#计算机科学#pix2tex: Using a ViT to convert images of equations into LaTeX code.

翻译 - pix2tex：使用 ViT 将方程图像转换为 LaTeX 代码。

机器学习 transformer im2latex 深度学习 image2text LaTeX dataset PyTorch im2markup OCR latex-ocr vit math-ocr vision-transformer 图像处理 Python im2text

Python 14.09 k

3 个月前

OleehyO / TexTeller

TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.

image2text latex-ocr

Python 513

11 天前

prabhakar267 / image2text

📋 Python wrapper to grab text from images and save as text files using Tesseract Engine

tesseract optical-character-recognition OCR image2text tesseract-ocr

Python 407

2 年前

wangleihitcs / Papers

#自然语言处理#读过的CV方向的一些论文，图像生成文字、弱监督分割等

机器视觉自然语言处理 captions vqa image2text cvpr eccv iccv scene-text-detection-recognition

126

5 年前

ekiim / vim-mathpix

#编辑器#Vim commands to use mathpix from your screen

Vim LaTeX image2text

Shell 41

7 个月前

Hangover3832 / ComfyUI-Hangover-Nodes

Various nodes for ComfyUI

comfyui image2text stable-diffusion

Python 40

10 个月前

yuanxiaosc / Image-Captioning

CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述

image-captioning image2text tensorflow2 template-project Tensorflow

Jupyter Notebook 35

6 年前

etosworld / etos-deepcut

#计算机科学#Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr . a tool to do automatically object segmentation from extreme points.

segmentation object-segmentation 深度学习 semantic-segmentation annotation pspnet image2text PyTorch

Python 25

4 年前

TheLime1 / CheatoMate

A collection of scripts to "help" you with your programming exams and assignments.

人工智能 chat cheat cheating exam assignment image2text codebase

Python 17

1 年前

MurageKabui / AutoIT-OCRSpace-UDF

A AutoIT 3 wrapper library around the OCRSpace API.

optical-character-recognition recognition 图像处理 image2text text2image OCR API Library devtools developer-tools

AutoIt 13

1 年前

thefcraft / civitai-stable-diffusion-337k

Civitai Stable Diffusion 337k Dataset; dataset of ai generated image

civitai dataset image-classification image-generation image2text stable-diffusion

Python 10

3 个月前

sssingh / pic-to-story

#大语言模型#A Large Language Model (LLM) Based App to Generate Stories from Pictures

generative-model gpt-3-text-generation gradio huggingface image2text langchain large-language-models 大语言模型 OpenAPI Specification

Python 7

2 年前

Jerey / image-to-pdf-and-txt

Python tool, which takes 1..n images, tries to rotate them based on the text, extract the text and store 1..n images to a pdf.

Python image2text opencv-python tesseract OCR Hacktoberfest

Python 6

2 年前

michelecafagna26 / HL-dataset

[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rat...

dataset vision-and-language image-captioning image2text

1 年前