Faker is a Python package that generates fake data for you.
翻译 - Faker是一个Python软件包,可为您生成伪造数据。
#计算机科学# pix2tex: Using a ViT to convert images of equations into LaTeX code.
翻译 - pix2tex:使用 ViT 将方程图像转换为 LaTeX 代码。
#计算机科学# A MNIST-like fashion product database. Benchmark 👇
翻译 - 类似于MNIST的时尚产品数据库。基准:point_right:
#自然语言处理# Open source annotation tool for machine learning practitioners.
翻译 - 机器学习从业者的开源文本注释工具。
#Awesome# Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
翻译 - 机器学习,NLP,视觉,推荐系统项目创意的精选清单
#计算机科学# PHP-ML - Machine Learning library for PHP
翻译 - PHP-ML-PHP的机器学习库
This repository contains compatibility data for Web technologies as displayed on MDN
翻译 - 该存储库包含MDN上显示的Web技术的兼容性数据
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
#计算机科学# Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
#自然语言处理# Models, data loaders and abstractions for language processing, powered by PyTorch
翻译 - 文本和NLP的数据加载器和抽象
#计算机科学# We are building an open database of COVID-19 cases with chest X-ray or CT images.
翻译 - 我们正在建立一个带有胸部X光或CT图像的COVID-19病例的开放数据库。
Extract data from a wide range of Internet sources into a pandas DataFrame.
翻译 - 从各种Internet来源中提取数据到pandas DataFrame中。
#自然语言处理# A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Windows Events Attack Samples
翻译 - Windows 事件攻击示例
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖
ISO 3166-1 country lists merged with their UN Geoscheme regional codes in ready-to-use JSON, XML, CSV data sets
翻译 - ISO 3166-1国家/地区列表与联合国地理信息系统区域代码合并为现成的JSON,XML和CSV数据集