#数据仓库#Label Studio is a multi-type data labeling and annotation tool with standardized output format
翻译 - Label Studio是具有标准化输出格式的多类型数据标签和注释工具
Faker is a Python package that generates fake data for you.
翻译 - Faker是一个Python软件包,可为您生成伪造数据。
#计算机科学#pix2tex: Using a ViT to convert images of equations into LaTeX code.
翻译 - pix2tex:使用 ViT 将方程图像转换为 LaTeX 代码。
#计算机科学#A MNIST-like fashion product database. Benchmark 👇
翻译 - 类似于MNIST的时尚产品数据库。基准:point_right:
#自然语言处理#Open source annotation tool for machine learning practitioners.
翻译 - 机器学习从业者的开源文本注释工具。
#Awesome#Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
翻译 - 机器学习,NLP,视觉,推荐系统项目创意的精选清单
#计算机科学#PHP-ML - Machine Learning library for PHP
翻译 - PHP-ML-PHP的机器学习库
This repository contains compatibility data for Web technologies as displayed on MDN
翻译 - 该存储库包含MDN上显示的Web技术的兼容性数据
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
#计算机科学#Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
#自然语言处理#Models, data loaders and abstractions for language processing, powered by PyTorch
翻译 - 文本和NLP的数据加载器和抽象
#自然语言处理#Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
#计算机科学#We are building an open database of COVID-19 cases with chest X-ray or CT images.
翻译 - 我们正在建立一个带有胸部X光或CT图像的COVID-19病例的开放数据库。
Extract data from a wide range of Internet sources into a pandas DataFrame.
翻译 - 从各种Internet来源中提取数据到pandas DataFrame中。
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖
Windows Events Attack Samples
翻译 - Windows 事件攻击示例