The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Finds label errors in datasets and supports learning with noisy labels.
#Computer Science# A system for quickly generating training data with weak supervision
#Natural Language Processing# Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
#Natural Language Processing# skweak: A software toolkit for weak supervision applied to NLP tasks