#Computer Science# One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
Translation - Create HTML profiling reports from pandas DataFrame objects.
Always know what to expect from your data.
#Data Warehouse# The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Translation - Find label errors in datasets and learn with noisy labels.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...
Translation - An open standard for metadata. A single place to discover, collaborate on, and get your data right.
#Computer Science# Visualize and compare datasets, target values, and associations with one line of code.
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
#Computer Science# 🚚 Agile data preparation workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex, and PySpark
The first open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
#Computer Science# Automatically find issues in image datasets and practice data-centric computer vision.
Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Monitor the stability of a Pandas or Spark dataframe ⚙︎
#Computer Science# Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algor...
Databricks framework to validate the data quality of PySpark DataFrames
#Data Warehouse# 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark, and Vaex)
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML f...
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility acr...
Swiple enables you to easily observe, understand, validate, and improve the quality of your data.