#计算机科学#1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
翻译 - 从pandas DataFrame对象创建HTML分析报告
#计算机科学#Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
翻译 - Easy Machine Learning是一个基于通用数据流的系统,可简化将机器学习算法应用于现实世界任务的过程。
A Data Analysis Board in Vue.
PySpark-Tutorial provides basic algorithms using PySpark
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
Powerful & Easy way for big data discovery
翻译 - 强大而轻松的大数据发现方式
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
#计算机科学#A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.
Use CH-UI to work with your data from Click House self-hosted with a user-friendly interface. CH-UI is a modern and feature-rich user interface for ClickHouse databases. It offers an intuitive platfor...
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
This is about learning courses in Coursera. All the answers given written by myself
#计算机科学#I have built the computer vision models in 3 different ways addressing different personas, because not all companies will have a resolute data science team. quality-control manufacturing big-data-anal...
#计算机科学#Bucketize an image based on exhaust data and AI generated data. industry-solutions azure azure machine learning services computer-vision big data big data analytics machine learning image recognition...
The Pandata scalable open-source analysis stack
Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
Big data projects implemented by Maniram yadav
Egis - a handy Ruby interface for AWS Athena