#数据仓库#OpenRefine(原名Google Refine) 是一个强大的数据清洗和转换工具
A Scalable Data Cleaning Library for PySpark.
Data visualisations in Power BI
Table Enforcer is my attempt to apply a sort of "test driven development" workflow to data cleaning and validation. A python package to facilitate the iterative process of developing and using schema-...
Examples for Optimus a Data Cleansing Library for Big Data.
#自然语言处理#-This project targets the textual analysis of Egyptian movie plot summaries that were curated from online sources, covering the four golden decades of Egyptian Cinema.
"Telewire Analytics," an innovative project aimed at optimizing resource utilization within the telecom industry.
This Project is based of an Online Retail store that wants to analyse major contributing factors to the revenue so they can strategically plan for next year.
Analyzed a survey recieved using Power BI tool to draw useful insights.
Cleaned a movies dataset to present specific visuals to answer research questions
Data cleansing and validation for Data Science Master degree
Implementation of a Neural Network (NN) model for handwriting recognition using the MNIST dataset.
Advance Guide Of Cleaning & 20+ ways of cleaning data with python
This is the curated pile of notebooks/small projects which contains linear and non-linear regression models.
This project extracts data from Azure datalake gen 2 storage, transforming it and then transferring it to SQL database.
This course by University of Michigan introduces the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv fil...
This project is an internal project with INTEL where a framework for monitoring data quality from disparate sources and automating it using python.
CSVParser is a tool to parse csv file using univocity and commons csv parsers. It cleans new line (\n) character & special characters between data. It also handle various garbage data like odd no of q...