Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
Translation - Dataprep: Data preparation in Python
Make your dataset talk to you. The AI assistant for data preparation.
A Holistic Platform for Automating Data Preparation
#Computer Science#This repo includes code for ML Zoomcamp. You can follow the tutorials here: https://www.youtube.com/watch?v=rowoDjPc8HU&list=PL3MmuxUbc_hIhxl5Ji8t4O6lPAOpHaCLR 👩🏼💻
Trifacta Flows Examples and Templates. Flows zip files, recipes and datasets.
Public repository for custom blocks for Omniscope
Learn data visualization through Tableau 2020 and create opportunities for you or key decision-makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlene...
#Web Crawler#ScrapySub is a Python library designed to recursively scrape website content, including subpages. It fetches the visible text from web pages and stores it in a structured format for easy access and an...
#Time-Series Database#Data Science materials
This repository demonstrates data imputation using Scikit-Learn's SimpleImputer, KNNImputer, and IterativeImputer.
The project deals with determining and predicting the type of accident taking place in the city of Austin. The data would help in understanding what possible factors are leading to the accidents based...
Can you correctly predict whether a customer will cancel their reservation? You can find the dataset on Kaggle: https://www.kaggle.com/datasets/ahsan81/hotel-reservations-classification-da...
The Bikes Sales Analysis Excel Project is a practical exploration of sales data analysis using Microsoft Excel. This project showcases how Excel can be a powerful tool for data cleaning, preprocessing...
#Computer Science#This project involves the use of K-Means Clustering to find the best accommodation for students in Bangalore (or any other city of your choice) by classifying accommodation for incoming students on th...
Preparing and Cleaning Data
#Computer Science#Trying to predict the survival rate of passengers using algorithms like Logistic Regression, AdaBoost, Gradient Boost, Decision Tree Classifiers, Extra Tree Classifiers, Random Forest Classifiers and ...
In this project, I have used logistic regression, a supervised machine learning algorithm, to predict whether a person has diabetes or not based on various features such as age, blood pressure, glucos...
The data is available from the Kaggle competition: https://www.kaggle.com/competitions/spaceship-titanic. I participated in and completed the competition on my own.
#Algorithm Practice#Building a house price prediction model for a real estate firm, based on various factors. Problem: Regression | Algorithm used: Linear Regression using OLS
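For readers unfamiliar with the last entry's technique, linear regression fitted by ordinary least squares (OLS) can be sketched as below. This is a minimal illustration, not that project's code: the `fit_ols` helper, the two features (area and bedrooms), and the toy prices are all hypothetical and were chosen so the fit is easy to check.

```python
import numpy as np


def fit_ols(features, targets):
    """Fit OLS with an intercept term; return (intercept, coefficients)."""
    # Prepend a column of ones so the first fitted weight acts as the intercept.
    design = np.column_stack([np.ones(len(features)), features])
    # lstsq minimizes the sum of squared residuals ||design @ beta - targets||^2.
    beta, *_ = np.linalg.lstsq(design, targets, rcond=None)
    return beta[0], beta[1:]


# Hypothetical toy data: each row is [area in square feet, number of bedrooms].
features = np.array([
    [1000.0, 2.0],
    [1500.0, 3.0],
    [2000.0, 3.0],
    [2500.0, 4.0],
    [1200.0, 2.0],
])
# Prices generated from a known linear rule (an assumption for illustration),
# so the recovered coefficients can be compared against it.
prices = 50_000 + 100 * features[:, 0] + 10_000 * features[:, 1]

intercept, coefs = fit_ols(features, prices)

# Predict the price of an unseen (hypothetical) house.
new_house = np.array([1800.0, 3.0])
predicted = intercept + new_house @ coefs
```

Because the toy prices follow an exact linear rule, the fitted coefficients should match the rule up to floating-point error. Real housing data is noisy, so an actual model would also need a held-out evaluation.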