#计算机科学#🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
#计算机科学#Prepping tables for machine learning
Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.
#大语言模型#Scalable data pre processing and curation toolkit for LLMs
#大语言模型#Open source project for data preparation of LLM application builders
#计算机科学#Data Preparation for Satellite Machine Learning
#计算机科学#A New, Interactive Approach to Learning Data Science
#学习与技能提升#An open source book to learn data science, data analysis and machine learning, suitable for all ages!
#数据仓库#🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
ABAP unit testing framework, prepare in Excel, reuse in abap code
This repository contains my implementations of the algorithms which MoNuSAC participants could use for data preparation to train their models at ISBI 2020.
“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumP...
Market Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
GWAS summary statistics files QC tool
Data preparation for data science projects.
A library for creating and curating reproducible pipelines for scientific and industrial machine learning
Foofah: programming-by-example data transformation program synthesizer
Extract and evaluate radiomics for liver cancer tumors from DICOM segmentation masks. Using SimpleITK, PyRadiomics and PyDicom.