#Computer Science# A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
Profiling Pandas on Ray
Benchmark Code for Pandas on Ray Blog Post
Ad-hoc tests for Pandas on Ray from UC Berkeley RISE Lab
A repository for comparing Pandas and Modin (Dask/Ray) performance on simple TPC-H Queries (Q1 and Q6) written in Pandas.
Uses DataFlow pipelines from pandas for classification with an AlexNet-like CNN. Uses TensorFlow-Keras and scikit-learn on an NVIDIA K80 GPU by the University at Albany, over the NIH Chest X-ray Data.s...
Built a model for detecting and localizing disease in patients' chest X-rays using the NIH Chest X-rays dataset on Kaggle, with Keras, TensorFlow, Matplotlib, Pandas, and NumPy ...
#Large Language Models# RayLLM - LLMs on Ray (Archived). Read README for more info.
Koalas: pandas API on Apache Spark
#Large Language Models# Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
#Computer Science# A toolkit to run Ray applications on Kubernetes
V-Ray for Blender Add-on
BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
Pretrain, finetune and serve LLMs on Intel platforms with Ray
Python and Pandas are known to have issues around scalability and efficiency. You will learn how to use libraries such as Modin, Dask, Ray, and Vaex to overcome these limitations of Pandas.
Rust ray tracer based on Ray Tracing in One Weekend
Training Sonic with RLlib