#计算机科学# TFX is an end-to-end platform for deploying production ML pipelines
翻译 - TFX是用于部署生产ML管道的端到端平台
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
翻译 - Google提供的Cloud Dataflow模板管道,用于解决简单的Cloud数据任务
Yet Another UserAgent Analyzer
[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
翻译 - Kubernetes运算符,用于管理Apache Flink应用程序的生命周期。
#区块链#ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Clojure API for a more dynamic Google Dataflow
Collection of transforms for the Apache beam python SDK.
#区块链#Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.
Repository to quickly get you started with new Machine Learning projects on Google Cloud Platform. More info(slides):
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Microservices in Post-Kubernetes Era. A polyglot monorepo
Some class materials for a data processing course using PySpark
Opinionated serverless event analytics pipeline