Kubernetes-based, scale-to-zero, request-driven compute
#Large Language Models# Private Open AI on Kubernetes