# SRE

Site reliability engineering (SRE) is a set of principles and practices that applies aspects of software engineering to infrastructure and operations problems. Its main goal is to create scalable and highly reliable software systems. SRE is closely related to DevOps, a set of practices that combines software development and IT operations, and has also been described as a specific implementation of DevOps.

bregman-arie/devops-exercises

#Interview# DevOps interview questions covering Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, networking, virtualization, and more

Python 78.28 k
15 days ago
awesome-foss

#Awesome# A curated list of amazingly awesome open-source sysadmin resources.

30.98 k
2 months ago
upgundecha/howtheysre

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)

JavaScript 9.44 k
8 days ago
isno

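⭐ [Published book] JD.com purchase link: https://item.jd.com/10183653901041.html An in-depth explanation of kernel networking, Kubernetes, service mesh, containers, and other cloud-native technologies. A practice-tested guide to DevOps and SRE.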

JavaScript 8.34 k
18 hours ago
linkedin/school-of-sre

At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.

HTML 8.03 k
1 year ago
coroot

Coroot is an open-source observability and APM tool with AI-powered Root Cause Analysis. It combines metrics, logs, traces, continuous profiling, and SLO-based alerting with predefined dashboards and ...

Go 7.06 k
19 hours ago
k8sgpt-ai
Go 6.95 k
2 days ago
StackStorm

StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 i...

Python 6.33 k
6 days ago
hjacobs
HTML 6.21 k
5 years ago
chaosblade-io

An easy-to-use and powerful chaos engineering experiment toolkit. (An easy-to-use, powerful chaos experiment injection tool open-sourced by Alibaba.)

Go 6.2 k
1 day ago
rundeck

Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts

Groovy 5.89 k
2 days ago
litmuschaos/litmus

Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....

Go 4.83 k
4 days ago
jonmosco
Shell 3.71 k
4 months ago