diff --git a/README.md b/README.md
index 1ac8e95..daa6387 100644
--- a/README.md
+++ b/README.md
@@ -230,7 +230,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
 ### Blog Posts
 * [Site Reliability at DAZN](https://medium.com/dazn-tech/site-reliability-at-dazn-a3ba4af0638d)
-
+
@@ -265,6 +265,8 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
 ### Blog Posts
+* [Deployment At Scale: Story Behind Dream11’s In-House Blue-Green Deployment Platform ‘OneClick’.](https://blog.dream11engineering.com/deployment-at-scale-story-behind-dream11s-in-house-blue-green-deployment-platform-oneclick-b2c761b12896)
+* [Enhancing security and trust with AWS WAFv2](https://blog.dream11engineering.com/enhancing-security-and-trust-with-aws-wafv2-8b050b1cba37)
 * [Lessons learned from running GraphQL at scale](https://blog.dream11engineering.com/lessons-learned-from-running-graphql-at-scale-2ad60b3cefeb)
 * [Break circuits, save Kong 🦍](https://blog.dream11engineering.com/break-circuits-save-kong-3680d88a0639)
 * [Finding Order in Chaos: How We Automated Performance Testing with Torque](https://blog.dream11engineering.com/finding-order-in-chaos-how-we-automated-performance-testing-with-torque-6eb63706fcea)
@@ -341,6 +343,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
 ### Blog Posts
+* [Automating Performance Standards](https://medium.com/expedia-group-tech/automating-performance-standards-b51efc92d237)
 * [Using Fault-Injection to Improve our new Runtime Platform’s Reliability](https://medium.com/expedia-group-tech/using-fault-injection-to-improve-our-new-platforms-reliability-656b1147b132)
 * [Learning from Incidents at Expedia Group](https://medium.com/expedia-group-tech/learning-from-incidents-at-expedia-group-51a8c72a4286)
 * [Improving Vrbo Homepage Loading Experience](https://medium.com/expedia-group-tech/improving-vrbo-homepage-loading-experience-e4b2207535f4)
@@ -356,22 +359,6 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
-
- Facebook
-
-### Blog Posts
-
-* [More details about the October 4 outage](https://engineering.fb.com/2021/10/05/networking-traffic/outage-details/)
-
-### Videos
-
-* [A Customer Service Approach to SRE](https://www.usenix.org/conference/srecon19emea/presentation/looney)
-* [How (Not) to Scale a Project: A Post-Mortem](https://www.usenix.org/conference/srecon19asia/presentation/bagnoli)
-* [Releasing the World's Largest Python Site Every 7 Minutes](https://www.usenix.org/conference/srecon19asia/presentation/wong-shuhong)
-* [Using ML to Automate Dynamic Error Categorization](https://www.usenix.org/conference/srecon19asia/presentation/davoli)
-
-
-
 Fastly
@@ -397,6 +384,8 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
 ### Blog Posts
+* [Using ChatOps to help Actions on-call engineers](https://github.blog/2021-12-01-using-chatops-to-help-actions-on-call-engineers/)
+
 * [Partitioning GitHub’s relational databases to handle scale](https://github.blog/2021-09-27-partitioning-githubs-relational-databases-scale/)
 * [Increasing developer happiness with GitHub code scanning](https://github.blog/2021-09-07-increasing-developer-happiness-github-code-scanning/)
 * [Why (and how) GitHub is adopting OpenTelemetry](https://github.blog/2021-05-26-why-and-how-github-is-adopting-opentelemetry/)
@@ -410,7 +399,9 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
 * [MySQL High Availability at GitHub](https://github.blog/2018-06-20-mysql-high-availability-at-github/)
 ### Major incidents & analysis reports
-
+* [GitHub Availability Report: December 2021](https://github.blog/2022-01-05-github-availability-report-december-2021/)
+* [GitHub Availability Report: November 2021](https://github.blog/2021-12-01-github-availability-report-november-2021/)
+* [GitHub Availability Report: October 2021](https://github.blog/2021-11-04-github-availability-report-october-2021/)
 * [GitHub Availability Report: September 2021](https://github.blog/2021-10-06-github-availability-report-september-2021/)
 * [GitHub Availability Report: August 2021](https://github.blog/2021-09-01-github-availability-report-august-2021/)
 * [GitHub Availability Report: July 2021](https://github.blog/2021-08-04-github-availability-report-july-2021/)
@@ -550,6 +541,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
 ### Blog Posts
+* [Scaling AWS Infrastructure to Support Multiple Regions](https://www.grammarly.com/blog/engineering/scaling-aws-infrastructure/)
 * [Security Operations in an AWS Environment](https://www.grammarly.com/blog/engineering/security-infrastructure-aws/)
@@ -578,6 +570,7 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
 ### Blog Posts
+* [The Adventures of Rendezvous in Heroku’s New Architecture](https://blog.heroku.com/engineering)
 * [Incident Response at Heroku](https://blog.heroku.com/incident-response-at-heroku-2020)
@@ -690,6 +683,23 @@ _Note to readers: This list refers to some of the articles, posts, videos, tools
+
+ Meta
+
+### Blog Posts
+* [SLICK: Adopting SLOs for improved reliability](https://engineering.fb.com/2021/12/13/production-engineering/slick/)
+* [More details about the October 4 outage](https://engineering.fb.com/2021/10/05/networking-traffic/outage-details/)
+* [Update about the October 4th outage](https://engineering.fb.com/2021/10/04/networking-traffic/outage/)
+
+### Videos
+
+* [A Customer Service Approach to SRE](https://www.usenix.org/conference/srecon19emea/presentation/looney)
+* [How (Not) to Scale a Project: A Post-Mortem](https://www.usenix.org/conference/srecon19asia/presentation/bagnoli)
+* [Releasing the World's Largest Python Site Every 7 Minutes](https://www.usenix.org/conference/srecon19asia/presentation/wong-shuhong)
+* [Using ML to Automate Dynamic Error Categorization](https://www.usenix.org/conference/srecon19asia/presentation/davoli)
+
+
+
Microsoft