Book Review: Database Reliability Engineering – Designing and Operating Resilient Database Systems

Book Review
Hello and welcome to yet another book review. Databases have been called the “killer application of IT” and it is true that in, almost any, computing environment today, one or more databases are in play. Having said that, in-depth knowledge of these database systems used to reside with the DBAs of an organization. Today, with the roles being in flux, if you are an SRE chances are you have to deal with databases, quite often without the luxury of a dedicated DBA. Databases themselves have proliferated as well, with NoSQL paradigm entering the market and various combinations of the CAP theorem in effect, depending on the use case. So, it was about time that a dedicated volume appeared in the market that deals with how to apply SRE principles within…
Read More

Article Review: Containers will not fix your broken culture (and other hard truths)

Book Review
First things first, if you do not know what is ACM Queue (or even worse, do not know what ACM is), click on the links provided. ACM relatively recently has reformed and now presents articles by industry experts, especially in the Queue magazine (you get an article from Queue with every Communications of the ACM magazine but there is more, much more). (disclaimer - while I am a paying ACM member, I make no profit or have no further affiliation with the organization (i.e. I am not an official Ambassador). With that out of the way, let's focus in the article in question. The author is Bridget Kromhout, currently working for Microsoft. The main idea of the article is that solution to difficult, seemingly technical problems, can be best resolved…
Read More

Book Review: The Practice Of Cloud System Administration Volume 2 – Designing And Operating Large Distributed Systems

Book Review
Hello everyone with another book review. This time, I will be reviewing a book that I consider a classic. As always, let's start with the list of contents: Part I Design: Building it Designing in a distributed world Designing for Operations Selecting a Service Platform Application Architectures Design Patterns for Scaling Design Patterns for Resiliency Part II Operations: Running it Operations in a Distributed World DevOps Culture Service Delivery: The Build Phase Service Delivery: The Deployment Phase Upgrading Live Services Automation Design Documents Oncall Disaster Preparedness Monitoring Fundamentals Monitoring Architecture and Practice Capacity Planning Creating KPIs Operational Excellence Part III Appendices Assessments The Origins and Future of Distributed Computing and Clouds Scaling Terminology and Concepts Templates and Examples Recommended Reading   overall a bit over 500 beautifully printed pages (as…
Read More

Book Review: PostgreSQL Replication

Book Review, Uncategorized
So for my series of System Engineering books, I will proceed with a short review of PostgreSQL Replication by Packt. The reason this book came to be a part of my collection is that while there is a lot of information regarding PostgreSQL replication out there, a lot of it is out of date, given the overhaul of the replication system in PostgreSQL 9.X. Without further ado, here is the list of contents of the book. Understanding Replication Concepts Understanding the PostgreSQL Transaction Log Understanding Point-In-Time Recovery Setting up asynchronous replication Setting up synchronous replication Monitoring your setup Understanding Linux High-Availability Working with pgbouncer Working with PgPool Configuring Slony Using Skytools Working with Postgres-XC Scaling with PL/Proxy The book gets straight into business with an introduction of replication concepts, and…
Read More