Originally posted by Evan
View Post
They also didn't suffer from failed backups. A backup is required if you lose data, and need to restore it. They didn't lose data. Backups were not a factor in this incident. The network was not a factor in this incident. I guarantee you they had a backup system in place and a network designed to gracefully fail over. That kind of regulation would be completely unnecessary and would not have changed this incident one iota. Testing for the effects of a fire in the data center is pretty hard to do.
The central points of failure that typically cause serious problems at this level would be primary network trunks -- like a fiber line into the data center, that can be broken by a car accident into a pole or cut by a construction crew a hundred miles away, had this happen to me twice last year -- power (the problem here), primary data center loss due to fire/flood, or software system upgrade (i.e. updating a system and the new software causing problems).
When you read about cobbled together systems, it isn't the network or even the hardware people are worried about, it is the software systems they are worried about.
Comment