The more complex a system is, the more complex it becomes to accurately monitor the system. Each interaction between different parts of the system is a potential point of failure. Your monitoring systems will need to grow and develop alongside the systems they are monitoring.
Read MoreThe First Law of Yaks is that there are always more yaks than you expect. A tale of ec2 metadata, chef runs, and mongo elections gone awry.
Read MoreWhen all you have is a hammer, everything looks like a nail. And when you find people abusing puppet exec in order to get their work done, you might want to rethink your permission scheme just a wee bit.
Read MoreSetting up a Zenoss event command to send a message to our NOC IRC channel, since that's where events and errors already get discussed.
Read More