- s/cntFailures/cntNotify/; - emit status only on state transitions, can get noisy if testing once a minute in crontab - uses file names to track node status
bash script to ping a series a nodes, and on failure, generates a traceroute, then generates an alert