Hi,
For a few months now I've been using SimpleMonitor to monitor 150+ hosts from a Windows Server.
The setup is very basic: a ping every minute, plus Pushover notifications and an HTML status page:
[HostName]
type=ping
host=172.x.y.z
tolerance=5
It works fine, but I've noticed that when a host is down for a long time, another host is often reported as going up and down every 10 to 15 minutes, even though no packets were actually lost (I checked by pinging it directly from the command line).
The false positives seem to affect the host defined immediately before the one that is really down in the configuration file.
For example:
# Host reported as flapping even though it is UP
[Host-A]
type=ping
host=172.x.y.z
tolerance=5
# Host that has been DOWN for a long time
[Host-B]
type=ping
host=172.x.y.z
tolerance=5
If I comment out the Host-B configuration, the problem disappears.
My Python knowledge is very limited, so I haven't gone through the code to find where the problem could be.
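In case it helps to reproduce this outside SimpleMonitor, here is a rough sketch of an independent check. It is not SimpleMonitor's code. It pings the hosts in the same order as the configuration file and counts how often each one changes state. The names `ping_once`, `count_transitions` and `watch` are made up for illustration, and the ping flags assume the stock Windows and Linux `ping` commands.

```python
import platform
import subprocess
import time
from datetime import datetime


def ping_once(host, timeout_s=5):
    """Return True if a single ping to `host` exits with status 0."""
    # Assumption: Windows ping takes -n (count) and -w (timeout in ms),
    # Linux ping takes -c (count) and -W (timeout in seconds).
    if platform.system() == "Windows":
        cmd = ["ping", "-n", "1", "-w", str(timeout_s * 1000), host]
    else:
        cmd = ["ping", "-c", "1", "-W", str(timeout_s), host]
    try:
        result = subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s + 5,
        )
    except (subprocess.TimeoutExpired, OSError):
        return False
    # The exit status is only a coarse signal of reachability.
    return result.returncode == 0


def count_transitions(samples):
    """Count UP/DOWN state changes in a list of booleans."""
    return sum(1 for prev, cur in zip(samples, samples[1:]) if prev != cur)


def watch(hosts, rounds, interval_s=60, ping=ping_once):
    """Ping every host in order, `rounds` times, and return the history."""
    history = {host: [] for host in hosts}
    for i in range(rounds):
        for host in hosts:
            up = ping(host)
            history[host].append(up)
            stamp = datetime.now().isoformat(timespec="seconds")
            print(f"{stamp} {host} {'UP' if up else 'DOWN'}")
        if i < rounds - 1:
            time.sleep(interval_s)
    return history


# Example usage (replace the placeholders with the real addresses):
# history = watch(["<Host-A address>", "<Host-B address>"], rounds=30)
# for host, samples in history.items():
#     print(host, "state changes:", count_transitions(samples))
```

If Host-A shows zero state changes here while SimpleMonitor still reports it flapping, that would suggest the problem is in how the monitor handles the results rather than in the network.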
Thanks