Nagios Database is misbehaving abnormal i.e. Host/Services showing down and in actual those services are up

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Post Reply
riya_g2
Posts: 1
Joined: Mon Jan 27, 2025 4:54 am

Nagios Database is misbehaving abnormal i.e. Host/Services showing down and in actual those services are up

Post by riya_g2 »

We are experiencing constant abnormalities in my Nagios server.

I have configured host and services of multiple clients of multiple location through NCPA and all hosts(which are virtual machines at the backend) are communicating through NCPA agent for both Windows and LINUX(mostly CENTOS) of 64-bit OS.

What we are facing from some days is that hosts and services shows down at NAGIOS XI some shows socket time out or some shows service check time out but when we chcek the host by accessing them, NCPA service must be running fine on the host they are reachable at that time but shows down on nagios xi and triggering the alerts and no latency is also there in accessing the host from within the nagios server.

Tried changing service check time out parameter but still issue persists, kindly help me in finding the root cause.
gwesterman
Posts: 269
Joined: Wed Aug 23, 2023 11:29 am

Re: Nagios Database is misbehaving abnormal i.e. Host/Services showing down and in actual those services are up

Post by gwesterman »

Hi @riya_g2,

How are you determining that the hosts marked as down in XI are reachable at that moment? Are you sure they aren't going down and then recovering before you can check?

If it is in fact just going down and you would like to avoid false positives in XI, you can change the check settings in the host's config.

Let us know what you find.

Thank you!
DoubleDoubleA
Posts: 286
Joined: Thu Feb 09, 2017 5:07 pm

Re: Nagios Database is misbehaving abnormal i.e. Host/Services showing down and in actual those services are up

Post by DoubleDoubleA »

Also it may help if you post the exact text of the error.
Post Reply