no checks

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
scharft
Posts: 112
Joined: Wed Jul 29, 2015 6:55 am

no checks

Post by scharft »

hi,

today Nagios doesn't do any check (host or service) from ~ 6.30 pm to 9.30pm.
Every Nagios service was running, the System Component Status and Monitoring Engine Status are good.
Maybe the high htime is a indicator?

Service Check Latency
Min 0.00 sec
Max 108.37 sec
Avg 0.19 sec

Service Check Execution Time
Min 0.00 sec
Max 96.16 sec
Avg 1.67 sec

How can find the issue?

A host reboot fixed the issue for the moment but we should find a solution to prevent this in the future happen again.


Best Regards
Thomas
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: no checks

Post by benjaminsmith »

Hi Thomas,

Looking at the check latency, it would suggest that one or more service checks may have not run as scheduled. We'd like to review the logs on the server. Can you send us the current nagios.log, it's in the /usr/local/nagios/var directory?

Also, a fresh system profile and the /var/log/messages file. Thanks.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
User avatar
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: no checks

Post by tgriep »

Thanks for the profile but we need to get the full /usr/local/nagios/var/nagios.log file and the /var/log/messages file for when the issue happened.
I would like to look at this file as well as I saw some Postgres database issues.
/var/lib/pgsql/data/pg_log/postgresql-Wed.log

I took a look at the profile and I see what looks like a passive check sending in lot of bad data. You should fix that.

Code: Select all

At line:1 char:1
+ C:\\MONITORING\\check_passive.ps1 ; exit($lastexitcode)
+ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : ObjectNotFound: (C:\\MONITORING\\check_passive.p 
   s1:String) [], CommandNotFoundEx
n returned error Unknown or unsupported command
[1572433299] Warning: Unrecognized external command ->  CommandNotFoundEx
n;
[1572433299] External command function, script file, or operable program. Check the spelling of the name, or 
if a path was included, verify that the path is correct and try again.
At line:1 char:1
+ C:\\MONITORING\\check_passive.ps1 ; exit($lastexitcode)

In the profile, the nagios process is not running, please start it.

Code: Select all

service nagios start

The kernel message queue needs to be increased. Follow this article to do that.
https://support.nagios.com/kb/article/n ... d-139.html
Be sure to check out our Knowledgebase for helpful articles and solutions!
scharft
Posts: 112
Joined: Wed Jul 29, 2015 6:55 am

Re: no checks

Post by scharft »

i happend again, setting a downtime in nagios was not possible, the production phoned me during my sleep and produced many alarms because they were not able to set the downtime.
So it was not possible to download the system profile before i reboot the server....

the message queues is increased.
The log file from wednesday and today is attached, the nagios.log file is 96mb big, to much for the this upload..
You do not have the required permissions to view the files attached to this post.
scharft
Posts: 112
Joined: Wed Jul 29, 2015 6:55 am

Re: no checks

Post by scharft »

the passive checks every 2nd day in stale state..

enough sending log files to you.

i want a technican who remotely works on our nagios server to fix this issue THIS WEEK
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: no checks

Post by benjaminsmith »

Hello Thomas,

I fully appreciate the inconvenience this has caused you, and would like to maintain your cooperation in this process. We believe you have issues with your passive check configurations, and let's get a ticket opened on this and a remote session.

In regards to the log files, they are necessary for troubleshooting, and also help the technicians prepare for a successful remote. In short, the logs help speed up the time to resolution.

To open a support ticket, please visit.
https://support.nagios.com/tickets/
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked