Nagios Standard Availability Reports

Posted: Tue Jun 11, 2024 4:03 am
by rj-admin2
I'm looking for a way of reporting on service availability for selected mission critical services. Is there anything within the standard reporting that would allow me to get this information?

TIA.

Re: Nagios Standard Availability Reports

Posted: Tue Jun 11, 2024 2:44 pm
by lgute
Hi @rj-admin2, thanks for reaching out.

Would the "Availability Summary" report work?

You could create a Service Group for your mission critical services and then run the "Availability Summary" report on that Service Group for the period of time you need.

Re: Nagios Standard Availability Reports

Posted: Wed Jun 12, 2024 1:53 am
by rj-admin2
Hi,

Many thanks for the reply.

Yeah I was kind of looking at that as a solution. The only problem I have is the host services I'm interested in are not all available for selection. Most are contained in a Host Group along with the same monitored services on our Dev and QA systems. I'm only interested in the services that are run on our Production server(s) for this report. Not sure how I would get round this.


TIA.

Re: Nagios Standard Availability Reports

Posted: Wed Jun 12, 2024 11:06 am
by swolf
Hi @rj-admin2,

I would take the exact services you want in the report and add them to a new servicegroup in the CCM. If you need to include some host checks then you could also make a host group that specifically covers your production servers instead.

-Sebastian

Re: Nagios Standard Availability Reports

Posted: Wed Jun 19, 2024 1:11 am
by rj-admin2
Hi @swolf,

Many thanks for the reply.

Sorry are you saying that I should have additional services defined just for the new servicegroup in CCM for reporting on?

Also, in terms of the Executive Summary Report, although we get "Top Alert Producers" and "Latest Alerts" listed in the report tables, we tend to find the Alert Histogram shows no data. Is there possibly a bug in this report or am I reading it incorrectly?

Finally is there a way of removing the "Critical" service availability from the report - although it may be in a critical state, it does not necessarily mean the production service is down and unavailable?

Apologies for the additional questions on this.

TIA.

Re: Nagios Standard Availability Reports

Posted: Wed Jun 19, 2024 10:13 am
by snapier3
I ended up going to the version1 API to get the data for my reporting.

My primary reasoning is that the "limited queries" available for your endpoint in the API allow a more granular selection of non-grouped objects and tend to give you faster results than going through the UI.

Something like this for the current service status:

objects/servicestatus?host_name=in:server1,server2,server42&service_description=in:service-one,service-two,service-order66
I parse the results to CSV and then pass them to other Python data analysis tools to visualize the results.
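
A minimal sketch of that workflow, under stated assumptions: the base path /nagiosxi/api/v1 and the apikey parameter follow the usual Nagios XI API layout, while the host name, the key placeholder, and the helper names build_servicestatus_url and records_to_csv are hypothetical. The field names passed to the CSV helper are also assumptions, so check them against what your own endpoint returns.

```python
import csv
import io
from urllib.parse import urlencode

# Assumed values for illustration only: substitute your own server and key.
BASE_URL = "https://nagiosxi.example.com/nagiosxi/api/v1"  # hypothetical host
API_KEY = "YOUR_API_KEY"  # placeholder, not a real key


def build_servicestatus_url(hosts, services, base_url=BASE_URL, api_key=API_KEY):
    """Build an objects/servicestatus URL using the in: filters shown above."""
    params = {
        "apikey": api_key,
        "host_name": "in:" + ",".join(hosts),
        "service_description": "in:" + ",".join(services),
    }
    # Leave ':' and ',' unescaped so the query reads like the hand-written one.
    return f"{base_url}/objects/servicestatus?{urlencode(params, safe=':,')}"


def records_to_csv(records, fields):
    """Write the chosen fields from a list of status dicts to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for record in records:
        # Missing fields become empty cells instead of raising an error.
        writer.writerow({name: record.get(name, "") for name in fields})
    return buffer.getvalue()
```

To use it, fetch the URL (for example with urllib.request.urlopen and json.load), pull the list of service records out of the response, and pass that list to records_to_csv. The exact layout of the JSON response may differ between XI versions, so inspect it once before relying on any key names.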

There are also the serviceavailability and hostavailability endpoints in the API, but I have not explored the extent to which the limited queries can be used to data mine for multiple objects that are not grouped in XI.

--SN

Re: Nagios Standard Availability Reports

Posted: Wed Jun 19, 2024 2:13 pm
by sgardil
Hey @rj-admin2

You'll want to make a new service group and add to it the selected services that you want a separate availability report for.

The Alert Histogram should display data. What XI version are you on and what distro are you using? Also, do you have any information in your PHP logs?

I am a little confused by this last question. From the first response it sounded like you wanted an availability report for some services that were in a critical state, so this last question seems a little conflicting to me. Is it that you just don't like the word "critical" being used for this? It's correct that a critical state doesn't mean the service is down or unavailable. It just means that it's exceeding the critical threshold that was set when the service was created.

-SG

Re: Nagios Standard Availability Reports

Posted: Thu Jun 20, 2024 12:35 am
by rj-admin2
Hi @sgardil,

Apologies for the confusion.

Basically I'm really only interested in the percentage of actual downtime, which as you have pointed out does not necessarily include the length of time services are in a critical state. So for instance we have high CPU and low free disk space on a consistent basis for some of our production systems. This doesn't mean users cannot use the systems - so for these reports I'm really only interested in the percentage of when the system(s) could not be accessed due to a mission critical service being offline (e.g., the Windows IIS service being offline on our production web server would stop access to our customer and supplier portals). Hope this helps to make a bit more sense.

In terms of the service group - I agree I should create a new one for the reporting. What I was driving at with the services was that we have multiple servers (Development, QA and Production) all running the same services like IIS, SQL Server etc. As things stand, from a configuration viewpoint, specific services are already contained in a relevant host group and all we do is add the hosts into that group to get the relevant services defined. This saves having to define the same individual services for each host.

We are currently using Nagios XI 2024R1.1.3 running on Ubuntu 20.04.6 LTS. I can't seem to find a /var/log/httpd folder so not sure where the php log file would exist?

TIA.

Re: Nagios Standard Availability Reports

Posted: Fri Jun 21, 2024 2:01 pm
by ajcoil
Hello @rj-admin2,

Thank you for reaching out!

Since you are using Ubuntu 20, the PHP log files should be located in the following directory:

/var/log/apache2/
Do let us know what you find!

Re: Nagios Standard Availability Reports

Posted: Mon Jun 24, 2024 2:59 am
by rj-admin2
Hi @ajcoil,

Many thanks for the reply.

Unfortunately I do not see any PHP log files in that directory - just access.log* and error.log* files.


TIA.
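
One possibility, not confirmed in this thread: when PHP runs inside Apache and no separate error_log is set in php.ini, PHP messages are typically written into Apache's own error log rather than a dedicated PHP file. Assuming that applies here, a small sketch like the following could pull the PHP entries out of error.log. The helper names php_entries and scan_log are hypothetical, and the list of message prefixes is an assumption based on PHP's usual log wording.

```python
import re

# Matches the "PHP Warning:" / "PHP Fatal error:" style prefixes that PHP
# uses when it logs through the web server (assumed wording).
PHP_MARKER = re.compile(
    r"\bPHP (?:Fatal error|Parse error|Warning|Notice|Deprecated)\b"
)


def php_entries(lines):
    """Return only the log lines that look like PHP messages."""
    return [line.rstrip("\n") for line in lines if PHP_MARKER.search(line)]


def scan_log(path="/var/log/apache2/error.log"):
    """Read an Apache error log and return its PHP-related lines."""
    # Reading this file may require root privileges on Ubuntu.
    with open(path, errors="replace") as log:
        return php_entries(log)
```

Calling scan_log() on the XI server and printing the result would show whether any PHP warnings or errors were recorded around the time the Executive Summary report was generated. Rotated files such as error.log.1 can be passed in as the path argument.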