Nagios Standard Availability Reports

rj-admin2
Posts: 34
Joined: Thu Apr 11, 2024 1:16 am

Nagios Standard Availability Reports

Post by rj-admin2 »

I'm looking for a way of reporting on service availability for selected mission critical services. Is there anything within the standard reporting that would allow me to get this information?

TIA.
lgute
Posts: 420
Joined: Mon Apr 06, 2020 2:49 pm

Re: Nagios Standard Availability Reports

Post by lgute »

Hi @rj-admin2, thanks for reaching out.

Would the "Availability Summary" report work?

You could create a Service Group for your mission critical services and then run the "Availability Summary" report on that Service Group for the period of time you need.
Please let us know if you have any other questions or concerns.

-Laura
rj-admin2
Posts: 34
Joined: Thu Apr 11, 2024 1:16 am

Re: Nagios Standard Availability Reports

Post by rj-admin2 »

Hi,

Many thanks for the reply.

Yeah I was kind of looking at that as a solution. The only problem I have is the host services I'm interested in are not all available for selection. Most are contained in a Host Group along with the same monitored services on our Dev and QA systems. I'm only interested in the services that are run on our Production server(s) for this report. Not sure how I would get round this.


TIA.
swolf

Re: Nagios Standard Availability Reports

Post by swolf »

Hi @rj-admin2,

I would take the exact services you want in the report and add them to a new servicegroup in the CCM. If you need to include some host checks then you could also make a host group that specifically covers your production servers instead.

-Sebastian
rj-admin2
Posts: 34
Joined: Thu Apr 11, 2024 1:16 am

Re: Nagios Standard Availability Reports

Post by rj-admin2 »

Hi @swolf,

Many thanks for the reply.

Sorry are you saying that I should have additional services defined just for the new servicegroup in CCM for reporting on?

Also, in terms of the Executive Summary Report, although we get "Top Alert Producers" and "Latest Alerts" listed in the report tables, we tend to find the Alert Histogram shows no data. Is there possibly a bug in this report or am I reading it incorrectly?

Finally is there a way of removing the "Critical" service availability from the report - although it may be in a critical state, it does not necessarily mean the production service is down and unavailable?

Apologies for the additional questions on this.

TIA.
snapier3
Posts: 144
Joined: Tue Apr 23, 2019 7:12 pm

Re: Nagios Standard Availability Reports

Post by snapier3 »

I ended up going to the version 1 API to get the data for my reporting.

My primary reasoning is that the "limited queries" available for each endpoint in the API allow a more granular selection of non-grouped objects and tend to give you faster results than going through the UI.

Something like this for the current service status:

objects/servicestatus?host_name=in:server1,server2,server42&service_description=in:service-one,service-two,service-order66

I parse the results to CSV and then pass them to other Python data analysis tools to visualize the results.
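
As a rough illustration of the approach described above, the Python sketch below builds the limited query and flattens returned records to CSV. The base path (`/nagiosxi/api/v1`), the `apikey` parameter, and the field names `host_name`, `service_description`, and `current_state` are assumptions about a typical XI install, and the function names are hypothetical. Check all of them against the API documentation on your own instance.

```python
import csv
import io
from urllib.parse import urlencode


def build_servicestatus_url(base_url, api_key, hosts, services):
    """Build a servicestatus query using the `in:` limited-query filter."""
    params = {
        "apikey": api_key,  # assumed name of the authentication parameter
        "host_name": "in:" + ",".join(hosts),
        "service_description": "in:" + ",".join(services),
    }
    # Keep ':' and ',' unescaped so the filter reads like the example above.
    query = urlencode(params, safe=":,")
    return f"{base_url.rstrip('/')}/objects/servicestatus?{query}"


def records_to_csv(records, fields):
    """Write a list of status dicts to CSV text, keeping only `fields`."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for record in records:
        # Missing fields become empty cells; extra fields are dropped.
        writer.writerow({name: record.get(name, "") for name in fields})
    return buf.getvalue()
```

In practice you would fetch the URL (for example with `urllib.request.urlopen`), decode the JSON response, and pass the list of service records to `records_to_csv`.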

There are also the serviceavailability and hostavailability endpoints in the API, but I have not explored the extent to which the limited queries can be used to data mine for multiple objects that are not grouped in XI.

--SN
sgardil
Posts: 355
Joined: Wed Aug 09, 2023 9:58 am

Re: Nagios Standard Availability Reports

Post by sgardil »

rj-admin2 wrote: Wed Jun 19, 2024 1:11 am
Hi @swolf,

Many thanks for the reply.

Sorry are you saying that I should have additional services defined just for the new servicegroup in CCM for reporting on?

Also, in terms of the Executive Summary Report, although we get "Top Alert Producers" and "Latest Alerts" listed in the report tables, we tend to find the Alert Histogram shows no data. Is there possibly a bug in this report or am I reading it incorrectly?

Finally is there a way of removing the "Critical" service availability from the report - although it may be in a critical state, it does not necessarily mean the production service is down and unavailable?

Apologies for the additional questions on this.

TIA.

Hey @rj-admin2

You'll want to make a new service group and in that service group you want to add the selected services that you want a separate availability report for.

The Alert Histogram should display data. What XI version are you on and what distro are you using? Also, do you have any information in your PHP logs?

I am a little confused by this last question. From the first response it sounded like you wanted an availability report for some services that were in a critical state, so this last question seems a little conflicting to me. Is it that you just don't like the word "critical" being used for this? You are correct that a critical state doesn't mean the service is down or unavailable. It just means that it is exceeding the critical threshold that was set when the service was created.

-SG
rj-admin2
Posts: 34
Joined: Thu Apr 11, 2024 1:16 am

Re: Nagios Standard Availability Reports

Post by rj-admin2 »

Hi @sgardil,

Apologies for the confusion.

Basically I'm really only interested in the percentage of actual downtime, which as you have pointed out does not necessarily include the length of time services are in a critical state. So for instance we have high CPU and low free disk space on a consistent basis for some of our production systems. This doesn't mean users cannot use the systems - so for these reports I'm really only interested in the percentage of when the system(s) could not be accessed due to a mission critical service being offline (e.g., the Windows IIS service being offline on our production web server would stop access to our customer and supplier portals). Hope this helps to make a bit more sense.
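
To make the "percentage of actual downtime" idea concrete, here is a minimal Python sketch. It assumes you have already extracted the outage windows for the mission critical services yourself (for example from state history), as `(start, end)` offsets in seconds within the reporting period. The function name and input format are hypothetical and not part of Nagios XI. Overlapping outages are merged so that two services being offline at the same time are not counted twice.

```python
def availability_percent(period_seconds, outages):
    """Return availability (%) over a period, given (start, end) outage windows."""
    if period_seconds <= 0:
        raise ValueError("period_seconds must be positive")

    down = 0
    cur_start = cur_end = None
    for start, end in sorted(outages):
        # Clamp each window to the reporting period.
        start, end = max(0, start), min(period_seconds, end)
        if end <= start:
            continue
        if cur_end is None or start > cur_end:
            # Gap before this window: close out the previous merged outage.
            if cur_end is not None:
                down += cur_end - cur_start
            cur_start, cur_end = start, end
        else:
            # Overlaps the current merged outage: extend it.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        down += cur_end - cur_start

    return 100.0 * (period_seconds - down) / period_seconds
```

For a one-day period (86400 seconds) with outages at `(3600, 7200)` and `(5400, 9000)`, the two windows merge into a single 5400-second outage, so availability is 93.75%.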

In terms of the service group - I agree I should create a new one for the reporting. What I was driving at with the services was that we have multiple servers (Development, QA and Production) all running the same services like IIS, SQL Server etc. As things stand, from a configuration viewpoint, specific services are already contained in a relevant host group and all we do is add the hosts into that group to get the relevant services defined. This saves having to define the same individual services for each host.

We are currently using Nagios XI 2024R1.1.3 running on Ubuntu 20.04.6 LTS. I can't seem to find a /var/log/httpd folder, so I'm not sure where the PHP log file would be.

TIA.
ajcoil
Posts: 108
Joined: Wed Aug 09, 2023 10:30 am

Re: Nagios Standard Availability Reports

Post by ajcoil »

Hello @rj-admin2,

Thank you for reaching out!

Since you are using Ubuntu 20, the PHP log files should be located in the following directory:

/var/log/apache2/

Do let us know what you find!
rj-admin2
Posts: 34
Joined: Thu Apr 11, 2024 1:16 am

Re: Nagios Standard Availability Reports

Post by rj-admin2 »

Hi @ajcoil,

Many thanks for the reply.

Unfortunately I do not see any PHP log files in that directory - just access.log* and error.log* files.


TIA.
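
One thing that may be worth checking: when PHP runs under Apache and no separate `error_log` is set in php.ini, PHP messages are often written into Apache's own error.log rather than a dedicated PHP file. Whether that applies here is an assumption about this particular install. The Python sketch below (with a hypothetical helper name) filters such lines out of an error log so they are easier to spot.

```python
import re

# PHP messages in an Apache error log typically carry one of these prefixes.
PHP_LINE = re.compile(r"PHP (?:Fatal error|Parse error|Warning|Notice|Deprecated)")


def php_messages(lines):
    """Return only the log lines that look like PHP errors or warnings."""
    return [line.rstrip("\n") for line in lines if PHP_LINE.search(line)]


# Example usage (path taken from the reply above; may need sudo to read):
# with open("/var/log/apache2/error.log", errors="replace") as fh:
#     for message in php_messages(fh):
#         print(message)
```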