We're having a consistent issue with a group of AWS instances (70) that get launched at the same time.
Because they're launched at the same time, the bootstrap (User Data) step that sends a request to XI to add the instance as a host runs on all of them within roughly the same minute. I'm wondering whether Nagios XI is getting overloaded with API calls and, because of this, is not acknowledging/queuing some of them.
ApplyConfig is NOT performed in each of the API calls. A single API call to apply config is sent after all the instances have completed their provisioning steps. On average there is an 8-minute gap between when the instances send the Add Host API call and when the apply config call is sent.
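If the burst of Add Host calls does turn out to be the problem, one workaround would be to spread the calls across part of that gap instead of sending them all within the same minute. Below is a minimal sketch of the idea in Python (our actual bootstrap is PowerShell, so this only illustrates the logic). The function name `jittered_delay`, the 300-second window and the sample instance ID are assumptions for illustration, not anything from Nagios XI or our bootstrap.

```python
import hashlib


def jittered_delay(instance_id: str, window_seconds: float = 300.0) -> float:
    """Map an instance ID to a stable delay in [0, window_seconds).

    Hypothetical helper: each instance would sleep for this many seconds
    before sending its Add Host call, so that 70 instances launched
    together do not all hit the API within the same minute.
    """
    digest = hashlib.sha256(instance_id.encode("utf-8")).digest()
    # Take 48 bits of the hash and scale them to a fraction in [0, 1).
    fraction = int.from_bytes(digest[:6], "big") / 2**48
    return fraction * window_seconds


if __name__ == "__main__":
    # Example instance ID (made up); the bootstrap would use the real one.
    print(jittered_delay("i-0123456789abcdef0"))
```

Each instance would sleep for the returned number of seconds before making its Add Host request. Deriving the delay from the instance ID rather than a random number keeps it repeatable if the bootstrap step is rerun. Whether this actually avoids the dropped hosts is untested.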
For instance, I just launched another ASG (Auto Scaling Group) of 70 instances, and only 46 of them showed up in Nagios XI. The bootstrap results for the instances that did not show up after the config was applied show that the call was sent successfully. The result looks like this on each of the instances (the host name is different on each, of course):
StatusCode : 200
StatusDescription : OK
Content : {
"success": "Added SWSIM-015E94 to the system. Config imported but not yet applied."
}
RawContent : HTTP/1.1 200 OK
Access-Control-Allow-Orgin: *
Access-Control-Allow-Methods: *
Keep-Alive: timeout=5, max=100
Connection: Keep-Alive
Content-Length: 92
Content-Type: application/json
Date: Fri, ...
Forms : {}
Headers : {[Access-Control-Allow-Orgin, *], [Access-Control-Allow-Methods, *], [Keep-Alive, timeout=5,
max=100], [Connection, Keep-Alive]...}
Images : {}
InputFields : {}
Links : {}
ParsedHtml : System.__ComObject
RawContentLength : 92
https://support.nagios.com/forum/viewto ... 16&t=57112
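Since every instance reports a 200 with a success message, the response alone can't tell us which hosts were dropped. A check we could run just before the apply config call is to compare the host names we expect against the ones XI actually lists. Here is a minimal sketch in Python, assuming both lists have already been collected (how the registered names get pulled from XI is not shown). The function name `find_missing_hosts` and the second host name in the example are hypothetical.

```python
from typing import Iterable, List


def find_missing_hosts(expected: Iterable[str], registered: Iterable[str]) -> List[str]:
    """Return the expected host names that are absent from `registered`.

    Host names are compared exactly (case-sensitive). The result is sorted
    so the output is stable between runs.
    """
    registered_set = set(registered)
    return sorted(name for name in set(expected) if name not in registered_set)


if __name__ == "__main__":
    # "SWSIM-AAAAAA" is a made-up name used only for this example.
    expected = ["SWSIM-015E94", "SWSIM-AAAAAA"]
    registered = ["SWSIM-015E94"]
    print(find_missing_hosts(expected, registered))
```

Any names returned could have their Add Host call resent before apply config is triggered, which would at least work around the problem while the root cause is investigated.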
Please advise,
Nagios XI Version: 5.6.9
AWS instance type: c5.4xlarge