AIX monitoring issue for just some servers
Posted: Thu Jan 04, 2024 9:38 am
I have several AIX servers being monitored successfully but there are about 18 that are having issues. Everything is exactly the same - OS version, RPMs, python versions, etc. I've reinstalled the agent and even reinstalled python.
Using the command line, I get this error:
/usr/local/nagios/libexec/check_ncpa.py -H hostname -t mytoken -P 5693 --list
UNKNOWN: An error occurred connecting to API. (HTTP error: '500 INTERNAL SERVER ERROR')
When I go the GUI on port 5693, I don't see any checks, any live data, and the API just spins and throws an error into the listener log.
Here is what I am seeing in the /usr/local/ncpa/var/log/ncpa_listener.log
2024-01-04 09:29:06,082 8323480 INFO ::ffff:10.15.14.86 - - [2024-01-04 09:29:06] "GET /api/services/?token=mytoken&check=1&service=sshd&status=running HTTP/1.1" 500 2362 0.004727
2024-01-04 09:29:15,245 8323480 ERROR Exception on /api/services/ [GET]
Traceback (most recent call last):
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1817, in wsgi_app
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1477, in full_dispatch_request
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1381, in handle_user_exception
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1475, in full_dispatch_request
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1461, in dispatch_request
File "/tmp/test/ncpa/agent/listener/server.py", line 185, in token_auth_decoration
File "/tmp/test/ncpa/agent/listener/server.py", line 931, in api
File "/tmp/test/ncpa/agent/listener/psapi.py", line 279, in getter
File "/tmp/test/ncpa/agent/listener/psapi.py", line 260, in refresh
File "/tmp/test/ncpa/agent/listener/psapi.py", line 230, in get_root_node
File "/tmp/test/ncpa/agent/listener/psapi.py", line 185, in get_disk_node
File "/opt/freeware/lib/python2.7/site-packages/psutil/__init__.py", line 2133, in disk_partitions
File "/opt/freeware/lib/python2.7/site-packages/psutil/_psaix.py", line 186, in disk_partitions
OSError: [Errno 13] Permission denied
I cannot figure out why most AIX servers work but some do not. Any help would be very much appreciated!
Using the command line, I get this error:
/usr/local/nagios/libexec/check_ncpa.py -H hostname -t mytoken -P 5693 --list
UNKNOWN: An error occurred connecting to API. (HTTP error: '500 INTERNAL SERVER ERROR')
When I go the GUI on port 5693, I don't see any checks, any live data, and the API just spins and throws an error into the listener log.
Here is what I am seeing in the /usr/local/ncpa/var/log/ncpa_listener.log
2024-01-04 09:29:06,082 8323480 INFO ::ffff:10.15.14.86 - - [2024-01-04 09:29:06] "GET /api/services/?token=mytoken&check=1&service=sshd&status=running HTTP/1.1" 500 2362 0.004727
2024-01-04 09:29:15,245 8323480 ERROR Exception on /api/services/ [GET]
Traceback (most recent call last):
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1817, in wsgi_app
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1477, in full_dispatch_request
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1381, in handle_user_exception
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1475, in full_dispatch_request
File "/opt/freeware/lib/python2.7/site-packages/flask/app.py", line 1461, in dispatch_request
File "/tmp/test/ncpa/agent/listener/server.py", line 185, in token_auth_decoration
File "/tmp/test/ncpa/agent/listener/server.py", line 931, in api
File "/tmp/test/ncpa/agent/listener/psapi.py", line 279, in getter
File "/tmp/test/ncpa/agent/listener/psapi.py", line 260, in refresh
File "/tmp/test/ncpa/agent/listener/psapi.py", line 230, in get_root_node
File "/tmp/test/ncpa/agent/listener/psapi.py", line 185, in get_disk_node
File "/opt/freeware/lib/python2.7/site-packages/psutil/__init__.py", line 2133, in disk_partitions
File "/opt/freeware/lib/python2.7/site-packages/psutil/_psaix.py", line 186, in disk_partitions
OSError: [Errno 13] Permission denied
I cannot figure out why most AIX servers work but some do not. Any help would be very much appreciated!