Use Nagios and Xen

Box293
Too Basu
Posts: 5126
Joined: Sun Feb 07, 2010 10:55 pm
Location: Deniliquin, Australia

Re: Use Nagios and Xen

Post by Box293 »

What is the output when you execute this command:

Code: Select all

/usr/local/nagios/bin/nagios -ud /usr/local/nagios/etc/nagios.cfg
Is there anything in the logs?

Code: Select all

tail -n 30 /usr/local/nagios/var/nagios.log
tail -n 30 /var/log/messages
hack3rcon
Posts: 27
Joined: Sat Jul 16, 2016 9:50 am

Re: Use Nagios and Xen

Post by hack3rcon »

The results are:

Code: Select all

[root@localhost ~]# /usr/local/nagios/bin/nagios -ud /usr/local/nagios/etc/nagios.cfg
[root@localhost ~]#

Code: Select all

[root@localhost ~]# tail -n 30 /usr/local/nagios/var/nagios.log
[1469430347] Nagios 4.0.1 starting... (PID=13788)
[1469430347] Local time is Mon Jul 25 03:05:47 EDT 2016
[1469430347] LOG VERSION: 2.0
[1469430347] qh: Socket '/usr/local/nagios/var/rw/nagios.qh' successfully initialized
[1469430347] qh: core query handler registered
[1469430347] nerd: Channel hostchecks registered successfully
[1469430347] nerd: Channel servicechecks registered successfully
[1469430347] nerd: Channel opathchecks registered successfully
[1469430347] nerd: Fully initialized and ready to rock!
[1469430347] wproc: Successfully registered manager as @wproc with query handler
[1469430347] wproc: Registry request: name=Core Worker 13789;pid=13789
[1469430347] wproc: Registry request: name=Core Worker 13790;pid=13790
[1469430347] wproc: Registry request: name=Core Worker 13791;pid=13791
[1469430347] wproc: Registry request: name=Core Worker 13792;pid=13792
[1469430347] wproc: Registry request: name=Core Worker 13793;pid=13793
[1469430347] wproc: Registry request: name=Core Worker 13794;pid=13794
[1469430347] wproc: Registry request: name=Core Worker 13796;pid=13796
[1469430347] wproc: Registry request: name=Core Worker 13795;pid=13795
[1469430347] wproc: Registry request: name=Core Worker 13798;pid=13798
[1469430347] wproc: Registry request: name=Core Worker 13797;pid=13797
[1469430347] wproc: Registry request: name=Core Worker 13800;pid=13800
[1469430347] wproc: Registry request: name=Core Worker 13799;pid=13799
[1469430347] Successfully launched command file worker with pid 13801
[1469430645] SERVICE NOTIFICATION: nagiosadmin;localhost;Root Partition;CRITICAL;notify-service-by-email;DISK CRITICAL - free space: / 10770 MB (3% inode=99%):
[1469430645] wproc: NOTIFY job 1916 from worker Core Worker 1457 is a non-check helper but exited with return code 127
[1469430645] wproc:   command: /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Root Partition\nHost: localhost\nAddress: 127.0.0.1\nState: CRITICAL\n\nDate/Time: Mon Jul 25 03:10:45 EDT 2016\n\nAdditional Info:\n\nDISK CRITICAL - free space: / 10770 MB (3% inode=99%):\n" | /bin/mail -s "** PROBLEM Service Alert: localhost/Root Partition is CRITICAL **" nagios@localhost
[1469430645] wproc:   host=localhost; service=Root Partition; contact=nagiosadmin
[1469430645] wproc:   early_timeout=0; exited_ok=1; wait_status=32512; error_code=0;
[1469430645] wproc:   stderr line 01: /bin/sh: /bin/mail: No such file or directory
[1469430645] wproc:   stderr line 02: /usr/bin/printf: write error: Broken pipe

Code: Select all

[root@localhost ~]# tail -n 30 /var/log/messages
Jul 25 03:07:35 localhost nagios: nagios (pid 13953 1493 1459 1458 1457 1456 1455 1454 1453 1452 1451 1450 1449 1448 1444) is running...
Jul 25 03:07:35 localhost systemd: nagios.service: PID file /var/nagios/nagios.pid not readable (yet?) after start: No such file or directory
Jul 25 03:08:25 localhost audit: CRYPTO_KEY_USER pid=14015 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=14015 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:08:25 localhost audit: CRYPTO_KEY_USER pid=14015 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=14015 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:08:25 localhost audit: CRYPTO_KEY_USER pid=14015 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=14015 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:08:25 localhost audit: CRYPTO_KEY_USER pid=14014 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=14015 suid=74  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:08:25 localhost audit: CRYPTO_KEY_USER pid=14014 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=14014 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:08:25 localhost audit: CRYPTO_KEY_USER pid=14014 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=14014 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:08:25 localhost audit: CRYPTO_KEY_USER pid=14014 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=14014 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:08:25 localhost audit: USER_LOGIN pid=14014 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=login acct="(unknown)" exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=ssh res=failed'
Jul 25 03:10:45 localhost nagios: SERVICE NOTIFICATION: nagiosadmin;localhost;Root Partition;CRITICAL;notify-service-by-email;DISK CRITICAL - free space: / 10770 MB (3% inode=99%):
Jul 25 03:10:45 localhost nagios: wproc: NOTIFY job 1916 from worker Core Worker 1457 is a non-check helper but exited with return code 127
Jul 25 03:10:45 localhost nagios: wproc:   command: /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Root Partition\nHost: localhost\nAddress: 127.0.0.1\nState: CRITICAL\n\nDate/Time: Mon Jul 25 03:10:45 EDT 2016\n\nAdditional Info:\n\nDISK CRITICAL - free space: / 10770 MB (3% inode=99%):\n" | /bin/mail -s "** PROBLEM Service Alert: localhost/Root Partition is CRITICAL **" nagios@localhost
Jul 25 03:10:45 localhost nagios: wproc:   host=localhost; service=Root Partition; contact=nagiosadmin
Jul 25 03:10:45 localhost nagios: wproc:   early_timeout=0; exited_ok=1; wait_status=32512; error_code=0;
Jul 25 03:10:45 localhost nagios: wproc:   stderr line 01: /bin/sh: /bin/mail: No such file or directory
Jul 25 03:10:45 localhost nagios: wproc:   stderr line 02: /usr/bin/printf: write error: Broken pipe
Jul 25 03:12:36 localhost systemd: nagios.service: Start operation timed out. Terminating.
Jul 25 03:12:36 localhost systemd: Failed to start LSB: start and stop Nagios monitoring server.
Jul 25 03:12:36 localhost systemd: nagios.service: Unit entered failed state.
Jul 25 03:12:36 localhost audit: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=nagios comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
Jul 25 03:12:36 localhost systemd: nagios.service: Failed with result 'timeout'.
Jul 25 03:13:25 localhost audit: CRYPTO_KEY_USER pid=14295 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=14295 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:13:25 localhost audit: CRYPTO_KEY_USER pid=14295 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=14295 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:13:25 localhost audit: CRYPTO_KEY_USER pid=14295 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=14295 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:13:25 localhost audit: CRYPTO_KEY_USER pid=14294 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=14295 suid=74  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:13:25 localhost audit: CRYPTO_KEY_USER pid=14294 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=14294 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:13:25 localhost audit: CRYPTO_KEY_USER pid=14294 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=14294 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:13:25 localhost audit: CRYPTO_KEY_USER pid=14294 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=14294 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 25 03:13:25 localhost audit: USER_LOGIN pid=14294 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=login acct="(unknown)" exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=ssh res=failed'
hack3rcon
Posts: 27
Joined: Sat Jul 16, 2016 9:50 am

Re: Use Nagios and Xen

Post by hack3rcon »

Any ideas?
tgriep
Madmin
Posts: 9177
Joined: Thu Oct 30, 2014 9:02 am

Re: Use Nagios and Xen

Post by tgriep »

It looks like you need to install the /bin/mail application on the Nagios server.
When Nagios sends a notification, it uses that application to create the email that is sent to the contact.
Install it, then run the test on the server again to see whether the error is gone and the Nagios daemon stays running.
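
As a rough illustration of how the log lines point to a missing binary: return code 127 is what the shell reports when it cannot find the command, and on Linux wait_status carries the exit code in its upper byte, so 32512 decodes to the same 127. decode_wait_status is a hypothetical helper name used only for this sketch, and /nonexistent/mail is a made-up path.

```shell
# Hypothetical helper: extract the exit code from a raw wait status.
# On Linux the exit code is stored in bits 8-15 of the status word.
decode_wait_status() {
    echo $(( ($1 >> 8) & 255 ))
}

# wait_status=32512 from the nagios.log entry
decode_wait_status 32512

# Running a path that does not exist gives the same exit code from the shell.
# /nonexistent/mail is a made-up path used only for this demonstration.
sh -c '/nonexistent/mail' 2>/dev/null || echo "exit code: $?"
```

Both lines should print 127, which matches the "No such file or directory" message for /bin/mail in the log.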
hack3rcon
Posts: 27
Joined: Sat Jul 16, 2016 9:50 am

Re: Use Nagios and Xen

Post by hack3rcon »

OK.
I used the commands below:

Code: Select all

# yum provides /bin/mail
# yum -y install mailx
But the problem is not solved. Can I disable it in Nagios?

Code: Select all

[root@localhost objects]# tail -n 30 /usr/local/nagios/var/nagios.log
[1469505600] LOG ROTATION: DAILY
[1469505600] LOG VERSION: 2.0
[1469505600] CURRENT HOST STATE: localhost;UP;HARD;1;PING OK - Packet loss = 0%, RTA = 0.05 ms
[1469505600] CURRENT SERVICE STATE: localhost;Current Load;OK;HARD;1;OK - load average: 0.00, 0.01, 0.05
[1469505600] CURRENT SERVICE STATE: localhost;Current Users;OK;HARD;1;USERS OK - 0 users currently logged in
[1469505600] CURRENT SERVICE STATE: localhost;HTTP;WARNING;HARD;4;HTTP WARNING: HTTP/1.1 403 Forbidden - 4892 bytes in 0.001 second response time
[1469505600] CURRENT SERVICE STATE: localhost;PING;OK;HARD;1;PING OK - Packet loss = 0%, RTA = 0.05 ms
[1469505600] CURRENT SERVICE STATE: localhost;Root Partition;CRITICAL;HARD;4;DISK CRITICAL - free space: / 10760 MB (3% inode=99%):
[1469505600] CURRENT SERVICE STATE: localhost;SSH;OK;HARD;1;SSH OK - OpenSSH_7.2 (protocol 2.0)
[1469505600] CURRENT SERVICE STATE: localhost;Swap Usage;OK;HARD;1;SWAP OK - 100% free (16383 MB out of 16383 MB)
[1469505600] CURRENT SERVICE STATE: localhost;Total Processes;OK;HARD;1;PROCS OK: 95 processes with STATE = RSZDT
[1469505925] Auto-save of retention data completed successfully.
[1469506245] SERVICE NOTIFICATION: nagiosadmin;localhost;Root Partition;CRITICAL;notify-service-by-email;DISK CRITICAL - free space: / 10760 MB (3% inode=99%):
[1469506245] wproc: NOTIFY job 2128 from worker Core Worker 1454 is a non-check helper but exited with return code 127
[1469506245] wproc:   command: /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Root Partition\nHost: localhost\nAddress: 127.0.0.1\nState: CRITICAL\n\nDate/Time: Tue Jul 26 00:10:45 EDT 2016\n\nAdditional Info:\n\nDISK CRITICAL - free space: / 10760 MB (3% inode=99%):\n" | /bin/mail -s "** PROBLEM Service Alert: localhost/Root Partition is CRITICAL **" nagios@localhost
[1469506245] wproc:   host=localhost; service=Root Partition; contact=nagiosadmin
[1469506245] wproc:   early_timeout=0; exited_ok=1; wait_status=32512; error_code=0;
[1469506245] wproc:   stderr line 01: /bin/sh: /bin/mail: No such file or directory
[1469506245] wproc:   stderr line 02: /usr/bin/printf: write error: Broken pipe
[1469509525] Auto-save of retention data completed successfully.
[1469509845] SERVICE NOTIFICATION: nagiosadmin;localhost;Root Partition;CRITICAL;notify-service-by-email;DISK CRITICAL - free space: / 10760 MB (3% inode=99%):
[1469509845] wproc: NOTIFY job 2138 from worker Core Worker 1453 is a non-check helper but exited with return code 127
[1469509845] wproc:   command: /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Root Partition\nHost: localhost\nAddress: 127.0.0.1\nState: CRITICAL\n\nDate/Time: Tue Jul 26 01:10:45 EDT 2016\n\nAdditional Info:\n\nDISK CRITICAL - free space: / 10760 MB (3% inode=99%):\n" | /bin/mail -s "** PROBLEM Service Alert: localhost/Root Partition is CRITICAL **" nagios@localhost
[1469509845] wproc:   host=localhost; service=Root Partition; contact=nagiosadmin
[1469509845] wproc:   early_timeout=0; exited_ok=1; wait_status=32512; error_code=0;
[1469509845] wproc:   stderr line 01: /bin/sh: /bin/mail: No such file or directory
[1469509845] wproc:   stderr line 02: /usr/bin/printf: write error: Broken pipe

Code: Select all

[root@localhost objects]# tail -n 30 /var/log/messages
Jul 26 01:43:08 localhost audit: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=nagios comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 26 01:43:08 localhost systemd: Starting LSB: start and stop Nagios monitoring server...
Jul 26 01:43:08 localhost nagios: nagios (pid 21972 1493 1459 1458 1457 1456 1455 1454 1453 1452 1451 1450 1449 1448 1444) is running...
Jul 26 01:43:08 localhost systemd: nagios.service: PID file /var/nagios/nagios.pid not readable (yet?) after start: No such file or directory
Jul 26 01:43:25 localhost audit: CRYPTO_KEY_USER pid=22008 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=22008 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:43:25 localhost audit: CRYPTO_KEY_USER pid=22008 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=22008 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:43:25 localhost audit: CRYPTO_KEY_USER pid=22008 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=22008 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:43:25 localhost audit: CRYPTO_KEY_USER pid=22007 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=22008 suid=74  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:43:25 localhost audit: CRYPTO_KEY_USER pid=22007 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=22007 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:43:25 localhost audit: CRYPTO_KEY_USER pid=22007 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=22007 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:43:25 localhost audit: CRYPTO_KEY_USER pid=22007 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=22007 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:43:25 localhost audit: USER_LOGIN pid=22007 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=login acct="(unknown)" exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=ssh res=failed'
Jul 26 01:48:08 localhost systemd: nagios.service: Start operation timed out. Terminating.
Jul 26 01:48:08 localhost systemd: Failed to start LSB: start and stop Nagios monitoring server.
Jul 26 01:48:08 localhost audit: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=nagios comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
Jul 26 01:48:08 localhost systemd: nagios.service: Unit entered failed state.
Jul 26 01:48:08 localhost systemd: nagios.service: Failed with result 'timeout'.
Jul 26 01:48:12 localhost systemd: Starting dnf makecache...
Jul 26 01:48:12 localhost dnf: Metadata cache refreshed recently.
Jul 26 01:48:12 localhost systemd: Started dnf makecache.
Jul 26 01:48:12 localhost audit: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=dnf-makecache comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 26 01:48:12 localhost audit: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=dnf-makecache comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 26 01:48:25 localhost audit: CRYPTO_KEY_USER pid=22282 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=22282 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:48:25 localhost audit: CRYPTO_KEY_USER pid=22282 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=22282 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:48:25 localhost audit: CRYPTO_KEY_USER pid=22282 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=22282 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:48:25 localhost audit: CRYPTO_KEY_USER pid=22281 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=22282 suid=74  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:48:25 localhost audit: CRYPTO_KEY_USER pid=22281 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:0b:25:c2:7d:3f:bf:15:db:a7:9b:94:fe:e4:a0:7f:d9:ed:20:14:a2:cb:6c:62:2c:fe:7f:29:70:08:14:34:8a direction=? spid=22281 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:48:25 localhost audit: CRYPTO_KEY_USER pid=22281 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:da:bc:c9:88:e3:7b:fc:25:8c:05:e4:03:04:72:50:b8:6c:0f:8f:ac:8d:dd:54:74:7a:d1:96:b8:38:1d:fd:11 direction=? spid=22281 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:48:25 localhost audit: CRYPTO_KEY_USER pid=22281 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=destroy kind=server fp=SHA256:98:90:14:59:1c:cd:f6:56:f6:ee:fc:18:73:90:6c:86:4c:86:3c:33:b5:f5:20:72:5e:7d:df:ca:13:17:f4:de direction=? spid=22281 suid=0  exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=? res=success'
Jul 26 01:48:25 localhost audit: USER_LOGIN pid=22281 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:sshd_t:s0-s0:c0.c1023 msg='op=login acct="(unknown)" exe="/usr/sbin/sshd" hostname=? addr=127.0.0.1 terminal=ssh res=failed'
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: Use Nagios and Xen

Post by rkennedy »

Code: Select all

[1469506245] SERVICE NOTIFICATION: nagiosadmin;localhost;Root Partition;CRITICAL;notify-service-by-email;DISK CRITICAL - free space: / 10760 MB (3% inode=99%):
[1469506245] wproc: NOTIFY job 2128 from worker Core Worker 1454 is a non-check helper but exited with return code 127
[1469506245] wproc:   command: /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Root Partition\nHost: localhost\nAddress: 127.0.0.1\nState: CRITICAL\n\nDate/Time: Tue Jul 26 00:10:45 EDT 2016\n\nAdditional Info:\n\nDISK CRITICAL - free space: / 10760 MB (3% inode=99%):\n" | /bin/mail -s "** PROBLEM Service Alert: localhost/Root Partition is CRITICAL **" nagios@localhost
[1469506245] wproc:   host=localhost; service=Root Partition; contact=nagiosadmin
[1469506245] wproc:   early_timeout=0; exited_ok=1; wait_status=32512; error_code=0;
[1469506245] wproc:   stderr line 01: /bin/sh: /bin/mail: No such file or directory
[1469506245] wproc:   stderr line 02: /usr/bin/printf: write error: Broken pipe
What is the output of which mail? You'll need to update your notify-host-by-email and notify-service-by-email definitions in the commands.cfg file. It looks like /bin/mail still doesn't exist.

Code: Select all

/usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Root Partition\nHost: localhost\nAddress: 127.0.0.1\nState: CRITICAL\n\nDate/Time: Tue Jul 26 00:10:45 EDT 2016\n\nAdditional Info:\n\nDISK CRITICAL - free space: / 10760 MB (3% inode=99%):\n" | /bin/mail -s "** PROBLEM Service Alert: localhost/Root Partition is CRITICAL **" nagios@localhost
Above is the command it's trying to execute. You can simulate it over the CLI to troubleshoot as needed.
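
Before rerunning that command by hand, it may help to confirm which mail binary is actually present. A small sketch, assuming the two usual candidate paths; mail_status is a hypothetical helper name, not a Nagios or system tool.

```shell
# Hypothetical helper: report whether a given path is an executable file.
mail_status() {
    if [ -x "$1" ]; then
        echo "$1: present"
    else
        echo "$1: missing"
    fi
}

# The path Nagios is calling, and the other common location.
mail_status /bin/mail
mail_status /usr/bin/mail

# What the shell itself resolves "mail" to, if anything.
command -v mail || echo "mail: not found in PATH"
```

If /bin/mail shows as missing while /usr/bin/mail is present, rerun the logged command with the working path substituted to check that delivery itself works.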
hack3rcon
Posts: 27
Joined: Sat Jul 16, 2016 9:50 am

Re: Use Nagios and Xen

Post by hack3rcon »

It is:

Code: Select all

[root@localhost ~]# which mail
/usr/bin/mail
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: Use Nagios and Xen

Post by rkennedy »

You need to update your commands.cfg file and change the notify-host-by-email and notify-service-by-email commands to use /usr/bin/mail instead of /bin/mail. The command that is running isn't using the correct path to the mail client:

Code: Select all

/usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Root Partition\nHost: localhost\nAddress: 127.0.0.1\nState: CRITICAL\n\nDate/Time: Tue Jul 26 00:10:45 EDT 2016\n\nAdditional Info:\n\nDISK CRITICAL - free space: / 10760 MB (3% inode=99%):\n" | /bin/mail -s "** PROBLEM Service Alert: localhost/Root Partition is CRITICAL **" nagios@localhost
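
A minimal sketch of that edit, assuming commands.cfg lives at /usr/local/nagios/etc/objects/commands.cfg (the usual location for a source install; adjust if yours differs). fix_mail_path is a hypothetical helper name, not part of Nagios. It keeps a .bak copy and rewrites only the piped /bin/mail call.

```shell
# Hypothetical helper: point the piped mail call at /usr/bin/mail.
# A backup of the original file is kept next to it as <file>.bak.
fix_mail_path() {
    cfg="$1"
    cp -p "$cfg" "$cfg.bak" || return 1
    # Match only "| /bin/mail " so an existing "/usr/bin/mail" is left alone.
    sed 's#| /bin/mail #| /usr/bin/mail #g' "$cfg.bak" > "$cfg"
}

# Assumed path; adjust to your install.
CFG=/usr/local/nagios/etc/objects/commands.cfg

if [ -f "$CFG" ]; then
    fix_mail_path "$CFG"
    # Show the lines that now reference the mail client.
    grep -n 'bin/mail' "$CFG"
    # Verify the configuration before restarting Nagios.
    /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
fi
```

Restart Nagios afterwards so the changed command definitions are loaded.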