Hey Tgriep,
Thanks for your reply.
Let me toggle the answers for you questions.
Are you running a host check of that system and if so, do you see anything there that could help debugging this?
Yes, hostcheck of that system is been enabled and the hostcheck too fails on that particular time scenario.
Are there any backups happening or any high load on the system that could be causing it?
I too suspected this and i logged the both the servers (Nagios Server and the Server Monitored) process,memory stack for the particular time scene where SNMPD gets punished.
#What is found is, i got a confirmation from the Production server that it has validated the SNMPD request from the Nagios Server.
#On that particular time scenario, a single process (backup process) peeks the memory and cpu utilization to maxmimum (But that too for merely a minute).
#But other side, i have nagios timeout for constant +3 minutes.
#Also i checked for system updates, but nothing like that is happening.
I believe that the Process which is consuming my memory and cpu should be the root cause for this.
Also I checked for kernel log and found "Selinux block-on some files". May be selinux might be the thief?
Attaching logs and proofs for you review.
dmesg.log
SELinux: initialized (dev sdf12, type ext3), uses xattr
type=1400 audit(1437886427.155:3): avc: denied { read } for pid=999 comm="pam_console_app" name="fstab" dev=sda2 ino=63637 scontext=system_u:system_r:pam_console_t:s0 tcontext=system_u:object_r:file_t:s0 tclass=file
type=1400 audit(1437886427.155:4): avc: denied { getattr } for pid=999 comm="pam_console_app" path="/etc/fstab" dev=sda2 ino=63637 scontext=system_u:system_r:pam_console_t:s0 tcontext=system_u:object_r:file_t:s0 tclass=file
Adding 19570584k swap on /swapmemory/swapfile. Priority:-1 extents:4931 across:19938584k
syslog
Jul 30 23:30:16 dhclient: DHCPREQUEST on eth0 to 10.0.0.1 port 67
Jul 30 23:30:16 dhclient: DHCPACK from 10.0.0.1
Jul 30 23:30:16 dhclient: bound to 10.0.0.151 -- renewal in 1432 seconds.
Jul 30 23:30:16 setroubleshoot:
SELinux is preventing access to files with the label, file_t. For complete SELinux messages. run sealert -l 0f0b393a-1629-4976-9d51-e76cc937163a
Jul 30 23:30:16 setroubleshoot: SELinux is preventing access to files with the label, file_t. For complete SELinux messages. run sealert -l 13e20a79-598c-4c3e-949d-00acf7a46ce0
Jul 30 23:30:24 snmpd[1676]: Connection from UDP: [10.0.0.226]:50424
(snmpd confirmation)
Jul 30 23:30:24 snmpd[1676]: Received SNMP packet(s) from UDP: [10.0.0.226]:50424
Jul 30 23:30:24 snmpd[1676]: Connection from UDP: [10.0.0.226]:44803
Audit.log
type=AVC msg=audit(1438314848.048:6131):
avc: denied { read } for pid=31517 comm="hostname" name="config" dev=sda2 ino=62733 scontext=system_u:system_r:hostname_t:s0 tcontext=system_u:object_r:file_t:s0 tclass=file
type=AVC msg=audit(1438314848.048:6132): avc: denied { getattr } for pid=31517 comm="hostname" path="/etc/selinux/config" dev=sda2 ino=62733 scontext=system_u:system_r:hostname_t:s0 tcontext=system_u:object_r:file_t:s0 tclass=file
ps.output
Thu Jul 30 23:00:01 EDT 2015
PID %CPU %MEM STAT COMMAND
3076 0.1 0.1 Ss ora_rsm0_
3012 0.0 4.7 Ss ora_dbw0_
3014 0.0 4.6 Ss ora_dbw1_
3008 0.0 0.1 Ss ora_dia0_
30626 0.0 0.0 Ss sshd: root@pts/0
112 0.0 0.0 S< [kswapd0]
3093 0.0 4.5 Ss ora_pr01_
1676 0.0 0.0 Sl /usr/sbin/snmpd -Lsd -Lf /dev/null -p /var/run/snmpd.pid -a
3095 0.0 4.5 Ss ora_pr02_