jdalrymple wrote: 1) Your support experience will be improved if you adhere to the advice offered twice now by staff. I'll repeat it again in case you've missed it - if you have anything to add to your diagnostic information please edit your last post, do not create a new one. If you create a new one your age counter resets and it looks to us like it's a "new" issue.
Dimitri wrote:
- I'm really unsure how to keep editing the same first post if I have to quote new ones.
2) Am I correct in assuming that you're running the Linux Server wizard against an ESXi host and that's where the problem lies?
Dimitri wrote:
- NOT CORRECT AT ALL. I am running the SNMP Windows wizard against mostly Windows 2008 R2 servers.
Your product (whose purchase we were already approving) developed persistent errors after the v5 upgrade: one MaxBuffer error and one SNMP walk failure in the Wizard.
I still can't seem to get the Nagios support team to address this one.
3) Regarding latency, how much latency are we talking about? SNMP can handle some, but it suffers more from lossy connections than from latency because it runs over UDP, a connectionless protocol. Sometimes it becomes necessary to wrap SNMP checks into SSH or NRPE or split off a gearman worker so that the transport protocol can be relied upon.
Dimitri wrote:
I'm not entirely sure latency is the issue, because a CLI-based SNMP walk works while the Apache-based one (the Wizard) does not. The Wizard would also work until a certain number of hosts was reached. I really want to know more about the "default" SNMP configuration your product ships with and any changes that happened in version 5.
One of the reasons we chose Nagios XI is that in the old version its SNMP monitoring worked just fine (being less intrusive and more practical for many applications in our case, as we are talking about hundreds of them).
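To put rough numbers on the latency-versus-loss point above, here is a minimal sketch. It assumes packet loss is independent in each direction and that every request gets a fixed number of retries; the loss rates, retry count, and request count are illustrative values, not Nagios XI or Net-SNMP defaults.

```python
def request_failure_prob(loss: float, retries: int) -> float:
    """Probability that one SNMP request fails after all attempts.

    An attempt succeeds only if both the request datagram and the
    response datagram arrive, so a single attempt fails with
    probability 1 - (1 - loss)**2 (independent loss is an assumption).
    """
    attempt_fail = 1.0 - (1.0 - loss) ** 2
    return attempt_fail ** (retries + 1)


def walk_success_prob(loss: float, retries: int, requests: int) -> float:
    """Probability that a walk made of `requests` sequential requests completes."""
    return (1.0 - request_failure_prob(loss, retries)) ** requests


if __name__ == "__main__":
    # Illustrative values only: 500 requests per walk, 1 retry per request.
    for loss in (0.0, 0.01, 0.05, 0.20):
        p = walk_success_prob(loss, retries=1, requests=500)
        print(f"loss={loss:.0%}: walk completes with probability {p:.3f}")
```

Latency does not appear in this model at all: as long as replies arrive inside the timeout, a slow link only makes the walk take longer, whereas even modest loss compounds across the many requests in a walk.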
4) Can you be more specific about what broke as a result of the upgrade? What was working before the upgrade, and in what way did it stop working after the upgrade? The way I read it only the wizard broke, not existing hosts/services. I may be mistaken in my interpretation though.
Dimitri wrote:
I have mentioned this in every other post of mine from the last week, so again:
Before the upgrade, SNMP-based monitoring against Windows 2008 R2 servers WORKED (many test installs, various same or different nodes).
After the upgrade I immediately had MaxBuffer errors for all hosts under monitoring (about 20-30 in the XI development trial) when it came to monitoring services/processes. I re-ran the upgrade manually (on 4 XI test/dev installs in total), deleted all services/hosts and attempted to re-create monitoring for them via the SNMP for Windows server Wizard.
This is when I discovered that the Wizard would either fail right away on the second screen, or work for the first 5-7 hosts and fail afterwards, or fail while still showing drives as if the SNMP walk had partially worked. The faulty results were the same on 3-4 test/dev installs (situated in different LANs in different geographic locations).
In some cases raising max_msg_size from 5000 to 10000 would allow me to run the Wizard a few times before it failed, while in other cases it would fail right away anyway. The same change also cleared the MaxBuffer error produced by SNMP-based service checks, and the error returned the moment I switched max_msg_size back to 5000.
The 5.2 upgrade did not help, but I suddenly noticed that running the Wizard against hosts on the same LAN works without changing any settings, at least for some time.
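For anyone trying to reproduce the "CLI works, Wizard fails" difference, a small script like the following can run the same walk against each host with explicit timeout and retry settings and record which hosts fail. This is only a sketch: it assumes the Net-SNMP `snmpwalk` binary is installed, that the hosts answer SNMP v2c, and that the drive listing corresponds to `hrStorageTable`. The community string and the timeout/retry values are placeholders, not the settings the Wizard actually uses.

```python
import subprocess
import sys

# HOST-RESOURCES-MIB::hrStorageTable (assumed to be what the drive listing reads)
HR_STORAGE_TABLE = "1.3.6.1.2.1.25.2.3"


def build_snmpwalk_cmd(host, community="public", oid=HR_STORAGE_TABLE,
                       timeout_s=5, retries=1):
    """Build a Net-SNMP snmpwalk command line with explicit timeout and retries."""
    return ["snmpwalk", "-v", "2c", "-c", community,
            "-t", str(timeout_s), "-r", str(retries), host, oid]


def run_walk(host, **kwargs):
    """Run one walk; return (succeeded, number of output lines, error text)."""
    cmd = build_snmpwalk_cmd(host, **kwargs)
    try:
        proc = subprocess.run(cmd, capture_output=True, text=True, timeout=120)
    except (FileNotFoundError, subprocess.TimeoutExpired) as exc:
        return False, 0, str(exc)
    return proc.returncode == 0, len(proc.stdout.splitlines()), proc.stderr.strip()


if __name__ == "__main__":
    # Usage: python walk_hosts.py host1 host2 ...
    for target in sys.argv[1:]:
        ok, rows, err = run_walk(target)
        print(f"{target}: {'OK' if ok else 'FAILED'} rows={rows} {err}")
```

Running it once from a shell on the XI server and once with the Wizard open against the same hosts should show whether the failures follow the host, the number of hosts, or only the web-driven path.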