[Nagios-devel] BUG: nagios3 not closing filehandles for checkresult
Posted: Tue Apr 01, 2008 4:44 am
Hi list
we are testing nagios3 and since the release of the final version we
encountered a massive problem. Once a day nagios stops working
correctly. Log file ist full of:
-----
[$timesatmp] Warning: The check of service '$SERVICE' on host '$HOST'
could not be performed due to a fork() error. The check will be
rescheduled.
-----
After a restart of nagios everything works fine again. I have checked
the checkresult-spool-dir and found tons of _old_ files. When the
problem occurs nagios has open 1024 FHs most of them (> 1000) for
checkresult files. Of course 1024 is the open files limit for my nagios
user.
This problem occurs every day and it started exactly with the install of
3.0 final. We where running 3.0rc3 before and did not have any problems.
When i look at the process from time to time i do not see a list of FHs
building up slowly so i assume that there is some "event" which is
trigging this malfunction.
Chris
financial.com AG
Munich head office/Hauptsitz M=C3=BCnchen: Maria-Probst-Str. 19 | 80939 M=
=C3=BCnchen | Germany
Frankfurt branch office/Niederlassung Frankfurt: Messeturm | Friedrich-Eber=
t-Anlage 49 | 60327 Frankfurt | Germany
Management board/Vorstand: Dr. Steffen Boehnert (CEO/Vorsitzender) | Dr. Al=
exis Eisenhofer | Dr. Yann Samson | Matthias Wiederwach
Supervisory board/Aufsichtsrat: Dr. Dr. Ernst zur Linden (chairman/Vorsitze=
nder)
Register court/Handelsregister: Munich =E2=80=93 HRB 128 972 | Sales tax ID=
number/St.Nr.: DE205 370 553
This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]
we are testing nagios3 and since the release of the final version we
encountered a massive problem. Once a day nagios stops working
correctly. Log file ist full of:
-----
[$timesatmp] Warning: The check of service '$SERVICE' on host '$HOST'
could not be performed due to a fork() error. The check will be
rescheduled.
-----
After a restart of nagios everything works fine again. I have checked
the checkresult-spool-dir and found tons of _old_ files. When the
problem occurs nagios has open 1024 FHs most of them (> 1000) for
checkresult files. Of course 1024 is the open files limit for my nagios
user.
This problem occurs every day and it started exactly with the install of
3.0 final. We where running 3.0rc3 before and did not have any problems.
When i look at the process from time to time i do not see a list of FHs
building up slowly so i assume that there is some "event" which is
trigging this malfunction.
Chris
financial.com AG
Munich head office/Hauptsitz M=C3=BCnchen: Maria-Probst-Str. 19 | 80939 M=
=C3=BCnchen | Germany
Frankfurt branch office/Niederlassung Frankfurt: Messeturm | Friedrich-Eber=
t-Anlage 49 | 60327 Frankfurt | Germany
Management board/Vorstand: Dr. Steffen Boehnert (CEO/Vorsitzender) | Dr. Al=
exis Eisenhofer | Dr. Yann Samson | Matthias Wiederwach
Supervisory board/Aufsichtsrat: Dr. Dr. Ernst zur Linden (chairman/Vorsitze=
nder)
Register court/Handelsregister: Munich =E2=80=93 HRB 128 972 | Sales tax ID=
number/St.Nr.: DE205 370 553
This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]