| Summary: | net-mgmt/nagios-plugins check_procs wrong CPU matching [idle] thread with --metric=CPU | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Ports & Packages | Reporter: | Volodymyr Pushkar <vladimir.pushkar> | ||||
| Component: | Individual Port(s) | Assignee: | Jochen Neumeister <joneum> | ||||
| Status: | New --- | ||||||
| Severity: | Affects Some People | CC: | mike.walker | ||||
| Priority: | --- | ||||||
| Version: | Latest | ||||||
| Hardware: | Any | ||||||
| OS: | Any | ||||||
| Attachments: |
|
||||||
|
Description
Volodymyr Pushkar
2021-03-13 08:22:11 UTC
Sorry for taking this long to get to this patch. I am sorry but I don't understand what the patch actually does. The issue is that without the included patch, check_procs will emit WARNING and CRITICAL for the system "idle" process.
For example on one of my servers, "top" lists the idle process as using 354% of the CPU:
# top -SCb | egrep '(COMMAND|idle)$'
CPU: 3.9% user, 0.2% nice, 1.9% system, 0.3% interrupt, 93.7% idle
PID USERNAME THR PRI NICE SIZE RES STATE C TIME CPU COMMAND
11 root 4 155 ki31 0B 64K CPU0 0 1797.0 354.88% idle
And because of this, the "idle" system process will be flagged as having too much CPU time by "check_procs" if "--metric=CPU" is passed, like this:
# /usr/local/libexec/nagios/check_procs -v -w 100 -c 105 --metric=CPU
CPU CRITICAL: 1 crit, 0 warn out of 110 processes [idle] | procs=110;;;0; procs_warn=0;;;0; procs_crit=1;;;0; procpcpu=410.899994;
|