This page includes notes and references for setting up a Dell PowerEdge server to be monitored by CSCF's nagios system via OpenManage SNMP.
sudo apt-get install snmpd
sudo wget https://cs.uwaterloo.ca/cscf/infrastructure/setups/services/snmp/etc/snmp/snmpd.conf -O /etc/snmp/snmpd.conf sudo wget https://cs.uwaterloo.ca/cscf/infrastructure/setups/services/snmp/etc/default/snmpd -O /etc/default/snmpd
sudo echo "deb http://linux.dell.com/repo/community/ubuntu `lsb_release -s -c` openmanage" | sudo tee -a /etc/apt/sources.list.d/linux.dell.com.sources.list
sudo gpg --keyserver pool.sks-keyservers.net --recv-key 1285491434D8786F sudo gpg -a --export 1285491434D8786F | sudo apt-key add -
sudo apt-get update
sudo apt-get install srvadmin-base
sudo service dataeng start
sudo echo "# Add DELL OMSA capability to SNMP" >>/etc/snmp/snmpd.conf && echo "smuxpeer .1.3.6.1.4.1.674.10892.1" >>/etc/snmp/snmpd.conf
/etc/default/snmpd
. Replace SNMPDOPTS='-Lsd -Lf /dev/null -u snmp -g snmp -I -smux -p /var/run/snmpd.pid'with
SNMPDOPTS='-Lsd -Lf /dev/null -u snmp -g snmp -p /var/run/snmpd.pid'
sudo service snmpd restart
esxcli software vib install –d /tmp/OM-SrvAdmin-Dell-Web-7.1.0-5304.VIB-ESX50i_A00.zip
The check_openmanage plugin is installed on nagios.cscf and can be located at /usr/lib/nagios/plugins/check_openmanage
. To have a host monitored for Dell OpenManage, add the "Dell PowerEdge OpenManage" service in E&I.
To check your setup configuration, you can run the following on nagios.cscf (replacing HOSTNAME with the machine's hostname or IP address):
/usr/lib/nagios/plugins/check_openmanage -H HOSTNAME -C UofW -dwhich should result in something like this:
System: PowerEdge R415 II OMSA version: 7.4.0-1 ServiceTag: -redacted- Plugin version: 3.7.12 BIOS/date: 2.2.1 09/16/2013 Checking mode: SNMPv2c UDP/IPv4 ----------------------------------------------------------------------------- Storage Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+----------+-------------------------------------------------------- UNKNOWN | | Storage Error! No controllers found ----------------------------------------------------------------------------- Chassis Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+------+------------------------------------------------------------ OK | 0 | Memory module 0 [DIMM_A1, 16384 MB] is Ok OK | 1 | Memory module 1 [DIMM_A2, 16384 MB] is Ok OK | 2 | Memory module 2 [DIMM_B1, 16384 MB] is Ok OK | 3 | Memory module 3 [DIMM_B2, 16384 MB] is Ok OK | 0 | Chassis fan 0 [System Board FAN MOD 1A RPM] reading: 4440 RPM OK | 1 | Chassis fan 1 [System Board FAN MOD 1B RPM] reading: 3120 RPM OK | 2 | Chassis fan 2 [System Board FAN MOD 2A RPM] reading: 4440 RPM OK | 3 | Chassis fan 3 [System Board FAN MOD 2B RPM] reading: 3120 RPM OK | 4 | Chassis fan 4 [System Board FAN MOD 3A RPM] reading: 4440 RPM OK | 5 | Chassis fan 5 [System Board FAN MOD 3B RPM] reading: 3120 RPM OK | 6 | Chassis fan 6 [System Board FAN MOD 4A RPM] reading: 4440 RPM OK | 7 | Chassis fan 7 [System Board FAN MOD 4B RPM] reading: 3240 RPM OK | 8 | Chassis fan 8 [System Board FAN MOD 5A RPM] reading: 4560 RPM OK | 9 | Chassis fan 9 [System Board FAN MOD 5B RPM] reading: 3120 RPM OK | 10 | Chassis fan 10 [System Board FAN MOD 6A RPM] reading: 4560 RPM OK | 11 | Chassis fan 11 [System Board FAN MOD 6B RPM] reading: 3120 RPM OK | 0 | Power Supply 0 [AC]: Presence detected OK | 1 | Power Supply 1 [AC]: Presence detected OK | 0 | Temperature Probe 0 [Ambient Temp] reads 22 C (min=8/3, max=42/47) OK | 0 | Processor 0 [AMD Opteron 4376 HE] is Present OK | 1 | Processor 1 [AMD Opteron 4376 HE] is Present OK | 0 | Voltage sensor 0 [CPU1 VCORE] is Good OK | 1 | Voltage sensor 1 [CPU2 VCORE] is Good OK | 2 | Voltage sensor 2 [CPU1 2.5V VDDA PG] is Good OK | 3 | Voltage sensor 3 [CPU1 1.2V VDDR PG] is Good OK | 4 | Voltage sensor 4 [CPU2 2.5V VDDA PG] is Good OK | 5 | Voltage sensor 5 [CPU2 1.2V VDDR PG] is Good OK | 6 | Voltage sensor 6 [System Board 1.2V LOM PG] is Good OK | 7 | Voltage sensor 7 [System Board 1.2V AUX PG] is Good OK | 8 | Voltage sensor 8 [System Board PSU PG] is Good OK | 9 | Voltage sensor 9 [System Board 1.2V VM PG] is Good OK | 10 | Voltage sensor 10 [System Board 3.3V AUX PG] is Good OK | 11 | Voltage sensor 11 [System Board 1.8V AUX PG] is Good OK | 12 | Voltage sensor 12 [System Board 1.8V IOB PG] is Good OK | 13 | Voltage sensor 13 [System Board 1.1V IOB PG] is Good OK | 14 | Voltage sensor 14 [1.5V MEM PG] is Good OK | 15 | Voltage sensor 15 [1.5V MEM PG] is Good OK | 16 | Voltage sensor 16 [PS 1 Voltage 1] reads 210 V OK | 17 | Voltage sensor 17 [PS 2 Voltage 2] reads 0 V OK | 0 | Battery probe 0 [System Board CMOS Battery] is Presence Detected OK | 0 | Amperage probe 0 [PS 1 Current 1] reads 0.2 A OK | 1 | Amperage probe 1 [PS 2 Current 2] reads 0.002 A OK | 2 | Amperage probe 2 [System Board System Level] reads 80 W OK | 0 | Chassis intrusion 0 detection: Ok (Not Breached) OK | 0 | SD Card 0 [VFlash Media] is Absent ----------------------------------------------------------------------------- Other messages ============================================================================= STATE | MESSAGE TEXT ---------+------------------------------------------------------------------- OK | ESM log health is Ok (less than 80% full) OK | Chassis Service Tag is sane