HP Health SNMP Configuration
This page includes notes and references for setting up an HP Proliant server to be monitored by CSCF's nagios system via hp-health SNMP.
Ubuntu Configuration
- Install SNMP Daemon
sudo apt-get install snmpd
- Install the CSCF SNMP configuration files
sudo wget https://cs.uwaterloo.ca/cscf/infrastructure/setups/services/snmp/etc/snmp/snmpd.conf -O /etc/snmp/snmpd.conf
sudo wget https://cs.uwaterloo.ca/cscf/infrastructure/setups/services/snmp/etc/default/snmpd -O /etc/default/snmpd
- Add HP's repository
sudo echo "deb http://downloads.linux.hpe.com/SDR/repo/mcp `lsb_release -s -c`/current non-free" > /etc/apt/sources.list.d/hp.list
- Add HP's GPG public keys
sudo curl http://downloads.linux.hpe.com/SDR/hpPublicKey1024.pub | apt-key add -
sudo curl http://downloads.linux.hpe.com/SDR/hpPublicKey2048.pub | apt-key add -
sudo curl http://downloads.linux.hpe.com/SDR/hpPublicKey2048_key1.pub | apt-key add -
- Make apt aware of our changes
sudo apt-get update
- Install hp-health and SNMP agents
sudo apt-get install hp-health hp-snmp-agents
- Add HP Health capability to SNMP:
echo "# HP Health Monitoring" >> /etc/snmp/snmpd.conf && echo "dlmod cmaX /usr/lib/libcmaX64.so" >> /etc/snmp/snmpd.conf
- Restart SNMP
sudo service snmpd restart
Nagios Configuration
The check_hpasm plugin is installed on nagios.cscf and can be located at
/usr/lib/nagios/plugins/check_hpasm
. To have a host monitored for HP Health, add the "HP Health" service in E&I.
To check your setup configuration, you can run the following on nagios.cscf (replacing HOSTNAME with the machine's hostname or IP address):
/usr/lib/nagios/plugins/check_hpasm --hostname HOSTNAME --community UofW --protocol 1
which should result in something like this:
OK - System: 'proliant dl585 g7', S/N: '-redacted-', ROM: '0x31322f31302f32301818d2b8e57f2c2046616d696c7920413136', hardware working fine, da: 2 logical drives, 6 physical drives, cpu_0=ok cpu_1=ok cpu_2=ok cpu_3=ok ps_1=ok ps_2=ok ps_3=ok ps_4=ok fan_1=50% fan_2=50% fan_3=50% fan_4=50% temp_1=18 temp_2=11 temp_3=11 temp_4=12 temp_5=9 temp_6=35 temp_7=25 temp_8=24 temp_9=25 temp_10=25 temp_11=25 temp_12=25 temp_13=25 temp_14=25 temp_15=25 temp_16=27 temp_17=25 temp_18=25 temp_19=24 temp_20=24 temp_21=24 temp_22=26 temp_23=29 temp_24=25 temp_25=30 temp_26=32 temp_27=37 temp_28=41 temp_29=37 temp_30=27 temp_31=28 temp_32=25 | pc_1=70;1200;1200 pc_2=50;1200;1200 pc_3=40;1200;1200 pc_4=40;1200;1200 fan_1=50% fan_2=50% fan_3=50% fan_4=50% temp_1_ambient=18;41;41 temp_2_cpu=11;75;75 temp_3_cpu=11;75;75 temp_4_cpu=12;75;75 temp_5_cpu=9;75;75 temp_6_storage=35;95;95 temp_7_memory=25;95;95 temp_8_memory=24;95;95 temp_9_memory=25;95;95 temp_10_memory=25;95;95 temp_11_memory=25;95;95 temp_12_memory=25;95;95 temp_13_memory=25;95;95 temp_14_memory=25;95;95 temp_15_system=25;126;126 temp_16_system=27;126;126 temp_17_system=25;126;126 temp_18_system=25;126;126 temp_19_system=24;126;126 temp_20_system=24;126;126 temp_21_system=24;126;126 temp_22_system=26;126;126 temp_23_system=29;126;126 temp_24_system=25;60;60 temp_25_system=30;126;126 temp_26_system=32;70;70 temp_27_system=37;126;126 temp_28_ioBoard=41;92;92 temp_29_ioBoard=37;92;92 temp_30_ioBoard=27;70;70 temp_31_ioBoard=28;70;70 temp_32_system=25;126;126