Web lists-archives.com

Unable to get the temp for one disk in Stretch




Hi all!

I recently updated my firewall from Jessie to Stretch and I realized
that the check_lm_sensors plugin was apparently not working because it
was not possible to get the temperature of one of the disks:

# /usr/lib/nagios/plugins/check_lm_sensors --list
LM_SENSORS UNKNOWN - Error while executing /usr/sbin/hddtemp -n /dev/sdb

These are two disks which form several MD arrays.

# /usr/sbin/hddtemp -n /dev/sda
44

# /usr/sbin/hddtemp -n /dev/sdb
WARNING: Drive /dev/sdb doesn't seem to have a temperature sensor.
WARNING: This doesn't mean it hasn't got one.
WARNING: If you are sure it has one, please contact me (hddtemp@xxxxxxxx).
WARNING: See --help, --debug and --drivebase options.
/dev/sdb: WDC WD5000AAKX-60U6AA0:  no sensor

In fact, not even smartctl is able to see the temperature:

# smartctl -a /dev/sda | grep Temperature_Celsius
194 Temperature_Celsius     0x0022   099   090   000    Old_age   Always
      -       44

# smartctl -a /dev/sdb | grep Temperature_Celsius
#

It's weird because in Jessie I do not remember having this problem. I
can even find smartd entries in the syslog before doing the update:

# grep smart syslog
Jan  7 06:52:27 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 190 Airflow_Temperature_Cel changed from 60 to 61
Jan  7 10:52:27 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 190 Airflow_Temperature_Cel changed from 61 to 60
Jan  7 11:22:27 alderamin smartd[438]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 102 to 101
Jan  7 12:22:28 alderamin smartd[438]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 101 to 100
Jan  7 12:22:28 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 190 Airflow_Temperature_Cel changed from 60 to 59
Jan  7 13:52:27 alderamin smartd[438]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 100 to 99
Jan  7 14:22:27 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART
Usage Attribute: 190 Airflow_Temperature_Cel changed from 59 to 58

So I'm not sure where the problem may be. If it is in smartd (and
hddtemp is based on smartd) or maybe in some kernel module.

Any thoughts will be greatly appreciated.

Thanks in advance.


Kind regards,
Daniel

Attachment: signature.asc
Description: OpenPGP digital signature