
Re: Unable to get the temp for one disk in Stretch




On Sun, Jan 07, 2018 at 03:08:48PM -0300, Daniel Bareiro wrote:
Hi all!

I recently updated my firewall from Jessie to Stretch and I realized
that the check_lm_sensors plugin was apparently not working because it
was not possible to get the temperature of one of the disks:

# /usr/lib/nagios/plugins/check_lm_sensors --list
LM_SENSORS UNKNOWN - Error while executing /usr/sbin/hddtemp -n /dev/sdb

These are two disks which form several MD arrays.

# /usr/sbin/hddtemp -n /dev/sda
44

# /usr/sbin/hddtemp -n /dev/sdb
WARNING: Drive /dev/sdb doesn't seem to have a temperature sensor.
WARNING: This doesn't mean it hasn't got one.
WARNING: If you are sure it has one, please contact me (hddtemp@xxxxxxxx).
WARNING: See --help, --debug and --drivebase options.
/dev/sdb: WDC WD5000AAKX-60U6AA0:  no sensor

It *might* be that the drive database doesn't recognise the drive (it'd be odd for the database to forget about a drive, but accidents happen).

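Try running "sudo /usr/sbin/update-smart-drivedb" and check again. Note that this refreshes the smartmontools drive database; hddtemp keeps its own in /etc/hddtemp.db.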


In fact, not even smartctl is able to see the temperature:

# smartctl -a /dev/sda | grep Temperature_Celsius
194 Temperature_Celsius     0x0022   099   090   000    Old_age   Always       -       44

# smartctl -a /dev/sdb | grep Temperature_Celsius
#

It's weird because in Jessie I do not remember having this problem. I
can even find smartd entries in the syslog before doing the update:

# grep smart syslog
Jan  7 06:52:27 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 60 to 61
Jan  7 10:52:27 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 61 to 60
Jan  7 11:22:27 alderamin smartd[438]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 102 to 101
Jan  7 12:22:28 alderamin smartd[438]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 101 to 100
Jan  7 12:22:28 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 60 to 59
Jan  7 13:52:27 alderamin smartd[438]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 100 to 99
Jan  7 14:22:27 alderamin smartd[438]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 59 to 58

So I'm not sure where the problem may be: whether it is in smartd (and
hddtemp is based on smartd) or maybe in some kernel module.
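
Going by the smartd log lines above, /dev/sdb reports attribute 190
(Airflow_Temperature_Cel) while /dev/sda reports 194 (Temperature_Celsius),
so a grep for "Temperature_Celsius" would print nothing for sdb even if the
drive still exposes a temperature. A minimal sketch, assuming the attribute
table printed by "smartctl -A" keeps its usual ten-column layout with the raw
value in the tenth column; the function name smart_temp is made up for
illustration:

```shell
# Read `smartctl -A` output on stdin and print the name and raw value of
# attribute 194 (Temperature_Celsius) or 190 (Airflow_Temperature_Cel).
# Assumption: the raw value is the 10th whitespace-separated field.
smart_temp() {
    awk '$1 == 194 || $1 == 190 { print $2 ": " $10 }'
}
```

It could be used as "smartctl -A /dev/sdb | smart_temp". If that prints an
Airflow_Temperature_Cel line, the sensor is still visible to smartctl and only
the attribute name differs between the two disks.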

Any thoughts will be greatly appreciated.

Thanks in advance.


Kind regards,
Daniel





--
For more information, please reread.
