[OmniOS-discuss] Chasing down scsi-related warnings

Stephan Budach stephan.budach at JVM.DE
Sun Jul 3 14:33:24 UTC 2016


Hi all,

I am having trouble chasing down some network or drive-related errors on 
one of my OmniOS r018 boxes. It started by me noticing these errors in 
the syslog on one of my RSF-1 nodes. These are just a few, but I found 
almost every drive/LUN of that target node mentioned in the syslogd on 
the RSF-1 node:

Jul  3 15:51:01 zfsha01colt scsi: [ID 107833 kern.warning] WARNING: 
/scsi_vhci/disk at g600144f0564d504f4f4c3033534c3034 (sd4):
Jul  3 15:51:01 zfsha01colt     incomplete write- retrying
Jul  3 15:51:29 zfsha01colt scsi: [ID 107833 kern.warning] WARNING: 
/scsi_vhci/disk at g600144f0564d504f4f4c3033534c3035 (sd5):
Jul  3 15:51:29 zfsha01colt     incomplete write- retrying
Jul  3 15:55:25 zfsha01colt scsi: [ID 107833 kern.warning] WARNING: 
/scsi_vhci/disk at g600144f0564d504f4f4c3033534c3039 (sd6):
Jul  3 15:55:25 zfsha01colt     incomplete write- retrying
Jul  3 16:06:43 zfsha01colt scsi: [ID 107833 kern.warning] WARNING: 
/scsi_vhci/disk at g600144f0564d504f4f4c3033534c3135 (sd43):
Jul  3 16:06:43 zfsha01colt     incomplete write- retrying

Also, iostat -exM is showing HW errors for those LUNs, although I can't 
confirm that the actual drives are at fault on the iSCSI target, which 
is provided by another OmniOS box.

I then failed the zpools over from that target to the second HA node and 
the errors went along with it, so I am assuming that these errors are 
either network related to the storage node or maybe even 
drive/controller related to the storage node. However, I can't seem to 
pin point the problem. As these are only warnings, there is no visisble 
sign about any issue on the storage node, but nonetheless I'd like to 
know, what the underlying issue is.

Any ideas, anyone?

Thanks,
Stephan


More information about the OmniOS-discuss mailing list