<font face="Default Sans Serif,Verdana,Arial,Helvetica,sans-serif" size="2"><div>Hi, Kevin!</div><div><br></div><div>W<span style="font-family: arial, helvetica, sans-serif;">hat if you replace the drive with one of the hotspares? I mean, </span><span style="font-family: arial, helvetica, sans-serif;">let the hotspare stay at its place, and configure it for replacing the problematic drive. Then you will find out wether the backplane has a bad port or not. Allways start to try to narrow it down.</span></div><div><span style="font-family: arial, helvetica, sans-serif;"><br></span></div><div><span style="font-family: arial, helvetica, sans-serif;">Rgrds Johan</span></div><div><br></div><div><br></div><br><font color="#990099">-----"OmniOS-discuss" <omnios-discuss-bounces@lists.omniti.com> skrev: -----</font><div class="iNotesHistory" style="padding-left:5px;"><div style="padding-right:0px;padding-left:5px;border-left:solid black 2px;">Till: omnios-discuss@lists.omniti.com<br>Från: Kevin Swab <kevin.swab@colostate.edu><br>Sänt av: "OmniOS-discuss" <omnios-discuss-bounces@lists.omniti.com><br>Datum: 2013.10.30 18:38<br>Ärende: [OmniOS-discuss] multipath problem when replacing a failed SAS drive<br><br><div><font face="Courier New,Courier,monospace" size="3">Hello,<br><br>I'm running OmniOS r151006p on the following system:<br><br>- Supermicro X8DT6 board, Xeon E5606 CPU, 48GB ram<br>- Supermicro SC847 chassis, 36 drive bays, SAS expanders, LSI 9211-8i<br>controller<br>- 34 x Toshiba 3T SAS drives MG03SCA300 in one pool w/ 16 mirrored sets<br>+ 2 hot spares<br><br>'mpathadm list lu' showed all drives as having two paths to the controller.<br><br>Yesterday, one of the drives failed and was replaced. The new drive is<br>only showing one path in mpathadm, and errors have started showing up<br>periodically in /var/adm/messages:<br><br><br><br># mpathadm list lu /dev/rdsk/c1t5000039478CA7150d0<br>mpath-support: libmpscsi_vhci.so<br> /dev/rdsk/c1t5000039478CA7150d0s2<br> Total Path Count: 1<br> Operational Path Count: 1<br><br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br>Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.<br>Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc<br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br>Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.<br>Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc<br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br>Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.<br>Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc<br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event_sync: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br>Oct 30 09:30:22 hagler scsi: [ID 365881 kern.info]<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler Log info 0x31120101 received for target 89.<br>Oct 30 09:30:22 hagler scsi_status=0x0, ioc_status=0x804b, scsi_state=0xc<br>Oct 30 09:30:22 hagler scsi: [ID 243001 kern.warning] WARNING:<br>/pci@0,0/pci8086,3410@9/pci1000,3020@0 (mpt_sas0):<br>Oct 30 09:30:22 hagler mptsas_handle_event: IOCStatus=0x8000,<br>IOCLogInfo=0x31120101<br><br><br><br>The error messages refer to target 89, which I can confirm corresponds<br>to the missing path for my replacement drive using "lsiutil":<br><br><br><br># lsiutil -p 1 16<br><br>LSI Logic MPT Configuration Utility, Version 1.63, June 4, 2009<br><br>1 MPT Port found<br><br> Port Name Chip Vendor/Type/Rev MPT Rev Firmware Rev IOC<br> 1. mpt_sas0 LSI Logic SAS2008 03 200 0d000100 0<br><br>SAS2008's links are 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G<br><br> B___T SASAddress PhyNum Handle Parent Type<br>[ ... cut ... ]<br> 0 89 5000039478ca7152 17 0059 0032 SAS Target<br> 0 90 5000039478ca7153 17 005a 000a SAS Target<br>[ ... cut ... ]<br><br><br><br>When I ask "lsiutil" to rescan the bus, I see the following error when<br>it gets to target 89:<br><br><br><br># lsiutil -p 1 8<br><br>LSI Logic MPT Configuration Utility, Version 1.63, June 4, 2009<br><br>1 MPT Port found<br><br> Port Name Chip Vendor/Type/Rev MPT Rev Firmware Rev IOC<br> 1. mpt_sas0 LSI Logic SAS2008 03 200 0d000100 0<br><br>SAS2008's links are 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G, 6.0 G<br><br> B___T___L Type Vendor Product Rev<br>[ ... cut ... ]<br>ScsiIo to Bus 0 Target 89 failed, IOCStatus = 004b (IOC Terminated)<br> 0 90 0 Disk TOSHIBA MG03SCA300 0108 5000039478ca7153<br> 17<br>[ ... cut ... ]<br><br><br><br>This problem has happened to me once before on a similar system. At<br>that time, I tried reseating the drive, and tried several different<br>replacement drives, all had the same issue. I even tried rebooting the<br>system and that didn't help.<br><br>Does anyone know how I can clear this issue up? I'd be happy to provide<br>any additional information that might be helpful,<br><br>TIA,<br>Kevin<br><br><br><br>-- <br>-------------------------------------------------------------------<br>Kevin Swab UNIX Systems Administrator<br>ACNS Colorado State University<br>Phone: (970)491-6572 Email: Kevin.Swab@ColoState.EDU<br>GPG Fingerprint: 7026 3F66 A970 67BD 6F17 8EB8 8A7D 142F 2392 791C<br>_______________________________________________<br>OmniOS-discuss mailing list<br>OmniOS-discuss@lists.omniti.com<br><a href="http://lists.omniti.com/mailman/listinfo/omnios-discuss">http://lists.omniti.com/mailman/listinfo/omnios-discuss</a><br><br></font></div></omnios-discuss-bounces@lists.omniti.com></kevin.swab@colostate.edu></div></div><div></div></font>