[OmniOS-discuss] device probe related command timeouts

Joshua M. Clulow josh at sysmgr.org
Wed Jan 4 20:52:07 UTC 2017


On 4 January 2017 at 12:29, John Barfield <john.barfield at bissinc.com> wrote:
> I’ve got a SAN that seems to be timing out on any hardware probing commands such as “format” or “diskinfo” although prtconf seems to work.
>
> Does anyone happen to have a dtrace one liner or maybe kstat command I can run to see why/what they’re hanging on?

I would start by running "pstack" with the pid of one of the stuck
processes.  That will give you the part of the user program which is
stuck.  Then, I would get the in-kernel state of the stuck threads;
e.g., looking at my bash process:

    asgard # echo $$
    45435
    asgard # ps -fp 45435
         UID   PID  PPID   C    STIME TTY         TIME CMD
        root 45435 45433   0 20:47:17 pts/3       0:00 -bash

    asgard # mdb -k
    Loading modules: [ unix genunix specfs ... ]
    > 0t45435::pid2proc | ::ps -f
    S    PID   PPID   PGID    SID    UID      FLAGS             ADDR NAME
    R  45435  45433  45435  45435      0 0x4a014000 ffffff1b14d33048 -bash
    > 0t45435::pid2proc | ::walk thread | ::findstack -v
    stack pointer for thread ffffff03f776c080: ffffff0011b57c10
    [ ffffff0011b57c10 _resume_from_idle+0x112() ]
      ffffff0011b57c40 swtch+0x141()
      ffffff0011b57cd0 cv_wait_sig_swap_core+0x1b9(ffffff1b14d33108, ...)
      ffffff0011b57cf0 cv_wait_sig_swap+0x17(ffffff1b14d33108, ...)
      ffffff0011b57da0 waitid+0x315(7, 0, ffffff0011b57e30, f)
      ffffff0011b57eb0 waitsys32+0x36(7, 0, 8047750, f)
      ffffff0011b57f10 sys_syscall32+0x123()


That might tell us where in the storage subsystem you're getting stuck.


Cheers.

-- 
Joshua M. Clulow
UNIX Admin/Developer
http://blog.sysmgr.org


More information about the OmniOS-discuss mailing list