<html><head><meta http-equiv="Content-Type" content="text/html charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">I’m not sure I’ve followed properly.  You have *two* interfaces.  You are not trying to provision these in an aggr, are you? As far as I’m aware, VMware does not support 802.3ad link aggregations.  (It’s possible that you can make it work with ESXi if you give the entire NIC to the guest — but I’m skeptical.)  The problem is that if you try to use link aggregation, some packets (up to half!) will be lost.  TCP and other protocols fare poorly in this situation.<div class=""><br class=""></div><div class="">It’s possible I’ve totally misunderstood what you’re trying to do, in which case I apologize.</div><div class=""><br class=""></div><div class="">The idle thing is a red herring — the cpu is waiting for work to do, probably because packets haven’t arrived (or were dropped by the hypervisor!)  I wouldn’t read too much into that except that your network stack is in trouble.  I’d look a bit more closely at the kstats for tcp — I suspect you’ll see retransmits or out of order values that are unusually high — if so, this may help validate my theory above.</div><div class=""><br class=""></div><div class=""><span class="Apple-tab-span" style="white-space:pre">	</span>- Garrett<br class=""><div class=""><div class=""><div class=""><br class=""><div><blockquote type="cite" class=""><div class="">On Mar 1, 2015, at 9:03 PM, W Verb via illumos-developer <<a href="mailto:developer@lists.illumos.org" class="">developer@lists.illumos.org</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class=""><p class="">Hello all,</p><div class=""> <br class="webkit-block-placeholder"></div><p class="">Well, I no longer blame the ixgbe driver for the problems I'm seeing.</p><p class=""><br class=""></p><p class="">I tried Joerg's updated driver, which didn't improve the issue. So I went back to the drawing board and rebuilt the
server from scratch.</p><p class="">What I noted is that if I have only a single 1-gig
physical interface active on the ESXi host, everything works as expected. As
soon as I enable two interfaces, I start seeing the performance problems I've
described.</p><p class="">Response pauses from the server that I see in TCPdumps are still leading me to believe the problem is
delay on the server side, so I ran a series of kernel dtraces and produced some
flamegraphs.</p><p class=""><br class=""></p><p class=""><span style="style" class=""></span>This was taken
during a read operation with two active 10G interfaces on the server, with a single target being shared by two tpgs, one tpg for each 10G physical port. The host device has two 1G ports enabled, with VLANs separating the active ports into 10G/1G pairs. ESXi is set to multipath using both VLANs with a round-robin IO interval of 1.<br class=""></p><p class=""><a href="https://drive.google.com/file/d/0BwyUMjibonYQd3ZYOGh4d2pteGs/view?usp=sharing" class="">https://drive.google.com/file/d/0BwyUMjibonYQd3ZYOGh4d2pteGs/view?usp=sharing</a></p><p class=""><br class=""><span style="style" class=""></span></p><p class=""><span style="style" class=""></span>This was taken
during a write operation:</p><p class=""><a href="https://drive.google.com/file/d/0BwyUMjibonYQMnBtU1Q2SXM2ams/view?usp=sharing" class="">https://drive.google.com/file/d/0BwyUMjibonYQMnBtU1Q2SXM2ams/view?usp=sharing</a></p><p class=""><br class=""></p><p class="">I then rebooted the server and disabled C-State, ACPI
T-State, and general EIST (Turbo boost) functionality in the CPU.</p><p class="">When I attempted to boot my guest VM, the iSCSI
transfer gradually ground to a halt during the boot loading process, and the guest OS never did complete its boot process.<br class=""></p><p class="">Here is a flamegraph taken while iSCSI is slowly dying:</p><p class=""><a href="https://drive.google.com/file/d/0BwyUMjibonYQM21JeFZPX3dZWTg/view?usp=sharing" class="">https://drive.google.com/file/d/0BwyUMjibonYQM21JeFZPX3dZWTg/view?usp=sharing</a></p><div class=""> <br class="webkit-block-placeholder"></div><p class="">I edited out cpu_idle_adaptive from the dtrace output and regenerated the
slowdown graph:</p><p class=""><a href="https://drive.google.com/file/d/0BwyUMjibonYQbTVwV3NvXzlPS1E/view?usp=sharing" class="">https://drive.google.com/file/d/0BwyUMjibonYQbTVwV3NvXzlPS1E/view?usp=sharing</a></p><div class=""> <br class="webkit-block-placeholder"></div><p class="">I then edited cpu_idle_adaptive out
of the speedy write operation and regenerated that graph:<br class=""></p><p class=""><a href="https://drive.google.com/file/d/0BwyUMjibonYQeWFYM0pCMDZ1X2s/view?usp=sharing" class="">https://drive.google.com/file/d/0BwyUMjibonYQeWFYM0pCMDZ1X2s/view?usp=sharing</a></p><div class=""> <br class="webkit-block-placeholder"></div><p class="">I have zero experience with interpreting flamegraphs, but
the most significant difference I see between the slow
read example and the fast write example is in unix`thread_start --> unix`idle. There's a good chunk of "unix`i86_mwait" in the read example that is not present in the write example at all.<br class=""></p><p class="">Disabling the l2arc cache device didn't make a
difference, and I had to reenable EIST support on the CPU to get my VMs to boot.<br class=""></p><p class="">I am seeing a variety of bug reports going back to 2010 regarding excessive mwait operations, with the suggested solutions usually being to set "cpupm enable poll-mode" in power.conf. That change also had no effect on speed.</p><p class="">-Warren V</p><div class=""> <br class="webkit-block-placeholder"></div><div class=""> <br class="webkit-block-placeholder"></div><div class=""> <br class="webkit-block-placeholder"></div><p class="">-----Original Message-----</p><p class="" style="style">From: Chris Siebenmann [<a href="mailto:cks@cs.toronto.edu" class="">mailto:cks@cs.toronto.edu</a>]</p><p class="">Sent: Monday, February 23, 2015 8:30 AM</p><p class="">To: W Verb</p><p class="">Cc: <a href="mailto:omnios-discuss@lists.omniti.com" class="">omnios-discuss@lists.omniti.com</a>;
<a href="mailto:cks@cs.toronto.edu" class="">cks@cs.toronto.edu</a></p><p class="">Subject: Re: [OmniOS-discuss] The ixgbe driver, Lindsay
Lohan, and the Greek economy</p><div class=""> <br class="webkit-block-placeholder"></div><p class="">> Chris, thanks for your specific details. I'd
appreciate it if you </p><p class="">> could tell me which copper NIC you tried, as well as
to pass on the </p><p class="">> iSCSI tuning parameters.</p><div class=""> <br class="webkit-block-placeholder"></div><p class=""><span style="style" class=""> </span>Our copper NIC
experience is with onboard X540-AT2 ports on SuperMicro hardware (which have
the guaranteed 10-20 msec lock hold) and dual-port 82599EB TN cards (which have
some sort of driver/hardware failure under load that eventually leads to
2-second lock holds). I can't recommend either with the current driver; we had
to revert to 1G networking in order to get stable servers.</p><div class=""> <br class="webkit-block-placeholder"></div><p class=""><span style="style" class=""> </span>The iSCSI
parameter modifications we do, across both initiators and targets, are:</p><div class=""> <br class="webkit-block-placeholder"></div><p class=""><span style="style" class="">      </span>initialr2t<span style="style" class="">        </span>no</p><p class=""><span style="style" class="">      </span>firstburstlength<span style="style" class="">  </span>128k</p><p class=""><span style="style" class="">      </span>maxrecvdataseglen<span style="style" class=""> </span>128k<span style="style" class="">        </span>[only
on Linux backends]</p><p class=""><span style="style" class="">      </span>maxxmitdataseglen<span style="style" class=""> </span>128k<span style="style" class="">        </span>[only
on Linux backends]</p><div class=""> <br class="webkit-block-placeholder"></div><p class="">The OmniOS initiator doesn't need tuning for more than
the first two parameters; on the Linux backends we tune up all four. My
extended thoughts on these tuning parameters and why we touch them can be found</p><p class="">here:</p><div class=""> <br class="webkit-block-placeholder"></div><p class=""><span style="style" class="">   </span><a href="http://utcc.utoronto.ca/~cks/space/blog/tech/UnderstandingiSCSIProtocol" class="">http://utcc.utoronto.ca/~cks/space/blog/tech/UnderstandingiSCSIProtocol</a></p><p class=""><span style="style" class="">   </span><a href="http://utcc.utoronto.ca/~cks/space/blog/tech/LikelyISCSITuning" class="">http://utcc.utoronto.ca/~cks/space/blog/tech/LikelyISCSITuning</a></p><div class=""> <br class="webkit-block-placeholder"></div><p class="">The short version is that these parameters probably only
make a small difference but their overall goal is to do 128KB ZFS reads and
writes in single iSCSI operations (although they will be fragmented at the TCP</p><p class="">layer) and to do iSCSI writes without a back-and-forth
delay between initiator and target (that's 'initialr2t no').</p><div class=""> <br class="webkit-block-placeholder"></div><p class=""><span style="style" class=""> </span>I think basically
everyone should use InitialR2T set to no and in fact that it should be the
software default. These days only unusually limited iSCSI targets should need
it to be otherwise and they can change their setting for it (initiator and
target must both agree to it being 'yes', so either can veto it).</p><div class=""> <br class="webkit-block-placeholder"></div><p class=""><span style="style" class="">      </span>- cks</p><div class=""> <br class="webkit-block-placeholder"></div></div><div class="gmail_extra"><br class=""><div class="gmail_quote">On Mon, Feb 23, 2015 at 8:21 AM, Joerg Goltermann <span dir="ltr" class=""><<a href="mailto:jg@osn.de" target="_blank" class="">jg@osn.de</a>></span> wrote:<br class=""><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br class=""><br class="">
I think your problem is caused by your link properties or your<br class="">
switch settings. In general the standard ixgbe seems to perform<br class="">
well.<br class=""><br class="">
I had trouble after changing the default flow control settings to "bi"<br class="">
and this was my motivation to update the ixgbe driver a long time ago.<br class="">
After I updated our systems to ixgbe 2.5.8 I never had any<br class="">
problems ....<br class=""><br class="">
Make sure your switch has support for jumbo frames and that you use<br class="">
the same MTU on all ports; otherwise the smallest will be used.<br class=""><br class="">
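A quick way to double-check both settings on the OmniOS side (a sketch,<br class="">
assuming the link is named ixgbe0; "dladm show-phys" lists the real names):<br class="">
<br class="">
dladm show-linkprop -p mtu,flowctrl ixgbe0   # current MTU and flow-control setting<br class="">
dladm set-linkprop -p mtu=9000 ixgbe0        # enable jumbo frames on the link<br class="">
<br class="">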
What switch do you use? I can tell you nice horror stories about<br class="">
different vendors....<br class=""><br class="">
 - Joerg<span class=""><br class=""><br class="">
On 23.02.2015 10:31, W Verb wrote:<br class=""></span><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">
Thank you Joerg,<br class=""><br class="">
I've downloaded the package and will try it tomorrow.<br class=""><br class="">
The only thing I can add at this point is that upon review of my<br class="">
testing, I may have performed my "pkg -u" between the initial quad-gig<br class="">
performance test and installing the 10G NIC. So this may be a new<br class="">
problem introduced in the latest updates.<br class=""><br class="">
Those of you who are running 10G and have not upgraded to the latest<br class="">
kernel, etc, might want to do some additional testing before running the<br class="">
update.<br class=""><br class="">
-Warren V<br class=""><br class="">
On Mon, Feb 23, 2015 at 1:15 AM, Joerg Goltermann <<a href="mailto:jg@osn.de" target="_blank" class="">jg@osn.de</a><br class=""></span><span class="">
<mailto:<a href="mailto:jg@osn.de" target="_blank" class="">jg@osn.de</a>>> wrote:<br class=""><br class="">
    Hi,<br class=""><br class="">
    I remember there was a problem with the flow control settings in the<br class="">
    ixgbe<br class="">
    driver, so I updated it a long time ago for our internal servers to<br class="">
    2.5.8.<br class="">
    Last weekend I integrated the latest changes from the FreeBSD driver<br class="">
    to bring<br class="">
    the illumos ixgbe to 2.5.25 but I had no time to test it, so it's<br class="">
    completely<br class="">
    untested!<br class=""><br class=""><br class="">
    If you would like to give the latest driver a try you can fetch the<br class="">
    kernel modules from<br class=""></span>
    <a href="https://cloud.osn.de/index.__php/s/Fb4so9RsNnXA7r9" target="_blank" class="">https://cloud.osn.de/index.__<u class=""></u>php/s/Fb4so9RsNnXA7r9</a><span class=""><br class="">
    <<a href="https://cloud.osn.de/index.php/s/Fb4so9RsNnXA7r9" target="_blank" class="">https://cloud.osn.de/index.<u class=""></u>php/s/Fb4so9RsNnXA7r9</a>><br class=""><br class="">
    Clone your boot environment, place the modules in the new environment<br class="">
    and update the boot-archive of the new BE.<br class=""><br class="">
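    For example (a sketch only; the BE name is illustrative and the module<br class="">
    path assumes the 64-bit ixgbe module from the download above):<br class="">
<br class="">
    beadm create ixgbe-2.5.25            # clone the current boot environment<br class="">
    beadm mount ixgbe-2.5.25 /mnt        # mount the clone<br class="">
    cp ixgbe /mnt/kernel/drv/amd64/      # drop the new module into the clone<br class="">
    bootadm update-archive -R /mnt       # rebuild the clone's boot archive<br class="">
    beadm activate ixgbe-2.5.25          # boot into the clone on next reboot<br class="">
<br class="">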
      - Joerg<br class=""><br class=""><br class=""><br class=""><br class=""><br class="">
    On 23.02.2015 02:54, W Verb wrote:<br class=""><br class="">
        By the way, to those of you who have working setups: please send me<br class="">
        your pool/volume settings, interface linkprops, and any kernel<br class="">
        tuning<br class="">
        parameters you may have set.<br class=""><br class="">
        Thanks,<br class="">
        Warren V<br class=""><br class="">
        On Sat, Feb 21, 2015 at 7:59 AM, Schweiss, Chip<br class=""></span><div class=""><div class="h5">
        <<a href="mailto:chip@innovates.com" target="_blank" class="">chip@innovates.com</a> <mailto:<a href="mailto:chip@innovates.com" target="_blank" class="">chip@innovates.com</a>>> wrote:<br class=""><br class="">
            I can't say I totally agree with your performance<br class="">
            assessment.   I run Intel<br class="">
            X520 in all my OmniOS boxes.<br class=""><br class="">
            Here is a capture of nfssvrtop I made while running many<br class="">
            storage vMotions<br class="">
            between two OmniOS boxes hosting NFS datastores.   This is a<br class="">
            10 host VMware<br class="">
            cluster.  Both OmniOS boxes are dual 10G connected with<br class="">
            copper twin-ax to<br class="">
            the in rack Nexus 5010.<br class=""><br class="">
            VMware does 100% sync writes, I use ZeusRAM SSDs for log<br class="">
            devices.<br class=""><br class="">
            -Chip<br class=""><br class="">
            2014 Apr 24 08:05:51, load: 12.64, read: 17330243 KB, swrite: 15985 KB, awrite: 1875455 KB<br class=""><br class="">
            Ver  Client         NFSOPS   Reads SWrites AWrites Commits   Rd_bw  SWr_bw  AWr_bw  Rd_t SWr_t AWr_t Com_t Align%<br class="">
            4    10.28.17.105        0       0       0       0       0       0       0       0     0     0     0     0      0<br class="">
            4    10.28.17.215        0       0       0       0       0       0       0       0     0     0     0     0      0<br class="">
            4    10.28.17.213        0       0       0       0       0       0       0       0     0     0     0     0      0<br class="">
            4    10.28.16.151        0       0       0       0       0       0       0       0     0     0     0     0      0<br class="">
            4    all                 1       0       0       0       0       0       0       0     0     0     0     0      0<br class="">
            3    10.28.16.175        3       0       3       0       0       1      11       0  4806    48     0     0     85<br class="">
            3    10.28.16.183        6       0       6       0       0       3     162       0   549   124     0     0     73<br class="">
            3    10.28.16.180       11       0      10       0       0       3      27       0   776    89     0     0     67<br class="">
            3    10.28.16.176       28       2      26       0       0      10     405       0  2572   198     0     0    100<br class="">
            3    10.28.16.178     4606    4602       4       0       0  294534       3       0   723    49     0     0     99<br class="">
            3    10.28.16.179     4905    4879      26       0       0  312208     311       0   735   271     0     0     99<br class="">
            3    10.28.16.181     5515    5502      13       0       0  352107      77       0    89    87     0     0     99<br class="">
            3    10.28.16.184    12095   12059      10       0       0  763014      39       0   249   147     0     0     99<br class="">
            3    10.28.58.1      15401    6040     116    6354      53  191605     474  202346   192    96   144    83     99<br class="">
            3    all             42574   33086     217    6354      53 1913488    1582  202300   348   138   153   105     99<br class=""><br class=""></div></div><span class=""><br class=""><br class=""><br class="">
            On Fri, Feb 20, 2015 at 11:46 PM, W Verb <<a href="mailto:wverb73@gmail.com" target="_blank" class="">wverb73@gmail.com</a><br class=""></span><div class=""><div class="h5">
            <mailto:<a href="mailto:wverb73@gmail.com" target="_blank" class="">wverb73@gmail.com</a>>> wrote:<br class=""><br class=""><br class="">
                Hello All,<br class=""><br class="">
                Thank you for your replies.<br class="">
                I tried a few things, and found the following:<br class=""><br class="">
                1: Disabling hyperthreading support in the BIOS drops<br class="">
                performance overall<br class="">
                by a factor of 4.<br class="">
                2: Disabling VT support also seems to have some effect,<br class="">
                although it<br class="">
                appears to be minor. But this has the amusing side<br class="">
                effect of fixing the<br class="">
                hangs I've been experiencing with fast reboot. Probably<br class="">
                by disabling kvm.<br class="">
                3: The performance tests are a bit tricky to quantify<br class="">
                because of caching<br class="">
                effects. In fact, I'm not entirely sure what is<br class="">
                happening here. It's just<br class="">
                best to describe what I'm seeing:<br class=""><br class="">
                The commands I'm using to test are<br class="">
                dd if=/dev/zero of=./test.dd bs=2M count=5000<br class="">
                dd of=/dev/null if=./test.dd bs=2M count=5000<br class="">
                The host vm is running Centos 6.6, and has the latest<br class="">
                vmtools installed.<br class="">
                There is a host cache on an SSD local to the host that<br class="">
                is also in place.<br class="">
                Disabling the host cache didn't immediately have an<br class="">
                effect as far as I could<br class="">
                see.<br class=""><br class="">
                The host MTU set to 3000 on all iSCSI interfaces for all<br class="">
                tests.<br class=""><br class="">
                Test 1: Right after reboot, with an ixgbe MTU of 9000,<br class="">
                the write test<br class="">
                yields an average speed over three tests of 137MB/s. The<br class="">
                read test yields an<br class="">
                average over three tests of 5MB/s.<br class=""><br class="">
                Test 2: After setting "ifconfig ixgbe0 mtu 3000", the<br class="">
                write tests yield<br class="">
                140MB/s, and the read tests yield 53MB/s. It's important<br class="">
                to note here that<br class="">
                if I cut the read test short at only 2-3GB, I get<br class="">
                results upwards of<br class="">
                350MB/s, which I assume is local cache-related distortion.<br class=""><br class="">
                Test 3: MTU of 1500. Read tests are up to 156 MB/s.<br class="">
                Write tests yield<br class="">
                about 142MB/s.<br class="">
                Test 4: MTU of 1000: Read test at 182MB/s.<br class="">
                Test 5: MTU of 900: Read test at 130 MB/s.<br class="">
                Test 6: MTU of 1000: Read test at 160MB/s. Write tests<br class="">
                are now<br class="">
                consistently at about 300MB/s.<br class="">
                Test 7: MTU of 1200: Read test at 124MB/s.<br class="">
                Test 8: MTU of 1000: Read test at 161MB/s. Write at 261MB/s.<br class=""><br class="">
                A few final notes:<br class="">
                L1ARC grabs about 10GB of RAM during the tests, so<br class="">
                there's definitely some<br class="">
                read caching going on.<br class="">
                The write operations are easier to observe with iostat,<br class="">
                and I'm seeing io<br class="">
                rates that closely correlate with the network write speeds.<br class=""><br class=""><br class="">
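                (A minimal sketch of the sort of commands behind those observations<br class="">
                on OmniOS; the exact invocations are assumptions, not a record of<br class="">
                what was actually run:<br class="">
                iostat -xn 1                    # per-device throughput and service times<br class="">
                kstat -p zfs:0:arcstats:size    # current ARC size in bytes<br class="">
                )<br class=""><br class="">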
                Chris, thanks for your specific details. I'd appreciate<br class="">
                it if you could<br class="">
                tell me which copper NIC you tried, as well as to pass<br class="">
                on the iSCSI tuning<br class="">
                parameters.<br class=""><br class="">
                I've ordered an Intel EXPX9502AFXSR, which uses the<br class="">
                82598 chip instead of<br class="">
                the 82599 in the X520. If I get similar results with my<br class="">
                fiber transceivers,<br class="">
                I'll see if I can get a hold of copper ones.<br class=""><br class="">
                But I should mention that I did indeed look at PHY/MAC<br class="">
                error rates, and<br class="">
                they are nil.<br class=""><br class="">
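                (One way to pull those counters on OmniOS, assuming the link is<br class="">
                named ixgbe0:<br class="">
                dladm show-link -s ixgbe0       # ipackets/opackets plus ierrors/oerrors<br class="">
                kstat -m ixgbe | grep -i err    # driver and MAC error kstats<br class="">
                )<br class=""><br class="">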
                -Warren V<br class=""><br class="">
                On Fri, Feb 20, 2015 at 7:25 PM, Chris Siebenmann<br class=""></div></div>
                <<a href="mailto:cks@cs.toronto.edu" target="_blank" class="">cks@cs.toronto.edu</a> <mailto:<a href="mailto:cks@cs.toronto.edu" target="_blank" class="">cks@cs.toronto.edu</a>>><div class=""><div class="h5"><br class="">
                wrote:<br class=""><br class=""><br class="">
                        After installation and configuration, I observed<br class="">
                        all kinds of bad<br class="">
                        behavior<br class="">
                        in the network traffic between the hosts and the<br class="">
                        server. All of this<br class="">
                        bad<br class="">
                        behavior is traced to the ixgbe driver on the<br class="">
                        storage server. Without<br class="">
                        going<br class="">
                        into the full troubleshooting process, here are<br class="">
                        my takeaways:<br class=""><br class="">
                    [...]<br class=""><br class="">
                       For what it's worth, we managed to achieve much<br class="">
                    better line rates on<br class="">
                    copper 10G ixgbe hardware of various descriptions<br class="">
                    between OmniOS<br class="">
                    and CentOS 7 (I don't think we ever tested OmniOS to<br class="">
                    OmniOS). I don't<br class="">
                    believe OmniOS could do TCP at full line rate but I<br class="">
                    think we managed 700+<br class="">
                    Mbytes/sec on both transmit and receive and we got<br class="">
                    basically disk-limited<br class="">
                    speeds with iSCSI (across multiple disks on<br class="">
                    multi-disk mirrored pools,<br class="">
                    OmniOS iSCSI initiator, Linux iSCSI targets).<br class=""><br class="">
                       I don't believe we did any specific kernel tuning<br class="">
                    (and in fact some of<br class="">
                    our attempts to fiddle ixgbe driver parameters blew<br class="">
                    up in our face).<br class="">
                    We did tune iSCSI connection parameters to increase<br class="">
                    various buffer<br class="">
                    sizes so that ZFS could do even large single<br class="">
                    operations in single iSCSI<br class="">
                    transactions. (More details available if people are<br class="">
                    interested.)<br class=""><br class="">
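                    (As an illustrative sketch only: if the Linux target stack were LIO,<br class="">
                    the iSCSI negotiation keys mentioned elsewhere in this thread<br class="">
                    (InitialR2T, FirstBurstLength, MaxRecvDataSegmentLength,<br class="">
                    MaxXmitDataSegmentLength) map to per-TPG parameters set via targetcli,<br class="">
                    here with a hypothetical target IQN:<br class="">
                    targetcli /iscsi/iqn.2015-02.example:tgt1/tpg1 set parameter \<br class="">
                        InitialR2T=No FirstBurstLength=131072 \<br class="">
                        MaxRecvDataSegmentLength=131072 MaxXmitDataSegmentLength=131072<br class="">
                    The message doesn't say which Linux target was used, so treat this as<br class="">
                    an example of the keys rather than the actual configuration.)<br class=""><br class="">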
                        10: At the wire level, the speed problems are<br class="">
                        clearly due to pauses in<br class="">
                        response time by omnios. At 9000 byte frame<br class="">
                        sizes, I see a good number<br class="">
                        of duplicate ACKs and fast retransmits during<br class="">
                        read operations (when<br class="">
                        omnios is transmitting). But below about a<br class="">
                        4100-byte MTU on omnios<br class="">
                        (which seems to correlate to 4096-byte iSCSI<br class="">
                        block transfers), the<br class="">
                        transmission errors fade away and we only see<br class="">
                        the transmission pause<br class="">
                        problem.<br class=""><br class=""><br class="">
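                    (Only a sketch: the server-side counterpart of those wire-level symptoms<br class="">
                    can be watched on OmniOS through the TCP MIB counters, e.g.:<br class="">
                    netstat -s -P tcp | egrep -i 'retrans|dupack|unorder'<br class="">
                    kstat -p tcp:0:tcp:tcpRetransSegs tcp:0:tcp:tcpInDataUnorderSegs<br class="">
                    The exact statistic names vary a little between releases, so treat them<br class="">
                    as assumptions.)<br class=""><br class="">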
                       This is what really attracted my attention. In<br class="">
                    our OmniOS setup, our<br class="">
                    specific Intel hardware had ixgbe driver issues that<br class="">
                    could cause<br class="">
                    activity stalls during once-a-second link heartbeat<br class="">
                    checks. This<br class="">
                    obviously had an effect at the TCP and iSCSI layers.<br class="">
                    My initial message<br class="">
                    to illumos-developer sparked a potentially<br class="">
                    interesting discussion:<br class=""><br class=""><br class=""></div></div>
                    <a href="http://www.listbox.com/member/__archive/182179/2014/10/sort/__time_rev/page/16/entry/6:405/__20141003125035:6357079A-4B1D-__11E4-A39C-D534381BA44D/" target="_blank" class="">http://www.listbox.com/member/<u class=""></u>__archive/182179/2014/10/sort/<u class=""></u>__time_rev/page/16/entry/6:<u class=""></u>405/__20141003125035:6357079A-<u class=""></u>4B1D-__11E4-A39C-D534381BA44D/</a><span class=""><br class="">
                    <<a href="http://www.listbox.com/member/archive/182179/2014/10/sort/time_rev/page/16/entry/6:405/20141003125035:6357079A-4B1D-11E4-A39C-D534381BA44D/" target="_blank" class="">http://www.listbox.com/<u class=""></u>member/archive/182179/2014/10/<u class=""></u>sort/time_rev/page/16/entry/6:<u class=""></u>405/20141003125035:6357079A-<u class=""></u>4B1D-11E4-A39C-D534381BA44D/</a>><br class=""><br class="">
                    If you think this is a possibility in your setup,<br class="">
                    I've put the DTrace<br class="">
                    script I used to hunt for this up on the web:<br class=""><br class=""></span>
                    <a href="http://www.cs.toronto.edu/~__cks/src/omnios-ixgbe/ixgbe___delay.d" target="_blank" class="">http://www.cs.toronto.edu/~__<u class=""></u>cks/src/omnios-ixgbe/ixgbe___<u class=""></u>delay.d</a><span class=""><br class="">
                    <<a href="http://www.cs.toronto.edu/~cks/src/omnios-ixgbe/ixgbe_delay.d" target="_blank" class="">http://www.cs.toronto.edu/~<u class=""></u>cks/src/omnios-ixgbe/ixgbe_<u class=""></u>delay.d</a>><br class=""><br class="">
                    This isn't the only potential source of driver<br class="">
                    stalls by any means, it's<br class="">
                    just the one I found. You may also want to look at<br class="">
                    lockstat in general,<br class="">
                    as information it reported is what led us to look<br class="">
                    specifically at the<br class="">
                    ixgbe code here.<br class=""><br class="">
                    (If you suspect kernel/driver issues, lockstat<br class="">
                    combined with kernel<br class="">
                    source is a really excellent resource.)<br class=""><br class="">
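                    (A minimal sketch of the kind of lockstat runs meant here; the options<br class="">
                    are illustrative, not the ones used in this investigation:<br class="">
                    lockstat -D 20 sleep 10         # top 20 lock-contention events over 10s<br class="">
                    lockstat -kIW -D 20 sleep 30    # kernel profiling by sampled PCs<br class="">
                    )<br class=""><br class="">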
                              - cks<br class=""><br class=""><br class=""><br class=""><br class=""></span>
        <span class=""><br class="">
    --<br class="">
    OSN Online Service Nuernberg GmbH, Bucher Str. 78, 90408 Nuernberg<br class=""></span>
    Tel: <a href="tel:%2B49%20911%2039905-0" target="_blank" value="+49911399050" class="">+49 911 39905-0</a> <tel:%2B49%20911%2039905-0> - Fax: +49 911<br class="">
    39905-55 <tel:%2B49%20911%2039905-55> - <a href="http://www.osn.de/" target="_blank" class="">http://www.osn.de</a><span class=""><br class="">
    HRB 15022 Nuernberg, USt-Id: DE189301263, GF: Joerg Goltermann<br class=""><br class=""><br class=""></span></blockquote><div class="HOEnZb"><div class="h5"><br class="">
-- <br class="">
OSN Online Service Nuernberg GmbH, Bucher Str. 78, 90408 Nuernberg<br class="">
Tel: <a href="tel:%2B49%20911%2039905-0" target="_blank" value="+49911399050" class="">+49 911 39905-0</a> - Fax: <a href="tel:%2B49%20911%2039905-55" target="_blank" value="+499113990555" class="">+49 911 39905-55</a> - <a href="http://www.osn.de/" target="_blank" class="">http://www.osn.de</a><br class="">
HRB 15022 Nuernberg, USt-Id: DE189301263, GF: Joerg Goltermann<br class=""></div></div></blockquote></div><br class=""></div></div></blockquote></div><br class=""></div></div></div></div></body></html>