<div dir="ltr">Hi Dwight,<div><br></div><div>Yes it only stuck during creating concurrent containers for me and start/stop/freeze/unfreeze seems to work fine. If it helps I'm pretty sure that it was working fine till last week (or I was so lucky not to hit by this problem before). Go binding's test suite does lots of concurrent stuff to test bindings and they were passing but now ConcurrentCreate test case (which creates 10 containers in parallel) hangs most of the time.</div>
<div><br></div></div><div class="gmail_extra"><br><br><div class="gmail_quote">On Fri, Sep 13, 2013 at 5:29 PM, Dwight Engen <span dir="ltr"><<a href="mailto:dwight.engen@oracle.com" target="_blank">dwight.engen@oracle.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="im">On Fri, 13 Sep 2013 12:09:55 -0400<br>
S.Çağlar Onur <<a href="mailto:caglar@10ur.org">caglar@10ur.org</a>> wrote:<br>
<br>
> Hi Dwight,<br>
><br>
> Yes, I only observed a hang so far but not this assertion (in fact I<br>
> don't remember ever seeing that). What I'm seeing is this;<br>
<br>
</div>Okay, something funny is going on, but I don't know what yet. That<br>
assertion is coming from liblxc.so->libgnutls->libgcrypt and seems to<br>
be complaining that we're unlocking something that is already unlocked.<br>
So I compiled lxc without GNUTLS support (by commenting out the check<br>
for it in <a href="http://configure.ac" target="_blank">configure.ac</a>) and now I get past that and get hangs similar to<br>
yours.<br>
<br>
Interestingly, I modified your program to just do the create and<br>
destroy and not the start nor stop and I still get the hangs during the<br>
creation part.<br>
<div class="HOEnZb"><div class="h5"><br>
> * lxc-test-concurrent get stuck<br>
><br>
> [caglar@qgq:~] sudo lxc-test-concurrent<br>
> Executing (create) for 5 containers...<br>
><br>
> * ps auwxf shows this (so no rsync etc. running anymore)<br>
><br>
> caglar 21004 0.2 0.2 51344 4868 ? S 11:59 0:00<br>
> mosh-server new -s -c 256 -l LANG=en_US.UTF-8<br>
> caglar 21005 0.0 0.2 23068 4412 pts/2 Ss 11:59 0:00 \_<br>
> -bash root 27347 0.0 0.1 60248 2080 pts/2 S+ 12:03<br>
> 0:00 \_ sudo lxc-test-concurrent<br>
> root 27348 0.0 0.0 383816 884 pts/2 Sl+ 12:03 0:00<br>
> \_ lxc-test-concurrent<br>
> root 27354 0.0 0.0 381684 408 pts/2 S+ 12:03 0:00<br>
> \_ lxc-test-concurrent<br>
><br>
> * strace give this<br>
><br>
> [caglar@qgq:~/Projects/lxc(staging)] sudo strace -p 27354<br>
> Process 27354 attached - interrupt to quit<br>
> futex(0x7fdc68b82cc0, FUTEX_WAIT_PRIVATE, 2, NULL^C <unfinished ...><br>
> Process 27354 detached<br>
> [caglar@qgq:~/Projects/lxc(staging)] sudo strace -p 27348<br>
> Process 27348 attached - interrupt to quit<br>
> futex(0x7fdc65f3d9d0, FUTEX_WAIT, 27353, NULL^C <unfinished ...><br>
> Process 27348 detached<br>
> [caglar@qgq:~/Projects/lxc(staging)] sudo strace -p 27347<br>
> Process 27347 attached - interrupt to quit<br>
> select(6, [3 5], [], NULL, NULL^C <unfinished ...><br>
> Process 27347 detached<br>
><br>
> * lxc-ls<br>
><br>
> [caglar@qgq:~/Projects/lxc(staging)] sudo lxc-ls --fancy<br>
> NAME STATE IPV4 IPV6<br>
> ------------------------------<br>
> 0 STOPPED - -<br>
> 1 STOPPED - -<br>
> 2 STOPPED - -<br>
> 3 STOPPED - -<br>
> 4 STOPPED - -<br>
><br>
> * /var/lib/lxc/4/partial still there<br>
><br>
> [caglar@qgq:/var/lib/lxc] ls /var/lib/lxc/*<br>
> /var/lib/lxc/lxc-monitord.log<br>
><br>
> /var/lib/lxc/0:<br>
> config fstab rootfs<br>
><br>
> /var/lib/lxc/1:<br>
> config fstab rootfs<br>
><br>
> /var/lib/lxc/2:<br>
> config fstab rootfs<br>
><br>
> /var/lib/lxc/3:<br>
> config fstab rootfs<br>
><br>
> /var/lib/lxc/4:<br>
> config partial rootfs<br>
><br>
> /var/lib/lxc/bleach:<br>
> config fstab lxc_snapshots rootfs<br>
><br>
> /var/lib/lxc/bleach-ng:<br>
> config delta0 fstab lxc_rdepends rootfs<br>
> <br>
<br>
</div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br>S.Çağlar Onur <<a href="mailto:caglar@10ur.org">caglar@10ur.org</a>>
</div>