[Lxc-users] concurrent aptitude/dpkg runs in separate containers --> bork bork bork
Trent W. Buck
twb at cybersource.com.au
Thu Feb 3 06:08:21 UTC 2011
twb at cybersource.com.au (Trent W. Buck)
writes:
> I'm being a bit more patient than last time, and I think they ARE
> proceeding, just REALLY slowly. Meanwhile aptitude consumes a 100% of a
> core busy-waiting for a response from dpkg :-/
>
> They look like this:
>
> $ ssh omega cat /proc/7713/stack
> Warning: Permanently added 'omega,192.168.155.22' (RSA) to the list of known hosts.
> [<ffffffff811669b7>] sync_inodes_sb+0x87/0xb0
> [<ffffffff8116b292>] __sync_filesystem+0x82/0x90
> [<ffffffff8116b379>] sync_filesystems+0xd9/0x130
> [<ffffffff8116b431>] sys_sync+0x21/0x40
> [<ffffffff810121b2>] system_call_fastpath+0x16/0x1b
> [<ffffffffffffffff>] 0xffffffffffffffff
>
> $ ssh omega cat /proc/5619/stack
> Warning: Permanently added 'omega,192.168.155.22' (RSA) to the list of known hosts.
> [<ffffffff81222865>] jbd2_log_wait_commit+0xc5/0x150
> [<ffffffff811d7a2c>] ext4_sync_file+0x13c/0x2e0
> [<ffffffff8116b051>] vfs_fsync_range+0xa1/0xe0
> [<ffffffff8116b0fd>] vfs_fsync+0x1d/0x20
> [<ffffffff8116b13e>] do_fsync+0x3e/0x60
> [<ffffffff8116b190>] sys_fsync+0x10/0x20
> [<ffffffff810121b2>] system_call_fastpath+0x16/0x1b
> [<ffffffffffffffff>] 0xffffffffffffffff
And here's one that is well and truly wedged:
root at omega:~# cat /proc/31430/stack
[<ffffffff811669b7>] sync_inodes_sb+0x87/0xb0
[<ffffffff8116b292>] __sync_filesystem+0x82/0x90
[<ffffffff8116b379>] sync_filesystems+0xd9/0x130
[<ffffffff8116b431>] sys_sync+0x21/0x40
[<ffffffff810121b2>] system_call_fastpath+0x16/0x1b
[<ffffffffffffffff>] 0xffffffffffffffff
In that case, even kill -SEGV'ing upstart won't stop it. I got that
with only a single dpkg run (i.e. no concurrency), after switching the
container's rootfs from ext4 to ext3, and forcing dpkg[0] to be upgraded
before anything else. Sigh...
I'm THIS CLOSE to giving up and wrapping apt-get in libeatmydata.
[0] I did this because I noticed that lucid's dpkg still suffers from
http://bugs.debian.org/578635
http://bugs.debian.org/605009
https://launchpad.net/bugs/570805
But lucid-updates & lucid-security both contain a version that
contains CLAIMS to address the first of those.
More information about the lxc-users
mailing list