[lxc-users] Memory problem in LXC causes host to crash
Danijel Vargek, Continum
danijel.vargek at continum.net
Thu Aug 28 09:41:06 UTC 2014
Hi all,
we are running a LXC-Host with several testing containers (14 at the moment).
The host itself is on Ubuntu 14.04, with 3.13.0-32 Kernel. The containers
are running Debian Wheezy.
>From time to time the host machine completely crashes, probably due to containers
eating up too much ram. We already limited every container via cgroup (cpu and ram),
but still receive this behaviour.
Our suspection is, that java on some of the containers isn't correctly limited, which
leads to crashing the host machine.
Does anybody got similar expirience, or is there something missing when limiting containers
via cgroup?
This is the syslog entry for the last crash (host machine + one of the containers):
#### HOST ####
Aug 26 13:33:10 node04 kernel: [87282.555841] Modules linked in:<4>[87282.555841] Call Trace:
Aug 26 13:33:10 node04 kernel: [87282.555841] [<ffffffff811458c4>] perf_event_overflow+0x14/0x20
Aug 26 13:33:10 node04 kernel: [87282.555841] [<ffffffff8136e9ed>] ? __write_lock_failed+0xd/0x20
Aug 26 13:33:10 node04 kernel: [87282.555841] ---[ end trace 71798cbdeee56afd ]---
Aug 26 13:33:10 node04 kernel: [87304.156008] RAX: ffff881018a2b2e8 RBX: ffff880815771e28 RCX: 0000000000000006
Aug 26 13:33:10 node04 kernel: [87304.156008] Stack:
Aug 26 13:33:10 node04 kernel: [87304.156008] [<ffffffff81152384>] pagefault_out_of_memory+0x14/0x80
Aug 26 13:33:10 node04 kernel: [87304.156008] [<ffffffff81727fda>] do_page_fault+0x1a/0x70
Aug 26 13:33:18 node04 kernel: [87312.204006] FS: 00007f58c7c59700(0000) GS:ffff88103f940000(0000) knlGS:0000000000000000
Aug 26 13:33:18 node04 kernel: [87312.204006] Stack:
Aug 26 13:33:18 node04 kernel: [87312.204006] ffff880ab78e3c70 ffffffff8160c3a8 ffff88103f914f00 ffff88103f914f00
Aug 26 13:33:18 node04 kernel: [87312.204006] [<ffffffff810dba85>] smp_call_function_single+0xe5/0x190
Aug 26 13:33:18 node04 kernel: [87312.204006] [<ffffffff810dbeb6>] smp_call_function_many+0x286/0x2d0
Aug 26 13:33:18 node04 kernel: [87312.204006] [<ffffffff811814d5>] change_protection+0x65/0xb0
Aug 26 13:33:18 node04 kernel: [87312.204006] [<ffffffff8172423c>] retint_signal+0x48/0x8c
##############
#### Container ####
Aug 26 13:32:45 ff01 kernel: [87279.009427] CPU: 0 PID: 773 Comm: java Tainted: GF O 3.13.0-32-generic #57-Ubuntu
Aug 26 13:32:45 ff01 kernel: [87279.009442] Call Trace:
Aug 26 13:32:45 ff01 kernel: [87279.009470] [<ffffffff811b388c>] mem_cgroup_oom_synchronize+0x4fc/0x540
Aug 26 13:32:45 ff01 kernel: [87279.009502] [<ffffffff81724448>] page_fault+0x28/0x30
Aug 26 13:32:45 ff01 kernel: [87279.009650] [23109] 0 23109 32444 422 29 0 0 console-kit-dae
Aug 26 13:32:45 ff01 kernel: [87279.009779] [ 714] 1000 714 4999 182 12 0 0 wrapper-linux-x
###################
Please tell me if you need additional information.
Regards,
Danijel Vargek
--
Danijel Vargek
Systemadministrator Unix
Continum AG
Bismarckallee 7b-d
D-79098 Freiburg i. Br.
Tel.: +49 761 217111-77
Fax.: +49 761 217111-99
http://www.continum.net
Sitz der Gesellschaft: Freiburg im Breisgau
Registergericht: Amtsgericht Freiburg, HRB 6866
Vorstand: Volker T. Mueller
Vorsitzender d. Aufsichtsrats: Bernd Straub
More information about the lxc-users
mailing list