系統在沒有人使用的情況下自己彈出諸如以下關於內核的報錯
[root@bkce tmp]#
Message from syslogd@bkce at Oct 13 14:25:00 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 26s! [mysqld:2875]
Message from syslogd@bkce at Oct 13 14:25:00 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 26s! [xfsaild/dm-0:1059]
Message from syslogd@bkce at Oct 13 14:25:00 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [consul:3503]
Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 24s! [basereport:18282]
Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 34s! [kworker/u16:0:10815]
Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 35s! [cmdb_cloudserve:26778]
Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 34s! [consul:31586]
Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 25s! [cmdb_toposerver:2713]
Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 25s! [basereport:15883]
Message from syslogd@bkce at Oct 13 15:01:33 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#7 stuck for 25s! [cmdb_procserver:26557]
Message from syslogd@bkce at Oct 13 15:04:28 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 24s! [1_scheduler:9550]
Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [celery:32038]
Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 22s! [supervisord:31880]
Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [basereport:22621]
Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 22s! [python:32031]
Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [supervisord:30968]
Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [ksoftirqd/5:34]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 34s! [xfsaild/dm-0:1059]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 34s! [kworker/2:2:2083]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 29s! [python:31737]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 29s! [kworker/5:2:16355]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#7 stuck for 29s! [cmdb_authserver:26540]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 29s! [exceptionbeat:21758]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 34s! [ksoftirqd/0:6]
Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 29s! [kworker/u16:0:10815]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 21s! [6_scheduler:9579]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 21s! [uwsgi:29155]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#7 stuck for 21s! [consul:8168]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 21s! [python:32031]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 21s! [gunicorn:23944]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 21s! [dataWorker:8717]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 21s! [xfsaild/dm-0:1059]
Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 21s! [SyncThread:0:5078]
內核軟死鎖(soft lockup)
Soft lockup:這個bug沒有讓系統徹底死機,但是若干個進程(或者kernel thread)被鎖死在了某個狀態(一般在內核區域),很多情況下這個是由於內核鎖的使用的問題。
出現死鎖原因
1、CPU高負載時間過長
2、服務器電源供電不足,導致CPU電壓不穩定
3、vcpus超過物理cpu cores
4、虛機所在的宿主機的CPU太忙或磁盤IO太高
5、虛機機的CPU太忙或磁盤IO太高
6、VM網卡驅動存在bug,處理高水位流量時存在bug導致CPU死鎖
7、BIOS開啟了超頻,導致超頻時電壓不穩,容易出現CPU死鎖
8、Linux kernel或KVM存在bug
9、BIOS Intel C-State開啟導致,關閉可解決
10、BIOS spread spectrum開啟導致
解決辦法
echo 30 > /proc/sys/kernel/watchdog_thresh
echo "kernel.watchdog_thresh=30" >> /etc/sysctl.conf
sysctl -w kernel.watchdog_thresh=30
sysctl -q vm.swappiness
sysctl -p
然后重啟系統