系統自己彈出諸如 kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 26s [mysqld:2875]


系統在沒有人使用的情況下自己彈出諸如以下關於內核的報錯

[root@bkce tmp]#
Message from syslogd@bkce at Oct 13 14:25:00 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 26s! [mysqld:2875]

Message from syslogd@bkce at Oct 13 14:25:00 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 26s! [xfsaild/dm-0:1059]

Message from syslogd@bkce at Oct 13 14:25:00 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [consul:3503]

Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 24s! [basereport:18282]

Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 34s! [kworker/u16:0:10815]

Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 35s! [cmdb_cloudserve:26778]

Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 34s! [consul:31586]

Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 25s! [cmdb_toposerver:2713]

Message from syslogd@bkce at Oct 13 15:01:32 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 25s! [basereport:15883]

Message from syslogd@bkce at Oct 13 15:01:33 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#7 stuck for 25s! [cmdb_procserver:26557]

Message from syslogd@bkce at Oct 13 15:04:28 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 24s! [1_scheduler:9550]

Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [celery:32038]

Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 22s! [supervisord:31880]

Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [basereport:22621]

Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 22s! [python:32031]

Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [supervisord:30968]

Message from syslogd@bkce at Oct 13 15:05:27 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [ksoftirqd/5:34]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 34s! [xfsaild/dm-0:1059]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 34s! [kworker/2:2:2083]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 29s! [python:31737]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 29s! [kworker/5:2:16355]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#7 stuck for 29s! [cmdb_authserver:26540]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 29s! [exceptionbeat:21758]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 34s! [ksoftirqd/0:6]

Message from syslogd@bkce at Oct 13 15:06:21 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 29s! [kworker/u16:0:10815]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#5 stuck for 21s! [6_scheduler:9579]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#6 stuck for 21s! [uwsgi:29155]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#7 stuck for 21s! [consul:8168]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#3 stuck for 21s! [python:32031]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#2 stuck for 21s! [gunicorn:23944]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#1 stuck for 21s! [dataWorker:8717]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 21s! [xfsaild/dm-0:1059]

Message from syslogd@bkce at Oct 13 15:27:56 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#4 stuck for 21s! [SyncThread:0:5078]

 

內核軟死鎖(soft lockup)

Soft lockup:這個bug沒有讓系統徹底死機,但是若干個進程(或者kernel thread)被鎖死在了某個狀態(一般在內核區域),很多情況下這個是由於內核鎖的使用的問題。

出現死鎖原因

1、CPU高負載時間過長
2、服務器電源供電不足,導致CPU電壓不穩定
3、vcpus超過物理cpu cores
4、虛機所在的宿主機的CPU太忙或磁盤IO太高
5、虛機機的CPU太忙或磁盤IO太高
6、VM網卡驅動存在bug,處理高水位流量時存在bug導致CPU死鎖
7、BIOS開啟了超頻,導致超頻時電壓不穩,容易出現CPU死鎖
8、Linux kernel或KVM存在bug
9、BIOS Intel C-State開啟導致,關閉可解決
10、BIOS spread spectrum開啟導致

解決辦法

echo 30 > /proc/sys/kernel/watchdog_thresh
echo "kernel.watchdog_thresh=30" >> /etc/sysctl.conf
sysctl -w kernel.watchdog_thresh=30
sysctl -q vm.swappiness
sysctl -p

然后重啟系統


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM