NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
dmesg | grep -i nvidia
[ 2.500890] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input5
[ 2.500919] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input6
[ 2.500946] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input7
[ 2.500977] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input8
[ 4768.060904] eglinfo[6073]: segfault at 10 ip 00007f5cf064188c sp 00007ffefb476d40 error 4 in libnvidia-egl-gbm.so.1.1.3[188c,7f5cf0641000+3000] likely on CPU 4 (core 8, socket 0)
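The dmesg output above shows only the HDMI audio device and a userspace segfault, with no lines from the nvidia kernel module itself, which suggests the module never loaded. One way to confirm this is to check whether an nvidia module file exists for the running kernel. The sketch below assumes modules live under /lib/modules/<release>; module_status is a hypothetical helper name, not part of any NVIDIA tool.

```shell
#!/bin/sh
# Sketch: check whether an nvidia kernel module file exists for a given kernel.
# module_status is a hypothetical helper; the /lib/modules default is an assumption.
module_status() {
    rel=$1                      # kernel release, as printed by `uname -r`
    root=${2:-/lib/modules}     # modules root (overridable, e.g. for testing)
    if [ ! -d "$root/$rel" ]; then
        echo "no-modules-dir"
    elif find "$root/$rel" -type f -name 'nvidia*.ko*' | grep -q .; then
        echo "present"
    else
        echo "missing"
    fi
}

echo "running kernel: $(uname -r)"
echo "nvidia module:  $(module_status "$(uname -r)")"
```

A result of "missing" for the running kernel, while an older kernel directory reports "present", is consistent with a kernel upgrade that left the driver module unbuilt. If the driver was installed through DKMS, `dkms status` also lists the kernels each module was built for.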
The kernel was upgraded, so the NVIDIA driver has to be reinstalled!
CUR_REL=$(uname -r)
# Install the headers for the running kernel (needed to build the driver module)
apt install -y "linux-headers-$CUR_REL"
echo "--- Holding the packages for the current kernel to prevent automatic upgrades ---"
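# Match the release as a literal string in the package name, installed packages only
PACKAGES=$(dpkg -l | awk -v rel="$CUR_REL" '$1 == "ii" && index($2, rel) {print $2}')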
if [ -z "$PACKAGES" ]; then
    echo "Error: no packages found for the current kernel; check the dpkg state." >&2
    exit 1
fi
for PKG in $PACKAGES; do
    echo "Holding: $PKG"
    apt-mark hold "$PKG"
done
apt-mark showhold
## Hold the kernel packages so that a routine apt upgrade skips them
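Held packages stay pinned until the hold is removed, so a deliberate kernel upgrade later needs the reverse step. A minimal sketch, assuming the holds were placed on packages whose names contain the kernel release; held_for_release and release_holds are hypothetical helper names, while `apt-mark showhold` and `apt-mark unhold` are the standard commands.

```shell
#!/bin/sh
# Sketch: release the holds placed on one kernel release.
# held_for_release and release_holds are hypothetical helper names.

# Read package names on stdin; print those containing the release string.
# grep -F treats the dots in the release as literal characters.
held_for_release() {
    grep -F -- "$1" || true
}

# Unhold every held package that matches the given release (run as root).
release_holds() {
    apt-mark showhold | held_for_release "$1" | while read -r pkg; do
        echo "Releasing hold: $pkg"
        apt-mark unhold "$pkg"
    done
}

# Example (not run automatically):
#   release_holds "$(uname -r)"
```

After releasing the holds and upgrading the kernel, the driver module has to be rebuilt or reinstalled for the new kernel before nvidia-smi will work again.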
