I recently put together a desktop computer for heavy data processing, and installed mint MATE edition on it.
Everything was working smoothly for months until a few weeks ago, where the system started to freeze seemingly randomly. It does so now on average every 3 days (+/- 2 days). I hadn't done anything noteworthy around that time, such as an upgrade or anything of the sort.
At first I thought it might be the CPU overheating so I upgraded the CPU fan to a much better one, but that didn't help. I also started monitoring the temps with the "sensors" applet, but at least based on the readings the cores rarely exceed 60C (even as captured by the display "freeze"). Could it just spark so fast that it freezes the computer before the sensors capture it?
I also ran
Code: Select all
sudo apt-get install linux-image-generic
When I look at /var/log/syslog or /var/log/dmesg, I don't see any obvious error message (though the latter is a bit too cryptic for me)
The puzzling part for me is that it was working perfectly for months... Any help in debugging this would be appreciated!
PS1: the only thing that keeps on working after freezing is the music, for about a minute or so and then it stops
PS2: magic key + REISUB does seem to work and restarts successfully
PS3: output for inxi -Fxxxz:
Code: Select all
System:
Kernel: 5.4.0-73-generic x86_64 bits: 64 compiler: gcc v: 9.3.0
Desktop: MATE 1.24.0 info: mate-panel wm: marco 1.24.0 dm: LightDM 1.30.0
Distro: Linux Mint 20 Ulyana base: Ubuntu 20.04 focal
Machine:
Type: Desktop Mobo: ASUSTeK model: TUF GAMING X570-PLUS (WI-FI)
v: Rev X.0x serial: <filter> UEFI: American Megatrends v: 1407
date: 04/01/2020
CPU:
Topology: 16-Core (2-Die) model: AMD Ryzen 9 3950X bits: 64
type: MT MCP MCM arch: Zen L2 cache: 8192 KiB
flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
bogomips: 223582
Speed: 2114 MHz min/max: 2200/3500 MHz boost: enabled Core speeds (MHz):
1: 1875 2: 2793 3: 2196 4: 2193 5: 2195 6: 2196 7: 2196 8: 2195 9: 1977
10: 2421 11: 2051 12: 2075 13: 1923 14: 2195 15: 2196 16: 2190 17: 2189
18: 2186 19: 2191 20: 2194 21: 2195 22: 2196 23: 2190 24: 1908 25: 2197
26: 2192 27: 2194 28: 2192 29: 2192 30: 2195 31: 2196 32: 2196
Graphics:
Device-1: NVIDIA GK208B [GeForce GT 710] vendor: Gigabyte driver: nouveau
v: kernel bus ID: 08:00.0 chip ID: 10de:128b
Display: x11 server: X.Org 1.20.8 driver: modesetting unloaded: fbdev,vesa
compositor: marco v: 1.24.0 resolution: 1920x1080~60Hz
OpenGL: renderer: NV106 v: 4.3 Mesa 20.0.4 direct render: Yes
Audio:
Device-1: NVIDIA GK208 HDMI/DP Audio vendor: Gigabyte
driver: snd_hda_intel v: kernel bus ID: 08:00.1 chip ID: 10de:0e0f
Device-2: AMD Starship/Matisse HD Audio vendor: ASUSTeK
driver: snd_hda_intel v: kernel bus ID: 0a:00.4 chip ID: 1022:1487
Sound Server: ALSA v: k5.4.0-73-generic
Network:
Device-1: Intel Wireless-AC 9260 driver: iwlwifi v: kernel bus ID: 03:00.0
chip ID: 8086:2526
IF: wlp3s0 state: up mac: <filter>
Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
vendor: ASUSTeK driver: r8169 v: kernel port: f000 bus ID: 04:00.0
chip ID: 10ec:8168
IF: enp4s0 state: down mac: <filter>
Drives:
Local Storage: total: 1.86 TiB used: 1.38 TiB (74.1%)
ID-1: /dev/sda vendor: Samsung model: SSD 860 PRO 2TB size: 1.86 TiB
speed: 6.0 Gb/s serial: <filter> rev: 2B6Q scheme: GPT
Partition:
ID-1: / size: 1.83 TiB used: 1.38 TiB (75.3%) fs: ext4 dev: /dev/sda2
Sensors:
System Temperatures: cpu: 49.2 C mobo: N/A gpu: nouveau temp: 45 C
Fan Speeds (RPM): N/A gpu: nouveau fan: 2760
Info:
Processes: 465 Uptime: 35m Memory: 62.79 GiB used: 2.06 GiB (3.3%)
Init: systemd v: 245 runlevel: 5 Compilers: gcc: 9.3.0 alt: 9 Shell: bash
v: 5.0.16 running in: mate-terminal inxi: 3.0.38