PC randomly crashing

Questions about other topics - please check if your question fits better in another category before posting here
Forum rules
Before you post please read how to get help
AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 5:00 pm

I got a PC running linux mint 18.3, and one or two months ago started randomly crashing. Some days it works perfectly, others it crashes with no apparent reason, sometimes few minutes after boot and other times after longer periods of time.
My specs are:
Mobo: ASUS ROG Strix H270F
i7 7700k
2x 8 GB Corsair Vengeance LPX
SSD NVMe Samsung 960 pro 512 GB
Corsair PSU

If I press CTRL ALT F10 while the PC is crashing, I get ext4-fs errors, IO errors and PCIe errors, i got pics of the errors, going to upload them after writing the post.
The PC also crashes if I run windows.
If I reboot, the pc boots normally.
If I reseat the SSD, the frequency of the crashes seems to get better, but it isn't permanent
Can someone help me get it working? I need the PC to do work and it is impossible to work like this.

edit: added images of the errors
Attachments
IMG-20190901-WA0005.jpg
IMG-20190901-WA0004.jpg
IMG-20190901-WA0006.jpg
Last edited by AndreZoio on Sun Sep 01, 2019 5:23 pm, edited 1 time in total.

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 5:22 pm

Do you also have a discrete graphics card in there? Because a power supply issue seems likeliest. Without a discrete graphics card I suppose you'd have trouble loading any PSU to a significant percentage of its specs (but if its an old PSU, it may still be the issue). What's its wattage?

Regardless, be sure that you have newest BIOS version, look at possible related NVMe settings, reseat NVMe drive, test/reseat memory, reseat power connectors, ...

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 5:26 pm

I got no discrete GPU, if i reseat the SSD the problem seems to get better temporarily, I got the latest BIOS, my PSU is 500 watts and it is a high end one. Everything on the pc is less than 2 years old. As a matter of fact, I just reseated the SSD and the pc still didn't crash while I write the reply. IS it possible the problem is the dust in the case? it is somewhat dusty.

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 5:31 pm

Well, possible, but I'd doubt it. Frankly, I don't know how hot a 960 gets. Does it have a heat sink? If not, should it perhaps? Is is physically situated in enough airflow from a case fan?

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 5:37 pm

It just crashed. I used it with a PCIe adapter with a fan, but took it off and used on the mobo slot to test. Crashes the same on both

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 5:56 pm

It's of course possible that the SSD is broken, and I'd carefully look at its connector to see if for example a trace has "bundled up" and is barely making contact (to go with the "after reseat temporarily better" thing) but it would for now then seem best to assume it's not the SSD -- even though it would still be interesting to see what happens with a SATA drive if you have one lying about.

Memory problems are most common by far but your screenshots seem to not point there. It is of course also possible that your CPU has a seating or temperature issue -- and it's still possible that a capacitor in your PSU gave out. Have a different one around to test that?

A memory test never hurts (you can pick memtest86 from the Mint Live USB/DVD's startup menu) but perhaps the most interesting thing would be to reboot into Windows and monitor temperatures with whatever ASUS-specific tool you got for that.

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 7:05 pm

I'm creating a bootable Linux mint 19.1 disk to do the memtest. I'll relocate my notebook's ssd to the desktop later to see if that's the problem. but i suspect it is the SDD, all the errors are related to it. How can it be that certain days it works flawlessly and others I spend the entire day trying to fix it and it doesn't work?

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 7:17 pm

Nondeterminism is most easily explained as PSU issue (I do suppose you have both 4-pin/the full 8-pin CPU power connector(s) connected?) which is however also something you'd not immediately be able to diagnose conclusively with the notebook SSD, assuming it's SATA. But do note that if you have an e.g. CPU seating issue that could mean flakiness of some specific PCIe/NVMe lanes... although come to think of it, the same issue as NVMe and via PCIe adapter seems to contradict that.

You seem to have this system for 2 years. Did this start at some specified time?

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 7:29 pm

Yes, it started like 2 months ago, at first crashes were rare and then increasingly occurred, and now some days it works fine and some days is unusable. I finished my live boot USB and booted it, now I'm letting it sit for some time to see if it crashes.

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 7:33 pm

Right, but not correlated with some specific event then. In any case, while sitting, try inxi -s or sudo inxi -s from a terminal a few times to look at temps.

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 7:50 pm

So, the live USB still didn't crash.
When running inxi -s, temps are 29.8 for cpu and 27.8 for the mobo. Now I'm going to boot from the nvme again to test the temperatures
EDIT: ok, so I logged into the ssd and the temps are the same.

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 8:01 pm

From the Live USB the drive wasn't accessed but given the (as far I have myself seen) scarcity of SSDs actually failing 'm still not giving up on PSU: those I've seen fail many more times and it could easily allow for the degree of randomness you are experiencing. As said, if the replacement is a SATA drive it's not in fact given that if it shows no problems it's NOT in fact PSU but SSD... but, well, that's still the next test I suppose.

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 8:13 pm

l lnstalled the sata old form factor ssd from my notebook and now going to let it sit for a while.
Also, I can access the nvme ssd normally
EDIT: as of now, no crash when booted with the SATA SSD, maybe the problem isn't the PSU?
Edit 2: after booting to the NVME SSD, it quickly crashed

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 8:59 pm

SATA-power and NVMe-power are on different connectors so it can still be. I would say that at the moment it could be

1. NVMe drive. Try accessing it intensively while booted from the SATA drive, i.e., sudo dd if=/dev/sdz of=/dev/null bs=4M for the right device specifier; inxi -D will tell
2. PSU. Actually, still also tested by above...
2. Motherboard, and specifically also power-related, VRM. Seems untestable in isolation.

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 9:26 pm

I give up for today, the pc just won't work without failing in 15 mins and i'm getting sad about it :( i spent a lot of money on a good PC so I could work efficiently and now it is just unreliable.

andre@andre-System-i7 ~ $ inxi -Fxz
System: Host: andre-System-i7 Kernel: 4.15.0-58-generic x86_64 (64 bit gcc: 5.4.0)
Desktop: Cinnamon 3.6.7 (Gtk 3.18.9) Distro: Linux Mint 18.3 Sylvia
Machine: Mobo: ASUSTeK model: STRIX H270F GAMING v: Rev X.0x
Bios: American Megatrends v: 1205 date: 05/11/2018
CPU: Quad core Intel Core i7-7700K (-HT-MCP-) cache: 8192 KB
flags: (lm nx sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx) bmips: 33600
clock speeds: max: 4500 MHz 1: 800 MHz 2: 800 MHz 3: 800 MHz
4: 800 MHz 5: 800 MHz 6: 800 MHz 7: 800 MHz 8: 800 MHz
Graphics: Card: Intel HD Graphics 630 bus-ID: 00:02.0
Display Server: X.Org 1.18.4 drivers: intel (unloaded: fbdev,vesa)
Resolution: 1440x900@74.98hz
GLX Renderer: Mesa DRI Intel HD Graphics 630 (Kaby Lake GT2)
GLX Version: 3.0 Mesa 18.0.5 Direct Rendering: Yes
Audio: Card Intel 200 Series PCH HD Audio
driver: snd_hda_intel bus-ID: 00:1f.3
Sound: Advanced Linux Sound Architecture v: k4.15.0-58-generic
Network: Card: Intel Ethernet Connection (2) I219-V
driver: e1000e v: 3.2.6-k bus-ID: 00:1f.6
IF: enp0s31f6 state: up speed: 100 Mbps duplex: full mac: <filter>
Drives: HDD Total Size: 128.0GB (74.9% used)
ID-1: /dev/nvme0n1 model: N/A size: 512.1GB
ID-2: /dev/sda model: KINGSTON_SV300S3 size: 120.0GB
ID-3: USB /dev/sdb model: STORAGE_DEVICE size: 7.9GB
Partition: ID-1: / size: 97G used: 71G (77%) fs: ext4 dev: /dev/nvme0n1p5
ID-2: swap-1 size: 17.04GB used: 0.00GB (0%) fs: swap dev: /dev/nvme0n1p6
RAID: No RAID devices: /proc/mdstat, md_mod kernel module present
Sensors: System Temperatures: cpu: 29.8C mobo: 27.8C
Fan Speeds (in rpm): cpu: 0
Info: Processes: 291 Uptime: 8 min Memory: 1078.7/15910.9MB
Init: systemd runlevel: 5 Gcc sys: 5.4.0
Client: Shell (bash 4.3.481) inxi: 2.2.35

andre@andre-System-i7 ~ $ inxi -D
Drives: HDD Total Size: 128.0GB (74.9% used)
ID-1: /dev/nvme0n1 model: N/A size: 512.1GB
ID-2: /dev/sda model: KINGSTON_SV300S3 size: 120.0GB
ID-3: USB /dev/sdb model: STORAGE_DEVICE size: 7.9GB

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 9:38 pm

Ah yes, /dev/nvme0n1 would be that right device specifier in the case of NVMe. You haven't answered whether or not accessing it while booted from SATA made the system crash, or at least, spam the same messages to dmesg -w as when it does crash.

Your board is a non-OC board and we need to take into account the fact that it used to be stable, supposedly changed without you poking around in the BIOS; without e.g. overclocking your PCIe bus if that's possikle at all on your model. Still, resetting the BIOS to defaults, seeing if you can e.g. force the NVMe settings to PCIe 4x instead of Auto, or upping a voltage somewhere relevant would be the only suggestions I've left, other than:

1. Trying with a different 960 or trying the 960 in a different system
2. Trying with a different board
3. Trying with a different PSU

User avatar
trytip
Level 11
Level 11
Posts: 3689
Joined: Tue Jul 05, 2016 1:20 pm

Re: PC randomly crashing

Post by trytip » Sun Sep 01, 2019 9:41 pm

do timeshift snapshot to restore it to your reality and try a different path.
https://wiki.ubuntu.com/Kernel/LTSEnablementStack

Code: Select all

sudo apt-get install --install-recommends linux-generic-hwe-18.04 xserver-xorg-hwe-18.04
and the boot into that kernel 5.0.0-25
Image

rene
Level 12
Level 12
Posts: 4148
Joined: Sun Mar 27, 2016 6:58 pm

Re: PC randomly crashing

Post by rene » Sun Sep 01, 2019 9:43 pm

I'd on the other hand NOT go chase down software issues, given that the same thing happens on Windows.

AndreZoio
Level 1
Level 1
Posts: 22
Joined: Sun Sep 01, 2019 4:47 pm

Re: PC randomly crashing

Post by AndreZoio » Sun Sep 01, 2019 9:57 pm

trytip wrote:
Sun Sep 01, 2019 9:41 pm
do timeshift snapshot to restore it to your reality and try a different path.
https://wiki.ubuntu.com/Kernel/LTSEnablementStack

Code: Select all

sudo apt-get install --install-recommends linux-generic-hwe-18.04 xserver-xorg-hwe-18.04
and the boot into that kernel 5.0.0-25
andre@andre-System-i7 ~ $ sudo apt-get install --install-recommends linux-generic-hwe-18.04 xserver-xorg-hwe-18.04
Reading package lists... Done
Building dependency tree
Reading state information... Done
E: Unable to locate package linux-generic-hwe-18.04
E: Couldn't find any package by glob 'linux-generic-hwe-18.04'
E: Couldn't find any package by regex 'linux-generic-hwe-18.04'
E: Unable to locate package xserver-xorg-hwe-18.04
E: Couldn't find any package by glob 'xserver-xorg-hwe-18.04'
E: Couldn't find any package by regex 'xserver-xorg-hwe-18.04'

User avatar
trytip
Level 11
Level 11
Posts: 3689
Joined: Tue Jul 05, 2016 1:20 pm

Re: PC randomly crashing

Post by trytip » Sun Sep 01, 2019 9:59 pm

if you have hardware issues, not much can be done if that can't be fixed

Code: Select all

mint03@mint03:~$ sudo apt-get install --install-recommends linux-generic-hwe-18.04 xserver-xorg-hwe-18.04
[sudo] password for mint03:       
Reading package lists... Done
Building dependency tree       
Reading state information... Done
linux-generic-hwe-18.04 is already the newest version (5.0.0.25.82).
The following additional packages will be installed:
  xserver-xorg-core-hwe-18.04 xserver-xorg-input-all-hwe-18.04
  xserver-xorg-input-libinput-hwe-18.04 xserver-xorg-legacy-hwe-18.04
  xserver-xorg-video-all-hwe-18.04 xserver-xorg-video-amdgpu-hwe-18.04
  xserver-xorg-video-ati-hwe-18.04 xserver-xorg-video-fbdev-hwe-18.04
  xserver-xorg-video-intel-hwe-18.04 xserver-xorg-video-nouveau-hwe-18.04
  xserver-xorg-video-qxl-hwe-18.04 xserver-xorg-video-radeon-hwe-18.04
  xserver-xorg-video-vesa-hwe-18.04 xserver-xorg-video-vmware-hwe-18.04
Suggested packages:
  xfonts-100dpi | xfonts-75dpi firmware-amd-graphics xserver-xorg-video-r128
  xserver-xorg-video-mach64 firmware-misc-nonfree
Recommended packages:
  xserver-xorg-input-wacom-hwe-18.04
The following packages will be REMOVED:
  xserver-xorg xserver-xorg-core xserver-xorg-input-all
  xserver-xorg-input-libinput xserver-xorg-video-amdgpu
The following NEW packages will be installed:
  xserver-xorg-core-hwe-18.04 xserver-xorg-hwe-18.04
  xserver-xorg-input-all-hwe-18.04 xserver-xorg-input-libinput-hwe-18.04
  xserver-xorg-legacy-hwe-18.04 xserver-xorg-video-all-hwe-18.04
  xserver-xorg-video-amdgpu-hwe-18.04 xserver-xorg-video-ati-hwe-18.04
  xserver-xorg-video-fbdev-hwe-18.04 xserver-xorg-video-intel-hwe-18.04
  xserver-xorg-video-nouveau-hwe-18.04 xserver-xorg-video-qxl-hwe-18.04
  xserver-xorg-video-radeon-hwe-18.04 xserver-xorg-video-vesa-hwe-18.04
  xserver-xorg-video-vmware-hwe-18.04
0 upgraded, 15 newly installed, 5 to remove and 2 not upgraded.
Need to get 2,634 kB of archives.
After this operation, 4,681 kB of additional disk space will be used.
Do you want to continue? [Y/n] 
Image

Post Reply

Return to “Other topics”