Occasional kernel panic and misboot upon linux mint

Questions about Grub, UEFI,the liveCD and the installer
Forum rules
Before you post read how to get help. Topics in this forum are automatically closed 6 months after creation.
Post Reply
pc_magas
Level 1
Level 1
Posts: 3
Joined: Mon Oct 23, 2023 3:16 pm

Occasional kernel panic and misboot upon linux mint

Post by pc_magas »

On an HP 17-g015dx I have instaled linux mint 21 and sometimes I get the following kernel panic:
https://drive.google.com/file/d/1apBJF1 ... sp=sharing

In the proces of mitigating the issue I tried:
  1. Upgrading the bios
  2. Installing reFind
But upon refind I occasionally I get the following error:
https://drive.google.com/file/d/1vgOTNP ... sp=sharing

At both cases I am able to boot by retrying after 2-3 times for the first image I need to powert cycle my laptop whereas at the latter I just need to press enter and retry.

The issue does not happen all the time but it can happen unexpectedly.

As a means of diagnosing the issue I checked via smartctl the dfisk's health:

Code: Select all

--------------- RUN 1 -------------
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       133
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       120
165 Block_Erase_Count       0x0032   100   100   000    Old_age   Always       -       80
166 Minimum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       1
167 Max_Bad_Blocks_per_Die  0x0032   100   100   ---    Old_age   Always       -       65
168 Maximum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       3
169 Total_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       351
170 Grown_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       0
171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
173 Average_PE_Cycles_TLC   0x0032   100   100   000    Old_age   Always       -       1
174 Unexpected_Power_Loss   0x0032   100   100   000    Old_age   Always       -       38
184 End-to-End_Error        0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   ---    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   070   044   000    Old_age   Always       -       30 (Min/Max 16/44)
199 UDMA_CRC_Error_Count    0x0032   100   100   ---    Old_age   Always       -       0
230 Media_Wearout_Indicator 0x0032   100   100   000    Old_age   Always       -       0x001600140016
232 Available_Reservd_Space 0x0033   100   100   005    Pre-fail  Always       -       100
233 NAND_GB_Written_TLC     0x0032   100   100   ---    Old_age   Always       -       186
234 NAND_GB_Written_SLC     0x0032   100   100   000    Old_age   Always       -       638
241 Host_Writes_GiB         0x0030   100   100   000    Old_age   Offline      -       256
242 Host_Reads_GiB          0x0030   100   100   000    Old_age   Offline      -       218
244 Temp_Throttle_Status    0x0032   000   100   ---    Old_age   Always       -       0
 
------------- RUN 2 ----------------
 
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       133
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       120
165 Block_Erase_Count       0x0032   100   100   000    Old_age   Always       -       80
166 Minimum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       1
167 Max_Bad_Blocks_per_Die  0x0032   100   100   ---    Old_age   Always       -       65
168 Maximum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       3
169 Total_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       351
170 Grown_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       0
171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
173 Average_PE_Cycles_TLC   0x0032   100   100   000    Old_age   Always       -       1
174 Unexpected_Power_Loss   0x0032   100   100   000    Old_age   Always       -       38
184 End-to-End_Error        0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   ---    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   072   044   000    Old_age   Always       -       28 (Min/Max 16/44)
199 UDMA_CRC_Error_Count    0x0032   100   100   ---    Old_age   Always       -       0
230 Media_Wearout_Indicator 0x0032   100   100   000    Old_age   Always       -       0x001600140016
232 Available_Reservd_Space 0x0033   100   100   005    Pre-fail  Always       -       100
233 NAND_GB_Written_TLC     0x0032   100   100   ---    Old_age   Always       -       187
234 NAND_GB_Written_SLC     0x0032   100   100   000    Old_age   Always       -       638
241 Host_Writes_GiB         0x0030   100   100   000    Old_age   Offline      -       256
242 Host_Reads_GiB          0x0030   100   100   000    Old_age   Offline      -       218
244 Temp_Throttle_Status    0x0032   000   100   ---    Old_age   Always       -       0
 
----------------- RUN 3 -----------------------
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       133
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       120
165 Block_Erase_Count       0x0032   100   100   000    Old_age   Always       -       80
166 Minimum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       1
167 Max_Bad_Blocks_per_Die  0x0032   100   100   ---    Old_age   Always       -       65
168 Maximum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       3
169 Total_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       351
170 Grown_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       0
171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
173 Average_PE_Cycles_TLC   0x0032   100   100   000    Old_age   Always       -       1
174 Unexpected_Power_Loss   0x0032   100   100   000    Old_age   Always       -       38
184 End-to-End_Error        0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   ---    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   073   044   000    Old_age   Always       -       27 (Min/Max 16/44)
199 UDMA_CRC_Error_Count    0x0032   100   100   ---    Old_age   Always       -       0
230 Media_Wearout_Indicator 0x0032   100   100   000    Old_age   Always       -       0x001600140016
232 Available_Reservd_Space 0x0033   100   100   005    Pre-fail  Always       -       100
233 NAND_GB_Written_TLC     0x0032   100   100   ---    Old_age   Always       -       187
234 NAND_GB_Written_SLC     0x0032   100   100   000    Old_age   Always       -       638
241 Host_Writes_GiB         0x0030   100   100   000    Old_age   Offline      -       256
242 Host_Reads_GiB          0x0030   100   100   000    Old_age   Offline      -       218
244 Temp_Throttle_Status    0x0032   000   100   ---    Old_age   Always
And I am listing the `blkid` and fdisk 0l commands as well:

Code: Select all


$ sudo blkid
/dev/sda2: UUID="a71de2c3-198a-4c19-bc73-f0352b773e25" BLOCK_SIZE="4096" TYPE="ext4" PARTUUID="f35eb39a-aa22-4670-946b-4e36b8b61b36"
/dev/sda1: UUID="6ABC-C524" BLOCK_SIZE="512" TYPE="vfat" PARTLABEL="EFI System Partition" PARTUUID="eabc212c-0799-42db-a48f-42df0b2a7b15"

Code: Select all

$ sudo fdisk -l
Disk /dev/sda: 223,57 GiB, 240057409536 bytes, 468862128 sectors
Disk model: WDC WDS240G2G0A-
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disklabel type: gpt
Disk identifier: 4E12FEA4-BD47-449E-B21A-D5612754A7D0
 
Device       Start       End   Sectors   Size Type
/dev/sda1     2048   1050623   1048576   512M EFI System
/dev/sda2  1050624 468860927 467810304 223,1G Linux filesystem
Both UUIDs seem to match with the ones at `/etc/refind_linux.conf`

Code: Select all

# cat refind_linux.conf 
"Boot with standard options"  "root=UUID=a71de2c3-198a-4c19-bc73-f0352b773e25 ro quiet splash"
"Boot to single-user mode"    "root=UUID=a71de2c3-198a-4c19-bc73-f0352b773e25 ro quiet splash single"
"Boot with minimal options"   "ro root=UUID=a71de2c3-198a-4c19-bc73-f0352b773e25"

Also, after the kernel panic if I boot into a recovert/single-user mode first and then boot into normal mode I am able to boot as well.

And I have no other idea what to run or how I can diagnose this boot problem?
User avatar
AndyMH
Level 21
Level 21
Posts: 13639
Joined: Fri Mar 04, 2016 5:23 pm
Location: Wiltshire

Re: Occasional kernel panic and misboot upon linux mint

Post by AndyMH »

In spite of smartctl saying the drive is good, I would boot from an install stick and run fsck on your / partition.

If the problem persists I would then try reverting to an earlier kernel.
Thinkcentre M720Q - LM21.3 cinnamon, 4 x T430 - LM21.3 cinnamon, Homebrew desktop i5-8400+GTX1080 Cinnamon 19.0
pc_magas
Level 1
Level 1
Posts: 3
Joined: Mon Oct 23, 2023 3:16 pm

Re: Occasional kernel panic and misboot upon linux mint

Post by pc_magas »

I tried both smartctl and kerneldowngrade into 5.15.0-25-generic and error still persists.

Could this problem caused due to bad sata controller upon motherboard or due to hard disk failure?

I re-looked the output fo smartctl and I noticed this:

Code: Select all

169 Total_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       351
Could it be this my issue? Actually the SSD is dying????
pc_magas
Level 1
Level 1
Posts: 3
Joined: Mon Oct 23, 2023 3:16 pm

Re: Occasional kernel panic and misboot upon linux mint

Post by pc_magas »

In the end it had the UEFI some issues and seems to boot from 001 as it seems in:

Code: Select all

$ sudo efibootmgr -v
BootCurrent: 0001
Timeout: 5 seconds
BootOrder: 0001,2001,2002,2004
Boot0000* Notebook Hard Drive - WDC WDS240G2G0A-00JH30	BBS(HD,Notebook Hard Drive - WDC WDS240G2G0A-00JH30,0x500)................-.L.......L.A.L....................................4........A.........................
Boot0001* ubuntu	HD(1,GPT,f882aa7b-ec31-4d4c-9703-6f6963e7d6a0,0x800,0x100000)/File(\EFI\ubuntu\shimx64.efi)
Boot2001* EFI USB Device	RC
Boot2002* EFI DVD/CDROM	RC
Boot3001* Internal Hard Disk or Solid State Disk	RC
Usually is BIOS set to legacy they boot from 0000 instead of 0001 as it shows upon my desktop:

Code: Select all

BootCurrent: 0000
Timeout: 1 seconds
BootOrder: 0000,0003,0005,0001,0002,0004
Boot0000* ubuntu	HD(1,GPT,47452b13-9404-4092-b7b4-a87086d807cf,0x800,0xbe000)/File(\EFI\UBUNTU\SHIMX64.EFI)
Boot0001* UEFI:CD/DVD Drive	BBS(129,,0x0)
Boot0002* UEFI:Removable Device	BBS(130,,0x0)
Boot0003* Hard Drive	BBS(HD,,0x0)..GO..NO........o.T.E.A.M.T.2.5.3.5.T.1.2.0.G....................A...........................>..Gd-.;.A..MQ..L.0.R.E.O.N.3.M.Q.P.3.0.7.8.T.1.U.6.L.E.R........BO..NO........o.T.O.S.H.I.B.A. .H.D.W.D.1.1.0....................A...........................>..Gd-.;.A..MQ..L. . . . . . . . . . .Y. .1.9.3.D.3.Y.S.F........BO
Boot0004* UEFI:Network Device	BBS(131,,0x0)
Boot0005* USB	BBS(HD,,0x0)..GO..NO........[.G.e.n.e.r.i.c. .S.T.O.R.A.G.E. .D.E.V.I.C.E. .0.5.5.1....................A..........................Gd-.;.A..MQ..L.0.0.0.0.0.0.0.0.4.9.9.3........BO..NO........`.G.e.n.e.r.i.c. .S.T.O.R.A.G.E. .D.E.V.I.C.E. .0.5.5.1....................A...............................Gd-.;.A..MQ..L.0.0.0.0.0.0.0.0.4.9.9.3........BO..NO........`.G.e.n.e.r.i.c. .S.T.O.R.A.G.E. .D.E.V.I.C.E. .0.5.5.1....................A...............................Gd-.;.A..MQ..L.0.0.0.0.0.0.0.0.4.9.9.3........BO..NO........`.G.e.n.e.r.i.c. .S.T.O.R.A.G.E. .D.E.V.I.C.E. .0.5.5.1....................A...............................Gd-.;.A..MQ..L.0.0.0.0.0.0.0.0.4.9.9.3........BO..NO........`.G.e.n.e.r.i.c. .S.T.O.R.A.G.E. .D.E.V.I.C.E. .0.5.5.1....................A...............................Gd-.;.A..MQ..L.0.0.0.0.0.0.0.0.4.9.9.3........BO
Therefore, I opted to install the Linux Mint as MBR. In order to do that upon installer, I chose to manually setup the partitons.
After that and I setup the following partitions:
  • MBR 1MB
  • /boot 1GB as ext4
  • / rest of the disk as ext4
Upon partition configuration In order to configure the MBR partition like this, I used this settings as picture shows bellow:
Mint [Σε λειτουργία] - Oracle VM VirtualBox_003.png

Afterward, I got the following error:
Image

But it is an expected because we are not installing as EFI therefore I pressed Continue.


For those saby in Greek Language Also a tutorial do exists in: https://linux-user.gr/t/mysthria-symper ... ta/5133/19

-------------------------------

I want to ask is there a linux mint blog where I can post a full how-to without attatchment limitations?
Post Reply

Return to “Installation & Boot”