Shutdowns due to overheating

Hi,
since a few weeks, my L5 shuts down “properly” itself “randomly”.
Sometimes several times a day, sometimes not.
Checking the logs, this is due to :

thermal thermal_zone0: cpu-thermal:critical temperature reached, shutting down
reboot: HARDWARE PROTECTION shutdown  (temperature too high)

Needless to say it’s both worrying and annoying when you’re not notified and you think your phone
has just stopped its display whereas it’s just off not able to ring, and when you have to
start it and enter 4passwords :slight_smile: to have it active again.
Even more worrying (for me at least :slight_smile: ) when I don’t see anyone mentioning that in the forum.
I’ve not noticed it each time it happened so couldn’t check how warm it was,
but some time, 10-15mn after, I found it cold, not even warm.
I didn’t have time to investigate like dumping temperatures and top cpu intensive processes
on a regular basis.
I also got the impression this behavior started to happen after kernel 6.0 upgrade. I didn’t remember
having that even a single time since I have it.
Most of the time my cpu is around 34°C. Seems that critical temps is set to 95°C (/sys/class/thermal/thermal_zone0/trip_point_2_temp) which is really high.
I’ve got no particular software installed. I notices packagekit and gnome-software sometimes wake up
and eat a lot of cpu for a bit of time, but they seem to stop after a while.
I use my L5 more like a phone than a computer, meaning no intensive use, just a few calls and messages, no data connection, no firefox (tried it once on youtube and after 20mn the phone was really
warm and did not shut down btw)
One point is I use a thin leather sleeve… The temperature at home or in my car, is
usually below 20°C and the sleeve may keep it a bit warmer, but there’s no fan in the L5, so, I don’t get
how the sleeve could prevent cooling in November with low indoor temps to the point it reaches 95°C.
It also happens without the sleeve.
So really light use and never really warm.
Any thoughts ? I’ll try dumping some info in syslog to finally investigate.

F.

Which kernel version are you running?

Hi @dos,

 purism@pureos:~$ dpkg -l|grep linux-image
ii  linux-image-5.18.0-1-librem5          5.18.11pureos1                           arm64        Linux 5.18 for the Librem 5 and its devkit
ii  linux-image-6.0.0-1-librem5           6.0.6pureos1                             arm64        Linux 6.0 for the Librem 5 and its devkit
ii  linux-image-librem5                   6.0.6pureos1                             arm64        Linux kernel for the Librem 5 and its devkit
purism@pureos:~$ uname -a
Linux pureos 6.0.0-1-librem5 #1 SMP PREEMPT Mon Nov 7 21:26:38 UTC 2022 aarch64 GNU/Linux

F.

Ok,
I had just wrote a script to try log some details and enabled/started it
as a service and it did it again.
No sleeve on. Kitchen temp was low and my L5 was sitting on the table idling.
No details dumped only this :

nov. 20 08:20:48 pureos kernel: option 1-1.2:1.1: GSM modem (1-port) converter detected
nov. 20 08:20:48 pureos kernel: usb 1-1.2: GSM modem (1-port) converter now attached to ttyUSB1
nov. 20 08:20:48 pureos kernel: option 1-1.2:1.2: GSM modem (1-port) converter detected
nov. 20 08:20:48 pureos kernel: usb 1-1.2: GSM modem (1-port) converter now attached to ttyUSB2
nov. 20 08:20:48 pureos kernel: option 1-1.2:1.3: GSM modem (1-port) converter detected
nov. 20 08:20:48 pureos kernel: usb 1-1.2: GSM modem (1-port) converter now attached to ttyUSB3
nov. 20 08:20:48 pureos kernel: qmi_wwan 1-1.2:1.4: cdc-wdm0: USB WDM device
nov. 20 08:20:48 pureos kernel: qmi_wwan 1-1.2:1.4 wwan0: register 'qmi_wwan' at usb-xhci-hcd.4.auto-1.2, WWAN/QMI device, f6:3>
nov. 20 08:20:48 pureos kernel: usb 1-1.2: authorized to connect
nov. 20 08:22:01 pureos kernel: edt_ft5x06 2-0038: Unable to fetch data, error: -6
nov. 20 08:39:55 pureos kernel: edt_ft5x06 2-0038: Unable to fetch data, error: -6
nov. 20 08:47:18 pureos kernel: dwc3 38100000.usb: request 0000000067a5e8f7 was not queued to ep1in
nov. 20 08:47:18 pureos kernel: tps6598x 0-003f: The interrupt is masked , how did it fire ?? 380100a
nov. 20 08:47:18 pureos kernel: bq25890-charger 3-006a: Upstream supply changed: 0.
nov. 20 08:47:18 pureos kernel: bq25890-charger 3-006a: Disabling OTG_EN pin
nov. 20 08:47:18 pureos kernel: bq25890-charger 3-006a: Upstream supply changed: 0.
nov. 20 08:47:18 pureos kernel: bq25890-charger 3-006a: Disabling OTG_EN pin
nov. 20 08:47:35 pureos kernel: edt_ft5x06 2-0038: Unable to fetch data, error: -6
nov. 20 08:56:09 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
nov. 20 08:56:09 pureos kernel: reboot: HARDWARE PROTECTION shutdown (Temperature too high)
nov. 20 08:56:10 pureos kernel: rfkill: input handler enabled
nov. 20 08:56:16 pureos kernel: zram0: detected capacity change from 3061760 to 0
nov. 20 08:56:16 pureos systemd-shutdown[1]: Syncing filesystems and block devices.
nov. 20 08:56:16 pureos systemd-shutdown[1]: Sending SIGTERM to remaining processes...

My never triggered script that was running :

#!/bin/sh

while true; do
  sleep 10;
  TEMP=$(cat /sys/class/thermal/thermal_zone0/temp)
  if [ $TEMP -gt 60000 ]; then
    echo "---- HIGH TEMP : $TEMP ----" |logger
    ps -eo pcpu,pid,user,args | sort -k 1 -r | head -4 |logger
    echo "----" |logger
    aplay /usr/share/sounds/purple/alert.wav
  fi
done

F.

Details about the start of the boot, such as kernel command line :

-- Journal begins at Sun 2022-03-20 20:55:26 CET, ends at Sun 2022-11-20 09:25:16 CET. --
nov. 20 08:19:44 pureos kernel: Booting Linux on physical CPU 0x0000000000 [0x410fd034]
nov. 20 08:19:44 pureos kernel: Linux version 6.0.0-1-librem5 (builder@cumin-landing-buildd-arm64-sz1d) (aarch64-linux-gnu-gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP PREEMPT Mon Nov 7 21:26:38 UTC 2022
nov. 20 08:19:44 pureos kernel: Machine model: Purism Librem 5r4
nov. 20 08:19:44 pureos kernel: Zone ranges:
nov. 20 08:19:44 pureos kernel:   DMA      [mem 0x0000000040000000-0x00000000ffffffff]
nov. 20 08:19:44 pureos kernel:   DMA32    empty
nov. 20 08:19:44 pureos kernel:   Normal   empty
nov. 20 08:19:44 pureos kernel: Movable zone start for each node
nov. 20 08:19:44 pureos kernel: Early memory node ranges
nov. 20 08:19:44 pureos kernel:   node   0: [mem 0x0000000040000000-0x00000000ffffffff]
nov. 20 08:19:44 pureos kernel: Initmem setup node 0 [mem 0x0000000040000000-0x00000000ffffffff]
nov. 20 08:19:44 pureos kernel: cma: Reserved 320 MiB at 0x00000000e4a00000
nov. 20 08:19:44 pureos kernel: psci: probing for conduit method from DT.
nov. 20 08:19:44 pureos kernel: psci: PSCIv1.1 detected in firmware.
nov. 20 08:19:44 pureos kernel: psci: Using standard PSCI v0.2 function IDs
nov. 20 08:19:44 pureos kernel: psci: MIGRATE_INFO_TYPE not supported.
nov. 20 08:19:44 pureos kernel: psci: SMC Calling Convention v1.1
nov. 20 08:19:44 pureos kernel: percpu: Embedded 28 pages/cpu s74472 r8192 d32024 u114688
nov. 20 08:19:44 pureos kernel: pcpu-alloc: s74472 r8192 d32024 u114688 alloc=28*4096
nov. 20 08:19:44 pureos kernel: pcpu-alloc: [0] 0 [0] 1 [0] 2 [0] 3 
nov. 20 08:19:44 pureos kernel: Detected VIPT I-cache on CPU0
nov. 20 08:19:44 pureos kernel: CPU features: detected: GIC system register CPU interface
nov. 20 08:19:44 pureos kernel: CPU features: detected: ARM erratum 845719
nov. 20 08:19:44 pureos kernel: Built 1 zonelists, mobility grouping on.  Total pages: 774144
nov. 20 08:19:44 pureos kernel: Kernel command line: u_boot_version=2022.10-gc4960dade2 console=ttymxc0,115200 quiet fsck.repair=yes security=apparmor splash plymouth.ignore-serial-consoles vt.global_cursor_default=0
nov. 20 08:19:44 pureos kernel: Unknown kernel command line parameters "splash u_boot_version=2022.10-gc4960dade2", will be passed to user space.
nov. 20 08:19:44 pureos kernel: Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes, linear)
nov. 20 08:19:44 pureos kernel: Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes, linear)
nov. 20 08:19:44 pureos kernel: mem auto-init: stack:off, heap alloc:off, heap free:off
nov. 20 08:19:44 pureos kernel: Memory: 2666140K/3145728K available (12480K kernel code, 2680K rwdata, 4948K rodata, 4416K init, 648K bss, 151908K reserved, 327680K cma-reserved)
...

This continues to happen. And having a look to the details I noticed that different thermal_zones are
triggered. And my script missed that.
So I did some archeology.
I received my L5 on October 4th, 2022.
Got my sleeve October 11th 2022.
I see a total of 125 boot logs :

purism@pureos:~$ journalctl -k --list-boots|tee
-125 a2eb9795754446f8b2ed99d9b62314a3 Sun 2022-03-20 20:55:26 CET—Tue 2022-10-04 20:58:31 CEST
-124 d47cd2b60481480886939ce1a50b5ca9 Tue 2022-10-04 21:03:08 CEST—Tue 2022-10-04 21:43:47 CEST
-123 10f905de96584cb6855331675c8ded70 Tue 2022-10-04 21:50:35 CEST—Tue 2022-10-04 21:55:57 CEST
-122 4740332146c3484b99af50b77275ab33 Wed 2022-10-05 09:35:50 CEST—Wed 2022-10-05 10:10:34 CEST
-121 76e92fbc0d424cfb8d90e6fd0afbadd7 Wed 2022-10-05 10:11:33 CEST—Wed 2022-10-05 10:46:50 CEST
-120 007929a6dade4cb8b96f21c1c6729805 Wed 2022-10-05 10:47:22 CEST—Wed 2022-10-05 17:04:23 CEST
-119 77e13181863345e2b53eaf49ccc434f8 Wed 2022-10-05 17:38:45 CEST—Thu 2022-10-06 04:37:23 CEST
-118 151371f5839d481baedf31f1c968e4be Thu 2022-10-06 09:34:43 CEST—Thu 2022-10-06 21:06:58 CEST
-117 4160fd2fe7014baf8db6cc8c8dfc8927 Fri 2022-10-07 09:27:51 CEST—Fri 2022-10-07 20:49:58 CEST
-116 486c9d642e8a4855b206e260520b1b2a Sat 2022-10-08 08:42:49 CEST—Sat 2022-10-08 09:39:22 CEST
-115 03d41208bead4712b2f238b5b4296b5d Sat 2022-10-08 09:46:12 CEST—Sat 2022-10-08 19:30:14 CEST
-114 e6b03e3b1bf243b68e0f7a040a2c1103 Sun 2022-10-09 10:11:30 CEST—Sun 2022-10-09 20:22:14 CEST
-113 7c07b027e9ee4b3d9b843f144f2e8f21 Sun 2022-10-09 21:15:03 CEST—Sun 2022-10-09 21:23:39 CEST
-112 9e99d357953c42478c56bde30f8ae455 Mon 2022-10-10 08:12:41 CEST—Mon 2022-10-10 14:58:59 CEST
-111 a26564de114e4fa3bbefd37cf226ee13 Mon 2022-10-10 14:59:41 CEST—Mon 2022-10-10 17:14:55 CEST
-110 98159c0888434b428a2b9492995c85e0 Mon 2022-10-10 17:15:30 CEST—Mon 2022-10-10 19:00:00 CEST
-109 c4961396956a4d9bbc99ba656a4eaec7 Mon 2022-10-10 20:21:00 CEST—Mon 2022-10-10 21:31:15 CEST
-108 2ade22b87381493fa7e5ed5d49c71359 Mon 2022-10-10 21:31:59 CEST—Mon 2022-10-10 21:33:17 CEST
-107 62e84081a8ae42f5bcd91bb32cd157df Tue 2022-10-11 07:25:50 CEST—Tue 2022-10-11 09:53:26 CEST
-106 626de364261b482d8af56932451523d4 Tue 2022-10-11 09:54:10 CEST—Tue 2022-10-11 13:47:46 CEST
-105 bf47df6638f84614b43a0690182c71e3 Tue 2022-10-11 15:18:27 CEST—Tue 2022-10-11 19:48:51 CEST
-104 f84c661e823e4687a810e919647bf070 Tue 2022-10-11 21:10:59 CEST—Tue 2022-10-11 21:13:40 CEST
-103 df9b0d2a51a64567a8b4f697b162c521 Wed 2022-10-12 07:40:16 CEST—Wed 2022-10-12 10:01:16 CEST
-102 6fd9cbf6af2b4f0ca689f8fec6dd4e20 Wed 2022-10-12 10:01:49 CEST—Wed 2022-10-12 11:55:08 CEST
-101 e4791dbb26494c55bc593175a241b4da Wed 2022-10-12 11:57:13 CEST—Wed 2022-10-12 12:06:39 CEST
-100 8c6dd0dd20914bca99f11197ce45b557 Wed 2022-10-12 12:08:11 CEST—Wed 2022-10-12 12:20:19 CEST
 -99 8074c506f9924548960f339bf69bc346 Wed 2022-10-12 12:20:52 CEST—Wed 2022-10-12 12:23:05 CEST
 -98 a4d9b4dcb2304a87a3cf00da12a2d9df Wed 2022-10-12 12:24:02 CEST—Wed 2022-10-12 12:34:53 CEST
 -97 e6dd5128a7214173ba42c5d97548fcaa Wed 2022-10-12 12:35:59 CEST—Wed 2022-10-12 12:40:42 CEST
 -96 80be9de7a7314e98b7d65b1e492621cc Wed 2022-10-12 12:41:33 CEST—Wed 2022-10-12 12:43:33 CEST
 -95 7e2eaf82fddb4220805792be1a251331 Wed 2022-10-12 12:45:26 CEST—Wed 2022-10-12 12:51:13 CEST
 -94 84bb6e4023f646629c91e52b65bee565 Wed 2022-10-12 12:53:06 CEST—Wed 2022-10-12 13:08:50 CEST
 -93 61125c5d48434ae5936aa77542485470 Wed 2022-10-12 13:09:21 CEST—Wed 2022-10-12 13:10:17 CEST
 -92 b82937c1cd564c4f8ad5cb90a1c83022 Wed 2022-10-12 14:20:01 CEST—Wed 2022-10-12 14:41:10 CEST
 -91 d4c266e9f517401cadc998aaa106dd34 Wed 2022-10-12 14:41:43 CEST—Wed 2022-10-12 14:54:55 CEST
 -90 7e1584f5ea3b44a9be869ca821c71562 Wed 2022-10-12 14:55:25 CEST—Wed 2022-10-12 14:58:49 CEST
 -89 4a09ecaf5a9a4faba60b1cc81949a09c Wed 2022-10-12 14:59:25 CEST—Wed 2022-10-12 15:03:41 CEST
 -88 56f3bf302f214f7192259b937e1abd4c Wed 2022-10-12 15:04:15 CEST—Wed 2022-10-12 15:05:36 CEST
 -87 c3c86d436e1c43498f1d99d0df138f22 Wed 2022-10-12 15:06:09 CEST—Wed 2022-10-12 15:17:55 CEST
 -86 47dade86ee8c47b882fe8281a25dbbc1 Wed 2022-10-12 15:18:34 CEST—Wed 2022-10-12 15:22:10 CEST
 -85 b50052cfe02f45a0a25f4c1405578fc8 Wed 2022-10-12 15:22:47 CEST—Wed 2022-10-12 15:25:43 CEST
 -84 033d72532dbc4a70adee33581b2779c1 Wed 2022-10-12 15:26:24 CEST—Wed 2022-10-12 15:40:25 CEST
 -83 10c3b17b48e74a6788fefc6e0dda9fed Wed 2022-10-12 15:40:56 CEST—Wed 2022-10-12 20:24:02 CEST
 -82 9699e998f4154355ab6be2078b6e9b6e Thu 2022-10-13 07:41:08 CEST—Thu 2022-10-13 19:59:33 CEST
 -81 4b75587994bc4e6da642e4be9a6dde83 Fri 2022-10-14 07:07:56 CEST—Fri 2022-10-14 07:20:06 CEST
 -80 f915bbaaffe6484bae70e0638f889b3b Fri 2022-10-14 07:30:35 CEST—Fri 2022-10-14 11:26:12 CEST
 -79 eb6188e1d39f4b14bf6e7538ed31fd85 Fri 2022-10-14 11:26:43 CEST—Fri 2022-10-14 11:26:56 CEST
 -78 fa170a2ae0fc401a86fa00b079c31bbc Fri 2022-10-14 11:27:33 CEST—Fri 2022-10-14 20:03:42 CEST
 -77 085594e99d7847ab987b3e3b76129132 Sat 2022-10-15 09:18:17 CEST—Sat 2022-10-15 20:14:46 CEST
 -76 0898882f75424b1fa671c7dab550784b Sun 2022-10-16 16:31:49 CEST—Sun 2022-10-16 19:02:34 CEST
 -75 c2b6267b1c5d47d28bcced34148c28d6 Mon 2022-10-17 09:43:18 CEST—Mon 2022-10-17 18:20:36 CEST
 -74 b72ac07a625d401882402cc6c8869ddd Tue 2022-10-18 07:14:48 CEST—Tue 2022-10-18 10:35:00 CEST
 -73 3fbc136492ae498eb90510847cafc318 Tue 2022-10-18 10:35:36 CEST—Tue 2022-10-18 11:40:04 CEST
 -72 4510ea8f3b264d1d8aff70988ba78423 Tue 2022-10-18 11:40:33 CEST—Tue 2022-10-18 12:53:28 CEST
 -71 61e40c76f0e6482183be9f2a162f65ee Tue 2022-10-18 12:55:36 CEST—Tue 2022-10-18 19:39:20 CEST
 -70 b118b7c1bf51466bafbd15616b4faba2 Tue 2022-10-18 19:40:18 CEST—Tue 2022-10-18 20:47:45 CEST
 -69 a6a088ceb88f43aab82d0961b7d6c2e1 Wed 2022-10-19 06:22:23 CEST—Thu 2022-10-20 05:54:33 CEST
 -68 d73c4a1553874927acdf2f33031d2fbf Thu 2022-10-20 07:40:01 CEST—Thu 2022-10-20 07:40:36 CEST
 -67 1d6676ad4e294280ad665a0d349f748a Thu 2022-10-20 07:41:30 CEST—Thu 2022-10-20 18:30:35 CEST
 -66 b313b29630c54010a67cde5c58bcfc5c Thu 2022-10-20 20:05:39 CEST—Thu 2022-10-20 20:06:16 CEST
 -65 c28b1afddccd4568ac4e019ea46e6d4c Fri 2022-10-21 07:51:08 CEST—Fri 2022-10-21 17:13:22 CEST
 -64 3804dd1b89764e6783a17baf8703d91a Fri 2022-10-21 17:13:55 CEST—Fri 2022-10-21 20:21:57 CEST
 -63 1e57815497ed4cba9dac18db3b559410 Sat 2022-10-22 08:52:43 CEST—Sat 2022-10-22 18:29:06 CEST
 -62 8e67fdb1ccb944bead3b28f3f450dbc9 Sun 2022-10-23 09:11:41 CEST—Sun 2022-10-23 19:42:54 CEST
 -61 a73745b89f1f477a8c418857bc408588 Mon 2022-10-24 08:07:45 CEST—Mon 2022-10-24 11:45:08 CEST
 -60 c0f0ea47d30f468abe705740630ffaa0 Mon 2022-10-24 11:45:42 CEST—Mon 2022-10-24 20:09:38 CEST
 -59 54fe268c9a544e6eb416ba37ea049eb0 Tue 2022-10-25 08:42:50 CEST—Tue 2022-10-25 20:47:09 CEST
 -58 21a8dc0d00d14aad86ff7dc0c64af4cc Wed 2022-10-26 09:09:34 CEST—Wed 2022-10-26 20:14:07 CEST
 -57 9d45c66d7c0143308f9e6f829b5d9f38 Thu 2022-10-27 08:41:21 CEST—Thu 2022-10-27 09:15:54 CEST
 -56 87224a993a5e4e98a94f629612ae2014 Thu 2022-10-27 09:17:16 CEST—Thu 2022-10-27 19:49:22 CEST
 -55 736e68fb25824da296879ee25d249586 Thu 2022-10-27 19:52:06 CEST—Thu 2022-10-27 20:20:55 CEST
 -54 3f82f432dcce480cb5abc47baa9f7f26 Fri 2022-10-28 09:07:18 CEST—Fri 2022-10-28 20:12:45 CEST
 -53 c3c43a8c99944a8494ca92585838aa4b Sat 2022-10-29 09:49:33 CEST—Sat 2022-10-29 11:49:34 CEST
 -52 4282ab0f12104ee88ef66ec5c5ec1928 Sat 2022-10-29 16:46:48 CEST—Sat 2022-10-29 18:53:37 CEST
 -51 67e5346fb75c4be19b728b73e4799c02 Sun 2022-10-30 08:30:57 CET—Sun 2022-10-30 20:54:37 CET
 -50 d0845c59e2e94c7ab786621139739fca Mon 2022-10-31 08:44:48 CET—Mon 2022-10-31 14:21:53 CET
 -49 a3502c3f2bc14016a4098f401479dbd0 Mon 2022-10-31 14:22:30 CET—Mon 2022-10-31 19:38:29 CET
 -48 82447d679cd14ecc906a01060ed38e9f Tue 2022-11-01 07:47:02 CET—Tue 2022-11-01 19:50:05 CET
 -47 6cae341f5dee4c72b23a6a6f79c9b84d Wed 2022-11-02 07:36:55 CET—Wed 2022-11-02 20:12:53 CET
 -46 3f655bc268374f53aef8301d2c084622 Thu 2022-11-03 09:33:25 CET—Thu 2022-11-03 20:33:01 CET
 -45 22b9e253909e4f66933223aef9942c14 Fri 2022-11-04 09:26:11 CET—Fri 2022-11-04 19:57:37 CET
 -44 e6c15e3b45144fe6ab3aa1955756a05e Sat 2022-11-05 08:57:32 CET—Sat 2022-11-05 13:02:54 CET
 -43 7371063622a24e2893b8eb9f8dc4527a Sat 2022-11-05 13:47:46 CET—Sat 2022-11-05 15:50:22 CET
 -42 ec0af3078d7148b8a64696aedadd4bf9 Sat 2022-11-05 16:48:27 CET—Sat 2022-11-05 19:03:51 CET
 -41 8ebb8566936d42699bbfb00d75fd7744 Mon 2022-11-07 08:00:52 CET—Mon 2022-11-07 09:12:08 CET
 -40 5c76d909037d424c96d682b991f6a207 Mon 2022-11-07 09:41:02 CET—Mon 2022-11-07 19:56:31 CET
 -39 b2477cd615054ef7af76814e2ec94e0c Mon 2022-11-07 20:36:58 CET—Mon 2022-11-07 21:31:08 CET
 -38 61495e35997e461d9b11e0e54af8ad14 Mon 2022-11-07 21:58:20 CET—Mon 2022-11-07 22:04:29 CET
 -37 e0c28557411f4e46808b64a69a3e7da2 Tue 2022-11-08 09:23:59 CET—Tue 2022-11-08 18:51:46 CET
 -36 83b5e89d4a754960bc0a93eb1d90a866 Tue 2022-11-08 21:07:31 CET—Tue 2022-11-08 22:40:49 CET
 -35 78f4e68eae6549e1a156cf63766148cd Wed 2022-11-09 10:05:58 CET—Wed 2022-11-09 11:10:11 CET
 -34 c2861789841941da8e0ab14e932d38ff Wed 2022-11-09 11:10:45 CET—Wed 2022-11-09 15:22:42 CET
 -33 faf191bd84bc45dc839bb4751a199893 Wed 2022-11-09 15:23:11 CET—Thu 2022-11-10 04:01:42 CET
 -32 8820fc4e096e428e886d0cae94c5c563 Thu 2022-11-10 09:04:36 CET—Thu 2022-11-10 09:05:05 CET
 -31 e23349dd69574bbabc553df91c6eb70b Thu 2022-11-10 09:06:00 CET—Thu 2022-11-10 09:06:35 CET
 -30 07f2dc57b8ba443e9e4e78c2eb7ce389 Thu 2022-11-10 09:26:49 CET—Thu 2022-11-10 11:33:35 CET
 -29 6f823029bc514909a36438fc401f8172 Thu 2022-11-10 11:34:14 CET—Thu 2022-11-10 20:45:56 CET
 -28 5fa2242dfec54da18792ccb63610a0a5 Fri 2022-11-11 09:25:26 CET—Fri 2022-11-11 12:05:29 CET
 -27 829ff2bfe5a14a439ee4df64b3916c0a Fri 2022-11-11 12:34:40 CET—Fri 2022-11-11 19:32:57 CET
 -26 6d84c728a74e42ec9131245654df52df Fri 2022-11-11 20:01:38 CET—Sat 2022-11-12 03:28:56 CET
 -25 7fe2539128d045a88dd6683986e84c54 Sat 2022-11-12 07:28:43 CET—Sat 2022-11-12 07:31:06 CET
 -24 6fcbcf2485d94ebda67e38554e5255b4 Sat 2022-11-12 07:32:05 CET—Sat 2022-11-12 19:23:13 CET
 -23 af29cff736d242b8ae816b149403064b Sun 2022-11-13 09:06:31 CET—Sun 2022-11-13 20:55:51 CET
 -22 ff0ce9d9d8da4f63aa2ee708b8c24505 Mon 2022-11-14 09:39:34 CET—Mon 2022-11-14 09:54:03 CET
 -21 f9c2356663c94eb5bef855af71cc9925 Mon 2022-11-14 09:55:04 CET—Mon 2022-11-14 20:24:54 CET
 -20 f9f4e99593f54338a8ed4c62da700fae Tue 2022-11-15 08:58:56 CET—Tue 2022-11-15 09:18:00 CET
 -19 eba8462e582e47f3938cc627c55f8817 Tue 2022-11-15 09:30:27 CET—Tue 2022-11-15 17:20:23 CET
 -18 db389e5be2624ecc8a74cc3bf4183bd4 Tue 2022-11-15 17:21:14 CET—Tue 2022-11-15 20:20:22 CET
 -17 7a6b08d59ec0499e8542b034e99c9e00 Wed 2022-11-16 09:26:47 CET—Wed 2022-11-16 20:12:12 CET
 -16 7cab87c9ee574c339fd03f1d6acca693 Thu 2022-11-17 08:21:21 CET—Thu 2022-11-17 20:05:33 CET
 -15 8c4f629b59a743f59735b01ab32555a9 Thu 2022-11-17 20:19:20 CET—Thu 2022-11-17 20:23:20 CET
 -14 7667fcda7fb04d289acb8b6e6cf31a89 Fri 2022-11-18 07:38:48 CET—Fri 2022-11-18 22:52:22 CET
 -13 0cd94b74e8fa4d0a9643f2cb15cf341b Sat 2022-11-19 08:12:13 CET—Sat 2022-11-19 08:12:42 CET
 -12 e006c06fd5274688bbfe860d2bbabe07 Sat 2022-11-19 08:13:33 CET—Sat 2022-11-19 16:35:25 CET
 -11 2f57d15ca7934da6acfcc603eec54ae5 Sat 2022-11-19 18:04:33 CET—Sat 2022-11-19 18:14:22 CET
 -10 2c635700905a4984ba0a1f50953ac8c2 Sat 2022-11-19 18:38:56 CET—Sat 2022-11-19 18:43:14 CET
  -9 f622d382c4f7432eaa9f30c094fa2f49 Sat 2022-11-19 18:43:42 CET—Sat 2022-11-19 18:45:53 CET
  -8 8cf081c2d7a0409aac753d19196a21b3 Sat 2022-11-19 18:46:35 CET—Sat 2022-11-19 20:19:33 CET
  -7 c1b716ede69a4ec4a029ad09ad444a8c Sat 2022-11-19 20:28:02 CET—Sat 2022-11-19 21:28:49 CET
  -6 f28b6fef7744457ca12a262fdadf0de5 Sun 2022-11-20 08:19:44 CET—Sun 2022-11-20 08:56:16 CET
  -5 4b40924360dd426b80eab0aebcb0cbdb Sun 2022-11-20 09:07:59 CET—Sun 2022-11-20 12:41:01 CET
  -4 6e5499c376344515ab5e271ff6ca2ddb Sun 2022-11-20 14:39:56 CET—Sun 2022-11-20 20:20:09 CET
  -3 02b6926f1917407cb8ac1360b54430be Mon 2022-11-21 09:26:03 CET—Mon 2022-11-21 09:55:25 CET
  -2 6ffb6c6317e647678d27a8224b0da3a4 Mon 2022-11-21 09:57:37 CET—Mon 2022-11-21 21:48:43 CET
  -1 56c6929640414825861a630e3123e381 Tue 2022-11-22 09:06:51 CET—Tue 2022-11-22 09:31:46 CET
   0 7b09169ba28a4b68b9079a72670f472a Tue 2022-11-22 09:35:12 CET—Tue 2022-11-22 09:49:53 CET

And from this, here is the shutdowns due to high temp :

purism@pureos:~$ for I in $(seq 1 125); do journalctl -k -b -$I|grep 'critical temperature'; done
Nov 22 09:31:39 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Nov 20 12:40:54 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Nov 20 08:56:09 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Nov 19 18:14:15 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 19 16:35:18 pureos kernel: thermal thermal_zone1: gpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:20 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:21 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:21 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:21 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:21 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:22 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:22 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 17:20:22 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 15 09:17:53 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Nov 11 19:32:50 pureos kernel: thermal thermal_zone1: gpu-thermal: critical temperature reached, shutting down
Nov 11 12:05:21 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Nov 07 19:56:24 pureos kernel: thermal thermal_zone1: gpu-thermal: critical temperature reached, shutting down
Nov 07 09:12:01 pureos kernel: thermal thermal_zone1: gpu-thermal: critical temperature reached, shutting down
Nov 05 19:03:44 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Nov 05 15:50:16 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Nov 05 13:02:47 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Oct 29 11:49:27 pureos kernel: thermal thermal_zone1: gpu-thermal: critical temperature reached, shutting down
Oct 27 19:49:15 pureos kernel: thermal thermal_zone1: gpu-thermal: critical temperature reached, shutting down
Oct 27 09:15:22 pureos kernel: thermal thermal_zone1: gpu-thermal: critical temperature reached, shutting down
Oct 20 18:30:28 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Oct 14 07:19:59 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
Oct 11 13:47:39 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down
Oct 10 18:59:24 pureos kernel: thermal thermal_zone2: vpu-thermal: critical temperature reached, shutting down

Kernel 6.0.0-1-librem5 was installed October 10th 2022, and the sleeve on the 11th.
The 10th, first shutdown occured, at 18:59:24 .
Note : 6.0.0-1-librem5 was booted first the 10th at 17:15:30.
Between the 4th and the 10th, no shutdown.

can you roll back to the previous kernel?

Hi @Captain_Morgan,
I guess it’s the only way to verify things. I was hesitant because I didn’t want to loose any potential
improvement in kernel 6, but my L5 shutting down unexpectedly is just a pain in the neck.
So thanks to your message, I’m back on 5.18.0-1-librem5 now.
Let’s see how it behaves.
In the meantime, I saw there is a thermal.nocrt kernel option that I understand would turn the
shutting down behavior off. But that will come in a second time maybe (disabling that safety needs to be
thought twice… :slight_smile: )

I’m just wondering if my device is faulty in some way or if it’s the kernel . But in that latter case, I don’t
understand why I seem to be the only one experiencing this issue.

F.

Really you just need to stay on 5.x for a while and observe the behaviour and then form a tentative conclusion.

Personally I would never do that unless I am ok with the cpu melting on itself.

2 Likes

Right, let’s see. So far so good.

1 Like

At the time CPUs didn’t have much protection, the machine started behave badly and just crash/stop.
I think cpu based machines still behave the same, let me know if I’m wrong. It’s not a nuclear power plant I guess.
That being said, repeated overheating is never good for the hardware for sure.
Also, I was more worried last time I’ve been watching videos on youtube on firefox with the phone being
really hot compared to the few times my L5 shut down because of those overheating protection
shutdowns.
I would just replace those kernel triggers with some software/script that would do the job in place in
order to understand and have more control.

F.

Hey there,
as of today, almost a week later, not a single shutdown due to kernel overheating triggers with kernel 5.18.0-1-librem5 …
But I now have other issues : like microphone set to headset systematically after boot. So I have to change to handset every morning after turning L5 on.
Sometimes after I have switched to handset
following the boot and after receiving a first call, it would still be set to
handset, but if I get called again, people can not hear me. And I have
to reboot to again set mic to handset.
Not sure if it is related to some recent changes to call audio.
Also, systematically if I’m in a call and get a second call during that,
whether I accept it or not, the phone continues to ring/vibrate forever.
The 2 calls are “lost” and I have to reboot to stop the phone to ring/vibrate.
So now, I think I will go back to kernel 6 and see if shutdowns are still
triggered (and if I have the above mic/call behaviors which I don’t remember having had with 6.0).

F.

3 Likes

The continuous ringing also sometimes happens after picking up a call for me and while talking to that person the incoming call is from that same person. There are still a few items that need to be sorted out even with the new kernel.

Shutdowns are back since I re-installed kernel 6.0 :slight_smile: … at least first one this morning.
Later today I wrote a .service that would be triggered at shutdown time to dump temperatures.
This evening, I found my L5 off again… happy to see that. For once.
I looked at the logs :

déc. 01 18:48:10 pureos kernel: thermal thermal_zone0: cpu-thermal: critical temperature reached, shutting down
déc. 01 18:48:10 pureos kernel: reboot: HARDWARE PROTECTION shutdown (Temperature too high)
déc. 01 18:48:10 pureos systemd[1]: Stopping Session 1 of user purism.
déc. 01 18:48:11 pureos systemd[1]: Removed slice system-getty.slice.
déc. 01 18:48:11 pureos pulseaudio[848]: X connection to :0 broken (explicit kill or server shutdown).
déc. 01 18:48:11 pureos systemd[1]: Removed slice system-modprobe.slice.
déc. 01 18:48:11 pureos systemd[1]: Stopped target Block Device Preparation for /dev/mapper/crypt_root.
déc. 01 18:48:11 pureos systemd[1]: Stopped target Graphical Interface.
déc. 01 18:48:11 pureos systemd[1]: Stopped target Multi-User System.
déc. 01 18:48:11 pureos systemd[1]: Stopped target Login Prompts.
...
déc. 01 18:48:11 pureos systemd[1]: Stopped Network Manager Wait Online.
déc. 01 18:48:11 pureos ModemManager[756]: <info>  ModemManager is shut down
déc. 01 18:48:11 pureos callaudiod[1040]: Error in PulseAudio context: Connection terminated
déc. 01 18:48:12 pureos root[9081]: Temp thermal_zone0 : 23000 
déc. 01 18:48:11 pureos systemd[1]: Stopping Accounts Service...
déc. 01 18:48:13 pureos root[9090]: Temp thermal_zone1 : 23000 
déc. 01 18:48:11 pureos systemd[816]: pulseaudio.service: Main process exited, code=exited, status=1/FAILURE
déc. 01 18:48:13 pureos sshd[762]: Received signal 15; terminating.
déc. 01 18:48:13 pureos blkdeactivate[9073]:   [SKIP]: unmount of crypt_root (dm-0) mounted on /
déc. 01 18:48:11 pureos systemd[816]: pulseaudio.service: Failed with result 'exit-code'.
déc. 01 18:48:13 pureos root[9096]: Temp thermal_zone2 : 23000 
déc. 01 18:48:11 pureos systemd[816]: pulseaudio.service: Consumed 38.204s CPU time.
déc. 01 18:48:11 pureos systemd[1]: Stopping Save/Restore Sound Card State...
...

So 3s after the kernel overheat protection got triggered, my temperature hook showed (Temp thermal_zone...) : 23°C for thermal_zone[0,1,2] .
I don’t think my L5 could go from 95°C down to 23°C in 3s.

The sensors reading in /sys/… show 23°C which seems accurate according to the room temperature below that.
What’s going on, on my Librem5 ??
A kernel bug in 6.0 happening particularly well on my specific device ?

Could this be something similar to this .
Do we have something similar at 95°C on the hardware side on the Librem 5 ? :

Until then you can add thermal.crt=-1 to your kernel commandline to avoid the critical shutdown happening. There as not any danger this will damage your SOC as it has an internal safety shutoff at 105C

Thank you people for reading :slight_smile:
F.

Yep, pretty crazy behaviors :slight_smile: . Sometimes people laugh quite a bit around me :slight_smile: