Skip to main content
Topic: Kernel bug ever since switching to 5.18 (Read 408 times) previous topic - next topic
0 Members and 1 Guest are viewing this topic.

Kernel bug ever since switching to 5.18

Hello,

Ever since switching from 5.17.8 to 5.18, I have been seeing a very intermittent bug that causes multiple programs to freeze.

Code: [Select]
[29979.091868] BUG: kernel NULL pointer dereference, address: 00000000000000d6
[29979.091872] #PF: supervisor read access in kernel mode
[29979.091873] #PF: error_code(0x0000) - not-present page
[29979.091875] PGD 0 P4D 0
[29979.091877] Oops: 0000 [#1] PREEMPT SMP NOPTI
[29979.091878] CPU: 0 PID: 549 Comm: Disk Not tainted 5.18.12-artix1-1 #1 4573a8f9e91c18869fb6bcdf7852bea98d68548b
[29979.091881] Hardware name: Gigabyte Technology Co., Ltd. Z490 AORUS ULTRA-GU/Z490 AORUS ULTRA-GU, BIOS F21 11/23/2021
[29979.091883] RIP: 0010:__filemap_get_folio+0xb1/0x350
[29979.091886] Code: 10 e8 23 15 36 00 48 89 c3 48 3d 02 04 00 00 74 e2 48 3d 06 04 00 00 74 da 48 85 c0 0f 84 97 02 00 00 a8 01 0f 85 c8 00 00 00 <8b> 40 34 85 c0 74 c2 8d 50 01 f0 0f b1 53 34 75 f2 48 8b 54 24 28
[29979.091888] RSP: 0000:ffffa1378712fca8 EFLAGS: 00010246
[29979.091890] RAX: 00000000000000a2 RBX: 00000000000000a2 RCX: 0000000000000002
[29979.091892] RDX: 000000000000002c RSI: ffff91bb0bad0000 RDI: ffffa1378712fcb8
[29979.091893] RBP: 0000000000000000 R08: 00000000001f7fef R09: 0000000000000000
[29979.091894] R10: ffffffffffffffc0 R11: 0000000000000000 R12: 0000000000000000
[29979.091895] R13: ffff91b95b7498f8 R14: 00000000001f7fef R15: ffff91b99d4f7c50
[29979.091897] FS:  00007f6596ffd640(0000) GS:ffff91c09e200000(0000) knlGS:0000000000000000
[29979.091898] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[29979.091900] CR2: 00000000000000d6 CR3: 000000019d608006 CR4: 00000000007706f0
[29979.091901] PKRU: 55555554
[29979.091902] Call Trace:
[29979.091903]  <TASK>
[29979.091904]  ? page_add_file_rmap+0x9f/0x2a0
[29979.091907]  filemap_fault+0x6c/0x910
[29979.091909]  __do_fault+0x33/0x110
[29979.091912]  __handle_mm_fault+0xd79/0x14c0
[29979.091914]  handle_mm_fault+0xb2/0x280
[29979.091915]  do_user_addr_fault+0x1be/0x680
[29979.091918]  ? __x64_sys_rt_sigprocmask+0x9c/0xe0
[29979.091921]  exc_page_fault+0x74/0x170
[29979.091923]  ? asm_exc_page_fault+0xc/0x30
[29979.091925]  asm_exc_page_fault+0x22/0x30
[29979.091926] RIP: 0033:0x7f65f775ce8d
[29979.091928] Code: 00 00 00 00 00 66 66 2e 0f 1f 84 00 00 00 00 00 66 66 2e 0f 1f 84 00 00 00 00 00 66 90 f3 0f 1e fa 48 89 f8 48 83 fa 20 72 23 <c5> fe 6f 06 48 83 fa 40 0f 87 a5 00 00 00 c5 fe 6f 4c 16 e0 c5 fe
[29979.091929] RSP: 002b:00007f6596ffb258 EFLAGS: 00010202
[29979.091930] RAX: 00007f652400eed0 RBX: 00007f6596ffc288 RCX: 00007f6596ffb490
[29979.091931] RDX: 0000000000004000 RSI: 00007f62b63efa50 RDI: 00007f652400eed0
[29979.091932] RBP: 0000000000000000 R08: 0000000000000002 R09: 0000000000000000
[29979.091932] R10: 0000000000000008 R11: 0000000000000246 R12: 0000000000000000
[29979.091933] R13: 00007f6596ffb510 R14: 0000000000000002 R15: 00007f65240017b0
[29979.091934]  </TASK>
[29979.091935] Modules linked in: snd_seq_dummy snd_hrtimer snd_seq fuse nls_iso8859_1 vfat fat dm_multipath sg crypto_user intel_rapl_msr intel_rapl_common snd_sof_pci_intel_cnl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_bus snd_soc_skl intel_tcc_cooling x86_pkg_temp_thermal snd_soc_hdac_hda intel_powerclamp coretemp snd_hda_ext_core snd_soc_sst_ipc snd_hda_codec_realtek snd_soc_sst_dsp spi_nor kvm_intel snd_hda_codec_generic snd_soc_acpi_intel_match ledtrig_audio iwlmvm mtd snd_soc_acpi kvm iTCO_wdt snd_soc_core intel_pmc_bxt mei_pxp irqbypass snd_compress mei_hdcp rapl iTCO_vendor_support ee1004 ac97_bus gigabyte_wmi intel_cstate wmi_bmof intel_wmi_thunderbolt mxm_wmi mac80211 snd_usb_audio snd_hda_codec_hdmi snd_pcm_dmaengine snd_usbmidi_lib snd_hda_intel amdgpu libarc4 i915 intel_uncore snd_intel_dspcfg snd_rawmidi snd_intel_sdw_acpi snd_seq_device
[29979.091960]  snd_hda_codec mc snd_hda_core pcspkr btusb snd_hwdep iwlwifi spi_intel_pci btrtl snd_pcm gpu_sched spi_intel btbcm drm_buddy iptable_mangle iwlmei snd_timer drm_ttm_helper btintel btmtk iptable_raw i2c_i801 bluetooth cfg80211 ip_tables joydev mousedev snd igc ttm i2c_smbus mei_me ecdh_generic soundcore drm_dp_helper xt_connmark crc16 mei intel_gtt intel_pch_thermal nf_conntrack tpm_crb wmi nf_defrag_ipv6 tpm_tis video nf_defrag_ipv4 tpm_tis_core mac_hid acpi_pad acpi_tad xt_mark ip6table_mangle xt_comment xt_addrtype ip6table_raw ip6_tables x_tables rfkill wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel xfs libcrc32c crc32c_generic usbhid dm_crypt cbc encrypted_keys trusted asn1_encoder tee tpm rng_core dm_mod crct10dif_pclmul crc32_pclmul crc32c_intel nvme ghash_clmulni_intel aesni_intel crypto_simd sr_mod cryptd xhci_pci nvme_core cdrom xhci_pci_renesas
[29979.091991] CR2: 00000000000000d6
[29979.091992] ---[ end trace 0000000000000000 ]---
[29979.091993] RIP: 0010:__filemap_get_folio+0xb1/0x350
[29979.091994] Code: 10 e8 23 15 36 00 48 89 c3 48 3d 02 04 00 00 74 e2 48 3d 06 04 00 00 74 da 48 85 c0 0f 84 97 02 00 00 a8 01 0f 85 c8 00 00 00 <8b> 40 34 85 c0 74 c2 8d 50 01 f0 0f b1 53 34 75 f2 48 8b 54 24 28
[29979.091995] RSP: 0000:ffffa1378712fca8 EFLAGS: 00010246
[29979.091996] RAX: 00000000000000a2 RBX: 00000000000000a2 RCX: 0000000000000002
[29979.091997] RDX: 000000000000002c RSI: ffff91bb0bad0000 RDI: ffffa1378712fcb8
[29979.091998] RBP: 0000000000000000 R08: 00000000001f7fef R09: 0000000000000000
[29979.091998] R10: ffffffffffffffc0 R11: 0000000000000000 R12: 0000000000000000
[29979.091999] R13: ffff91b95b7498f8 R14: 00000000001f7fef R15: ffff91b99d4f7c50
[29979.092000] FS:  00007f6596ffd640(0000) GS:ffff91c09e200000(0000) knlGS:0000000000000000
[29979.092001] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[29979.092002] CR2: 00000000000000d6 CR3: 000000019d608006 CR4: 00000000007706f0
[29979.092002] PKRU: 55555554
Appears to be the culprit; however it happens very intermittently (as seen here, took more than 8 hours), so I have not been able to dissect which update broke it.

My specs are as follows;
Code: [Select]
ik4ms@Aldebaran ~ $ inxi -nbAGdp                                                                                                                                                                                                                                       
System:
  Host: Aldebaran Kernel: 5.17.8-artix1-1 arch: x86_64 bits: 64 Desktop: sway
    v: 1.7 Distro: Artix Linux
Machine:
  Type: Desktop System: Gigabyte product: Z490 AORUS ULTRA-GU v: -CF
  Mobo: Gigabyte model: Z490 AORUS ULTRA-GU
    UEFI: American Megatrends v: F21 date: 11/23/2021
CPU:
  Info: 8-core Intel Core i7-10700K [MT MCP] speed (MHz): avg: 4378
    min/max: 800/5100
Graphics:
  Device-1: Intel CometLake-S GT2 [UHD Graphics 630] driver: i915 v: kernel
  Device-2: AMD Vega 10 XTX [Radeon Frontier Edition] driver: amdgpu
    v: kernel
  Display: wayland server: Xwayland v: 22.1.3 compositor: sway v: 1.7
    driver: gpu: amdgpu resolution: 1: 1920x1080~60Hz 2: 3440x1440~100Hz
  OpenGL: renderer: AMD Radeon Vega Frontier Edition (vega10 LLVM 14.0.6
    DRM 3.44 5.17.8-artix1-1)
    v: 4.6 Mesa 22.2.0-devel (git-a841300384)
Audio:
  Device-1: Intel Comet Lake PCH cAVS driver: snd_hda_intel
  Device-2: AMD Vega 10 HDMI Audio [Radeon 56/64] driver: snd_hda_intel
  Device-3: C-Media Im Fulla Schiit type: USB
    driver: hid-generic,snd-usb-audio,usbhid
  Sound Server-1: ALSA v: k5.17.8-artix1-1 running: yes
  Sound Server-2: PipeWire v: 0.3.56 running: yes
Network:
  Device-1: Intel Comet Lake PCH CNVi WiFi driver: iwlwifi
  IF: wlan0 state: down mac: ##:##:##:##:##:##
  Device-2: Intel Ethernet I225-V driver: igc
  IF: eth0 state: up speed: 1000 Mbps duplex: full mac: ##:##:##:##:##:##
Drives:
  Local Storage: total: 4.57 TiB used: 3.71 TiB (81.1%)
  ID-1: /dev/nvme0n1 vendor: A-Data model: SX8200PNP size: 953.87 GiB
  ID-2: /dev/sda vendor: HGST (Hitachi) model: HDN726040ALE614
    size: 3.64 TiB
  Optical-1: /dev/sr0 vendor: TSSTcorp model: CDDVDW SH-224DB
    dev-links: cdrom
  Features: speed: 48 multisession: yes audio: yes dvd: yes
    rw: cd-r,cd-rw,dvd-r,dvd-ram
Partition:
  ID-1: / size: 327.69 GiB used: 323.69 GiB (98.8%) fs: xfs dev: /dev/dm-0
  ID-2: /boot size: 1020 MiB used: 84.2 MiB (8.3%) fs: vfat
    dev: /dev/nvme0n1p1
  ID-3: /mnt/main size: 1.91 TiB
    used: 1.9 TiB (99.6%) fs: xfs dev: /dev/sda1
  ID-4: /mnt/main2 size: 1.6 TiB
    used: 1.49 TiB (93.2%) fs: ntfs dev: /dev/sda2
Info:
  Processes: 416 Uptime: 1h 9m Memory: 31.2 GiB used: 4.94 GiB (15.8%)
  Shell: mksh inxi: 3.3.15
I have tried multiple UEFI versions including the latest available, and have not found anyone with a similar problem.
I have also removed all third-party kernel modules just in case.

Thank you in advance for a clue as to what's causing it/where I can get help.

 

Re: Kernel bug ever since switching to 5.18

Reply #1
Your hardware is relatively new and kernel 5.18 has produced lots of bad reports so far. You may find it better to downgrade to 5.17 until things stabilise.