
Kernel upgrade = freeze on boot (linux-image-4.16.0-2-amd64)




So, on a Testing system, linux-image-4.16.0-2-amd64 came up in the apt
queue. I installed it and rebooted.
My system froze on reboot. The boot messages scrolled past on the
console up to the point where X starts, and then syslog shows this
(warning: long excerpt):
Jul 1 18:50:08 debian-nitpicking kernel: [ 6.862360] caller _nv001169rm+0xe3/0x1d0 [nvidia] mapping multiple BARs
Jul 1 18:50:09 debian-nitpicking colord[846]: failed to get session [pid 728]: No data available
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.237707] nvidia-modeset: Allocated GPU:0 (GPU-5f445880-4fd7-5bb8-5709-1ba476256ce5) @ PCI:0000:01:00.0
Jul 1 18:50:09 debian-nitpicking colord[846]: failed to get session [pid 728]: No data available
Jul 1 18:50:09 debian-nitpicking colord[846]: failed to get session [pid 728]: No data available
Jul 1 18:50:09 debian-nitpicking colord[846]: failed to get session [pid 728]: No data available
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399065] usercopy: Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399077] ------------[ cut here ]------------
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399077] kernel BUG at /build/linux-uwVqDp/linux-4.16.16/mm/usercopy.c:100!
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399084] invalid opcode: 0000 [#1] SMP NOPTI
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399085] Modules linked in: appletalk ax25 ipx(C) p8023 p8022 psnap llc pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) cpufreq_powersave cpufreq_conservative cpufreq_userspace binfmt_misc arc4 rt2800usb rt2x00usb rt2800lib rt2x00lib mac80211 edac_mce_amd joydev wmi_bmof cfg80211 snd_hda_codec_hdmi crc_ccitt rfkill kvm_amd ccp rng_core evdev kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel serio_raw fam15h_power pcspkr k10temp sg sp5100_tco shpchp acpi_cpufreq wmi button nvidia_drm(PO) drm_kms_helper drm nvidia_modeset(PO) nvidia(PO) snd_hda_codec_realtek snd_hda_codec_generic ipmi_devintf ipmi_msghandler snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore parport_pc ppdev lp parport ip_tables x_tables autofs4 ext4
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399122] crc16 mbcache jbd2 crc32c_generic fscrypto ecb hid_generic usbhid hid sd_mod ohci_pci ata_generic crc32c_intel ahci libahci pata_atiixp aesni_intel aes_x86_64 crypto_simd firewire_ohci xhci_pci cryptd glue_helper libata ohci_hcd ehci_pci firewire_core xhci_hcd ehci_hcd crc_itu_t r8169 i2c_piix4 scsi_mod usbcore mii usb_common
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399141] CPU: 7 PID: 977 Comm: Xorg Tainted: P C O 4.16.0-2-amd64 #1 Debian 4.16.16-2
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399143] Hardware name: Gigabyte Technology Co., Ltd. GA-970A-UD3/GA-970A-UD3, BIOS F7 10/22/2012
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399148] RIP: 0010:usercopy_abort+0x69/0x80
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399149] RSP: 0018:ffffba16c2f8fb50 EFLAGS: 00010282
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399151] RAX: 000000000000006f RBX: 0000000000000003 RCX: 0000000000000000
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399153] RDX: 0000000000000000 RSI: ffff99acfedd6738 RDI: ffff99acfedd6738
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399154] RBP: 0000000000000003 R08: 00000000000003d9 R09: 0000000000000004
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399156] R10: ffffffff94c77e48 R11: ffffffff953a8dcd R12: 0000000000000001
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399157] R13: ffff99acd9c72cb3 R14: 0000000000000000 R15: ffff99acd9c72cf8
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399159] FS: 00007f071f2c96c0(0000) GS:ffff99acfedc0000(0000) knlGS:0000000000000000
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399160] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399162] CR2: 00007f0717360c10 CR3: 00000004265b6000 CR4: 00000000000406e0
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399164] Call Trace:
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399169] __check_heap_object+0xe7/0x120
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399172] __check_object_size+0x9c/0x1a0
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399470] os_memcpy_to_user+0x21/0x40 [nvidia]
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399649] _nv009377rm+0xbf/0xe0 [nvidia]
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399814] ? _nv028067rm+0x79/0x90 [nvidia]
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.399979] ? _nv028067rm+0x55/0x90 [nvidia]
Jul 1 18:50:09 debian-nitpicking kernel: [ 7.400145] ? _nv013694rm+0xee/0x100 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.371896] ? _nv015342rm+0x154/0x270 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372043] ? _nv008310rm+0x134/0x1a0 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372190] ? _nv008289rm+0x29c/0x2b0 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372337] ? _nv001072rm+0xe/0x20 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372486] ? _nv007316rm+0xd8/0x100 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372621] ? _nv001171rm+0x627/0x830 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372754] ? rm_ioctl+0x73/0x100 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372870] ? nvidia_ioctl+0xf0/0x720 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.372986] ? nvidia_ioctl+0x519/0x720 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373102] ? nvidia_frontend_unlocked_ioctl+0x3e/0x50 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373104] ? do_vfs_ioctl+0xa4/0x630
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373107] ? handle_mm_fault+0xdc/0x210
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373109] ? SyS_ioctl+0x74/0x80
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373111] ? do_syscall_64+0x6c/0x130
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373114] ? entry_SYSCALL_64_after_hwframe+0x3d/0xa2
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373115] Code: 0f 44 d0 53 48 c7 c0 41 de 63 a0 51 48 c7 c6 dd d3 62 a0 41 53 48 89 f9 48 0f 45 f0 4c 89 d2 48 c7 c7 28 df 63 a0 e8 f1 2e ea ff <0f> 0b 49 c7 c1 ac de 64 a0 4d 89 cb 4d 89 c8 eb a5 66 0f 1f 44
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373137] RIP: usercopy_abort+0x69/0x80 RSP: ffffa8e702223b50
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.373173] ---[ end trace db4744b5e9ea5dac ]---
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374473] general protection fault: 0000 [#2] SMP NOPTI
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374475] Modules linked in: appletalk ax25 ipx(C) p8023 p8022 psnap llc pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) cpufreq_powersave cpufreq_conservative cpufreq_userspace binfmt_misc arc4 rt2800usb rt2x00usb rt2800lib rt2x00lib mac80211 cfg80211 crc_ccitt snd_hda_codec_hdmi joydev wmi_bmof rfkill evdev edac_mce_amd kvm_amd ccp rng_core kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel k10temp serio_raw fam15h_power pcspkr sg sp5100_tco shpchp wmi nvidia_drm(PO) button acpi_cpufreq drm_kms_helper drm nvidia_modeset(PO) nvidia(PO) ipmi_devintf snd_hda_codec_realtek ipmi_msghandler snd_hda_codec_generic snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore parport_pc ppdev lp parport ip_tables x_tables autofs4 ext4
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374514] crc16 mbcache jbd2 crc32c_generic fscrypto ecb hid_generic usbhid hid sd_mod ohci_pci ata_generic crc32c_intel ahci aesni_intel pata_atiixp libahci xhci_pci aes_x86_64 ohci_hcd ehci_pci firewire_ohci crypto_simd xhci_hcd ehci_hcd cryptd glue_helper firewire_core libata crc_itu_t i2c_piix4 scsi_mod usbcore r8169 mii usb_common
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374527] CPU: 0 PID: 954 Comm: Xorg Tainted: P D C O 4.16.0-2-amd64 #1 Debian 4.16.16-2
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374528] Hardware name: Gigabyte Technology Co., Ltd. GA-970A-UD3/GA-970A-UD3, BIOS F7 10/22/2012
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374731] RIP: 0010:_nv007214rm+0x25/0x90 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374733] RSP: 0018:ffffa8e702223d20 EFLAGS: 00010006
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374735] RAX: 48e28944ffffff36 RBX: ffffffffc14490f8 RCX: ffffa8e702223db0
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374736] RDX: ffffffffc07ee4d5 RSI: 00000000000003ba RDI: ffffffffc14490f8
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374738] RBP: ffff994b453ddff8 R08: 0000000000000000 R09: ffffa8e702223dac
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374739] R10: 0000000000000000 R11: 00000000ffffff00 R12: 00000000000003ba
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374740] R13: ffff994b4622f800 R14: ffff994b45292d00 R15: ffff994b648f0000
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374742] FS: 00007fe8601096c0(0000) GS:ffff994b7ec00000(0000) knlGS:0000000000000000
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374744] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374745] CR2: 00007fe8581a0c10 CR3: 00000001d820a000 CR4: 00000000000406f0
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374747] Call Trace:
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.374937] ? _nv025895rm+0x13/0x50 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375112] ? _nv035609rm+0x144/0x1e0 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375270] ? rm_free_unused_clients+0x4f/0xe0 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375409] ? nv_check_pci_config_space+0x285/0x320 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375546] ? nvidia_close+0xba/0x350 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375685] ? nvidia_frontend_close+0x2a/0x40 [nvidia]
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375688] ? __fput+0xd0/0x1e0
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375691] ? task_work_run+0x8a/0xb0
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375694] ? do_exit+0x2e1/0xb40
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375696] ? SyS_ioctl+0x74/0x80
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375699] ? rewind_stack_do_exit+0x17/0x20
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375701] Code: 84 00 00 00 00 00 31 c9 48 85 ff 53 48 89 fb 74 0d 48 85 d2 74 08 48 63 47 08 48 8d 0c 10 48 8b 03 31 d2 0f 1f 00 48 85 c0 74 11 <48> 39 30 48 89 c2 76 47 48 8b 40 10 48 85 c0 75 ef 48 85 d2 48
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375913] RIP: _nv007214rm+0x25/0x90 [nvidia] RSP: ffffa8e702223d20
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375916] ---[ end trace db4744b5e9ea5dad ]---
Jul 1 18:46:52 debian-nitpicking kernel: [ 7.375917] Fixing recursive fault but reboot is needed!
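
For anyone skimming an excerpt this long, here is a rough Python sketch of how the two most telling pieces could be pulled out: the modules carrying taint flags in the "Modules linked in:" line, and the slab cache named in the "usercopy:" line. The function names and regular expressions are my own illustration, not part of any existing tool, and the sample module list is a shortened copy of the one in the log. As far as I know the flag letters mean P = proprietary, O = out-of-tree and C = staging.

```python
import re

# Hypothetical helpers for illustration only; not taken from any existing tool.
# A flagged module looks like "nvidia(PO)" or "ipx(C)" in the oops output.
FLAGGED = re.compile(r"\b([A-Za-z0-9_]+)\(([A-Z]+)\)")
USERCOPY = re.compile(
    r"usercopy: Kernel memory exposure attempt detected "
    r"from SLUB object '([^']+)'"
)


def flagged_modules(line):
    """Return {module: flags} for modules listed with taint flags."""
    # Only look at the text after the "Modules linked in:" marker.
    _, _, modules = line.partition("Modules linked in:")
    return {name: flags for name, flags in FLAGGED.findall(modules)}


def usercopy_cache(line):
    """Return the slab cache named in a usercopy report, or None."""
    match = USERCOPY.search(line)
    return match.group(1) if match else None


# Shortened copy of the module line from the log (most unflagged modules dropped).
sample_modules = (
    "kernel: [ 7.399085] Modules linked in: appletalk ax25 ipx(C) pci_stub "
    "vboxdrv(O) nvidia_drm(PO) drm nvidia_modeset(PO) nvidia(PO) ext4"
)
sample_usercopy = (
    "kernel: [ 7.399065] usercopy: Kernel memory exposure attempt detected "
    "from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!"
)

print(flagged_modules(sample_modules))
print(usercopy_cache(sample_usercopy))
```

On this log the flagged set is the nvidia modules, the VirtualBox modules and ipx, and the cache named in the usercopy report is nvidia_stack_cache. That points at the nvidia module as the place to look, though it does not prove where the fault lies.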

At that point, syslog shows the system successfully logging into the
local WiFi, but then nothing until I rebooted 3 minutes later. The
system was unresponsive to the keyboard: even the Caps Lock and Num
Lock lights never lit (even though by default Num Lock should be on
after boot).

Multiple reboots gave the same behavior: I never got either a console
or an X prompt.

I rebooted into the previous kernel (4.16.0-1) and removed 4.16.0-2
for the time being.

Thoughts?
--
Carl Fink
carl@xxxxxxxxxxxxxxx