Bug 6680

Summary: No display on reboot: `... mknod ... failed`
Product: Fedora Reporter: Ankur Sinha "FranciscoD" <sanjay.ankur>
Component: nvidia-kmod Assignee: Nicolas Chauvet <kwizart>
Status: RESOLVED WORKSFORME    
Severity: enhancement CC: leigh123linux
Priority: P1    
Version: f38   
Hardware: x86_64   
OS: GNU/Linux   
Attachments: fpaste --sysinfo output

Description Ankur Sinha "FranciscoD" 2023-05-10 11:25:11 CEST
Created attachment 2501
fpaste --sysinfo output

Hi there,

I have the kmods installed for my GPU here. With the F38 kernels, I do not get a display on reboot. I thought the kmods were not being generated correctly, but that does not seem to be the case. Here's what the journal says for one of the boots that came up without a display:


```
$ journalctl -b -1 --no-pager | grep -C5 nvidia
May 10 10:10:03 atlas kernel: microcode: microcode updated early to revision 0x54, date = 2022-03-17
May 10 10:10:03 atlas kernel: Linux version 6.2.12-300.fc38.x86_64 (mockbuild@54604edad16f4e818e702bda973f7473) (gcc (GCC) 13.0.1 20230401 (Red Hat 13.0.1-0), GNU ld version 2.39-9.fc38) #1 SMP PREEMPT_DYNAMIC Thu Apr 20 23:05:25 UTC 2023
May 10 10:10:03 atlas kernel: Command line: BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x008: 'MPX bounds registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x010: 'MPX CSR'
--
May 10 10:10:03 atlas kernel: pcpu-alloc: s217088 r8192 d28672 u262144 alloc=1*2097152
May 10 10:10:03 atlas kernel: pcpu-alloc: [0] 00 01 02 03 04 05 06 07 [0] 08 09 10 11 12 13 14 15
May 10 10:10:03 atlas kernel: Fallback order for Node 0: 0
May 10 10:10:03 atlas kernel: Built 1 zonelists, mobility grouping on.  Total pages: 16445629
May 10 10:10:03 atlas kernel: Policy zone: Normal
May 10 10:10:03 atlas kernel: Kernel command line: BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target
May 10 10:10:03 atlas kernel: blacklisting initcall simpledrm_platform_driver_init
May 10 10:10:03 atlas kernel: Unknown kernel command line parameters "rhgb BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64", will be passed to user space.
May 10 10:10:03 atlas kernel: Dentry cache hash table entries: 8388608 (order: 14, 67108864 bytes, linear)
May 10 10:10:03 atlas kernel: Inode-cache hash table entries: 4194304 (order: 13, 33554432 bytes, linear)
May 10 10:10:03 atlas kernel: mem auto-init: stack:all(zero), heap alloc:off, heap free:off
--
May 10 10:10:03 atlas systemd[1]: Finished systemd-tmpfiles-setup-dev.service - Create Static Device Nodes in /dev.
May 10 10:10:03 atlas systemd[1]: Finished dracut-cmdline-ask.service - dracut ask for additional cmdline parameters.
May 10 10:10:03 atlas systemd[1]: Finished systemd-tmpfiles-setup.service - Create Volatile Files and Directories.
May 10 10:10:03 atlas systemd[1]: Starting dracut-cmdline.service - dracut cmdline hook...
May 10 10:10:03 atlas dracut-cmdline[356]: dracut-38 (CompNeuro) dracut-059-2.fc38
May 10 10:10:03 atlas dracut-cmdline[356]: Using kernel command line parameters:  rd.driver.pre=btrfs   BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target
May 10 10:10:03 atlas systemd[1]: Finished dracut-cmdline.service - dracut cmdline hook.
May 10 10:10:03 atlas systemd[1]: Starting dracut-pre-udev.service - dracut pre-udev hook...
May 10 10:10:03 atlas systemd[1]: Finished dracut-pre-udev.service - dracut pre-udev hook.
May 10 10:10:03 atlas systemd[1]: Starting systemd-udevd.service - Rule-based Manager for Device Events and Files...
May 10 10:10:03 atlas systemd-udevd[439]: Using default interface naming scheme 'v253'.
--
May 10 10:10:07 atlas systemd[1]: Started mcelog.service - Machine Check Exception Logging Daemon.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas systemd[1]: mdmonitor.service - Software RAID monitoring and management was skipped because of an unmet condition check (ConditionPathExists=/etc/mdadm.conf).
May 10 10:10:07 atlas dbus-broker-launch[888]: Ready
May 10 10:10:07 atlas avahi-daemon[892]: Found user 'avahi' (UID 70) and group 'avahi' (GID 70).
May 10 10:10:07 atlas systemd[1]: Starting nvidia-powerd.service - nvidia-powerd service...
May 10 10:10:07 atlas systemd[1]: Starting polkit.service - Authorization Manager...
May 10 10:10:07 atlas systemd[1]: ssh-host-keys-migration.service - Update OpenSSH host key permissions was skipped because of an unmet condition check (ConditionPathExists=!/var/lib/.ssh-host-keys-migration).
May 10 10:10:07 atlas systemd[1]: sshd-keygen@ecdsa.service - OpenSSH ecdsa Server Key Generation was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: sshd-keygen@ed25519.service - OpenSSH ed25519 Server Key Generation was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: sshd-keygen@rsa.service - OpenSSH rsa Server Key Generation was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: Reached target sshd-keygen.target.
May 10 10:10:07 atlas systemd[1]: sssd.service - System Security Services Daemon was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: Reached target nss-user-lookup.target - User and Group Name Lookups.
May 10 10:10:07 atlas audit: BPF prog-id=69 op=LOAD
May 10 10:10:07 atlas systemd[1]: Starting systemd-homed.service - Home Area Manager...
May 10 10:10:07 atlas /usr/bin/nvidia-powerd[909]: nvidia-powerd version:1.0(build 1)
May 10 10:10:07 atlas audit: BPF prog-id=70 op=LOAD
May 10 10:10:07 atlas audit: BPF prog-id=71 op=LOAD
May 10 10:10:07 atlas audit: BPF prog-id=72 op=LOAD
May 10 10:10:07 atlas systemd[1]: Starting systemd-logind.service - User Login Management...
May 10 10:10:07 atlas audit: BPF prog-id=73 op=LOAD
--
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=abrt-xorg comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas ModemManager[1077]: <info>  ModemManager (version 1.20.6-1.fc38) starting in system bus...
May 10 10:10:07 atlas kernel: NET: Registered PF_QIPCRTR protocol family
May 10 10:10:07 atlas systemd[1]: Started ModemManager.service - Modem Manager.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=ModemManager comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas kernel: nvidia: loading out-of-tree module taints kernel.
May 10 10:10:07 atlas kernel: nvidia: module license 'NVIDIA' taints kernel.
May 10 10:10:07 atlas kernel: Disabling lock debugging due to kernel taint
May 10 10:10:07 atlas kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel
May 10 10:10:07 atlas kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 511
May 10 10:10:07 atlas kernel:
May 10 10:10:07 atlas kernel: nvidia 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
May 10 10:10:07 atlas systemd[1]: Started firewalld.service - firewalld - dynamic firewall daemon.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=firewalld comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas systemd[1]: Reached target network-pre.target - Preparation for Network.
May 10 10:10:07 atlas systemd[1]: Starting NetworkManager.service - Network Manager...
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.6938] NetworkManager (version 1.42.6-1.fc38) is starting... (boot:96172848-eaa0-4fda-ae78-f772d59a259f)
--
May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786250766+01:00" level=info msg=serving... address=/run/containerd/containerd.sock.ttrpc
May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786284767+01:00" level=info msg=serving... address=/run/containerd/containerd.sock
May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786550091+01:00" level=info msg="containerd successfully booted in 0.013326s"
May 10 10:10:07 atlas systemd[1]: Started containerd.service - containerd container runtime.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=containerd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas (udev-worker)[697]: nvidia: Process '/usr/bin/bash -c '/usr/bin/mknod -Z -m 666 /dev/nvidiactl c $(grep nvidia-frontend /proc/devices | cut -d \  -f 1) 255'' failed with exit code 1.
May 10 10:10:07 atlas audit[1078]: NETFILTER_CFG table=firewalld:2 family=1 entries=1 op=nft_register_table pid=1078 subj=system_u:system_r:firewalld_t:s0 comm="firewalld"
May 10 10:10:07 atlas systemd[1]: Started cups.service - CUPS Scheduler.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=cups comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas kernel: nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint.
May 10 10:10:07 atlas systemd[1]: Mounting var-lib-nfs-rpc_pipefs.mount - RPC Pipe File System...
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9604] device (lo): state change: disconnected -> prepare (reason 'none', sys-iface-state: 'external')
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9607] device (lo): state change: prepare -> config (reason 'none', sys-iface-state: 'external')
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9608] device (lo): state change: config -> ip-config (reason 'none', sys-iface-state: 'external')
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9618] modem-manager: ModemManager available
--
May 10 10:10:08 atlas systemd[1]: Finished plymouth-quit.service - Terminate Plymouth Boot Screen.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=plymouth-quit comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:08 atlas systemd[1]: Started getty@tty1.service - Getty on tty1.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=getty@tty1 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:08 atlas systemd[1]: Reached target getty.target - Login Prompts.
May 10 10:10:08 atlas kernel: nvidia-uvm: Loaded the UVM driver, major device number 509.
May 10 10:10:08 atlas (udev-worker)[697]: nvidia: Process '/usr/bin/bash -c 'for i in $(cat /proc/driver/nvidia/gpus/*/information | grep Minor | cut -d \  -f 4); do /usr/bin/mknod -Z -m 666 /dev/nvidia${i} c $(grep nvidia-frontend /proc/devices | cut -d \  -f 1) ${i}; done'' failed with exit code 1.
May 10 10:10:08 atlas kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  530.41.03  Thu Mar 16 19:23:04 UTC 2023
May 10 10:10:08 atlas kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
May 10 10:10:08 atlas avahi-daemon[892]: Server startup complete. Host name is atlas.local. Local service cookie is 3093917225.
May 10 10:10:08 atlas akmods[890]: Checking kmods exist for 6.2.12-300.fc38.x86_64[  OK  ]
May 10 10:10:08 atlas akmods[890]: Checking kmods exist for 6.2.14-300.fc38.x86_64[  OK  ]
May 10 10:10:08 atlas systemd[1]: Finished akmods.service - Builds and install new kmods from akmod packages.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=akmods comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:08 atlas /usr/bin/nvidia-powerd[909]: No matching GPU found
May 10 10:10:08 atlas /usr/bin/nvidia-powerd[909]: Failed to initialize RM Client
May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Main process exited, code=exited, status=1/FAILURE
May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Failed with result 'exit-code'.
May 10 10:10:08 atlas systemd[1]: Failed to start nvidia-powerd.service - nvidia-powerd service.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=nvidia-powerd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Consumed 1.044s CPU time.
May 10 10:10:08 atlas kernel: fbcon: Taking over console
May 10 10:10:09 atlas kernel: [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 1
May 10 10:10:09 atlas systemd[1]: nvidia-fallback.service - Fallback to nouveau as nvidia did not load was skipped because of an unmet condition check (ConditionPathExists=!/sys/module/nvidia).
May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp
May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp
May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp
May 10 10:10:09 atlas thermald[943]: Config file /etc/thermald/thermal-conf.xml does not exist
May 10 10:10:09 atlas thermald[943]: Config file /etc/thermald/thermal-conf.xml does not exist
```
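
For anyone re-checking the two `(udev-worker)` failures in the journal above: both commands look up the major number of `nvidia-frontend` in `/proc/devices` and pass it to `mknod`. Below is a minimal sketch for repeating that lookup by hand without creating anything. The helper names `major_for` and `node_state` are hypothetical and not part of any package. It only narrows things down: `mknod` exits non-zero both when the lookup returns nothing and when the node already exists, and this report does not establish which of the two happened.

```shell
#!/usr/bin/env bash
# Hypothetical helper (an assumption, not shipped anywhere): print the major
# number registered under an exact name, reading /proc/devices-style text
# from stdin. Prints nothing if the name is not registered.
major_for() {
    # Lines look like "195 nvidia-frontend": field 1 is the major, field 2 the name.
    awk -v name="$1" '$2 == name { print $1; exit }'
}

# Hypothetical helper: report the state of a device node without touching it.
node_state() {
    if [ -c "$1" ]; then
        echo "exists"                 # already a character device
    elif [ -e "$1" ]; then
        echo "not-a-char-device"      # something else occupies the path
    else
        echo "missing"
    fi
}
```

On the affected boot, running `major_for nvidia-frontend < /proc/devices` and `node_state /dev/nvidiactl` from a text console should show whether the major number was available and whether the node was already present.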


I've currently booted into my F37 kernel, and that seems to work fine. More info (please let me know if there's anything else I can provide):

(ins)[asinha@atlas  ~]$ rpm -qa \*kernel\* \*nvidia\* | sort -h
abrt-addon-kerneloops-2.16.1-1.fc38.x86_64
akmod-nvidia-530.41.03-1.fc38.x86_64
kernel-6.2.11-200.fc37.x86_64
kernel-6.2.12-300.fc38.x86_64
kernel-6.2.14-300.fc38.x86_64
kernel-core-6.2.11-200.fc37.x86_64
kernel-core-6.2.12-300.fc38.x86_64
kernel-core-6.2.14-300.fc38.x86_64
kernel-devel-6.2.11-200.fc37.x86_64
kernel-devel-6.2.11-300.fc38.x86_64
kernel-devel-6.2.12-300.fc38.x86_64
kernel-devel-6.2.14-300.fc38.x86_64
kernel-devel-matched-6.2.14-300.fc38.x86_64
kernel-headers-6.2.6-300.fc38.x86_64
kernel-modules-6.2.11-200.fc37.x86_64
kernel-modules-6.2.12-300.fc38.x86_64
kernel-modules-6.2.14-300.fc38.x86_64
kernel-modules-core-6.2.11-200.fc37.x86_64
kernel-modules-core-6.2.12-300.fc38.x86_64
kernel-modules-core-6.2.14-300.fc38.x86_64
kernel-modules-extra-6.2.11-200.fc37.x86_64
kernel-modules-extra-6.2.12-300.fc38.x86_64
kernel-modules-extra-6.2.14-300.fc38.x86_64
kernel-srpm-macros-1.0-16.fc38.noarch
kmod-nvidia-6.2.11-200.fc37.x86_64-530.41.03-1.fc37.x86_64
kmod-nvidia-6.2.12-300.fc38.x86_64-530.41.03-1.fc38.x86_64
kmod-nvidia-6.2.14-300.fc38.x86_64-530.41.03-1.fc38.x86_64
libreport-plugin-kerneloops-2.17.9-1.fc38.x86_64
nvidia-gpu-firmware-20230404-149.fc38.noarch
nvidia-persistenced-530.41.03-1.fc38.x86_64
nvidia-settings-530.41.03-1.fc38.x86_64
nvidia-vaapi-driver-0.0.9-1.fc38.x86_64
python3-ipykernel-6.17.1-2.fc38.noarch
texlive-l3kernel-svn65299-65.fc38.noarch
xorg-x11-drv-nvidia-530.41.03-1.fc38.x86_64
xorg-x11-drv-nvidia-cuda-530.41.03-1.fc38.x86_64
xorg-x11-drv-nvidia-cuda-libs-530.41.03-1.fc38.x86_64
xorg-x11-drv-nvidia-kmodsrc-530.41.03-1.fc38.x86_64
xorg-x11-drv-nvidia-libs-530.41.03-1.fc38.x86_64
xorg-x11-drv-nvidia-power-530.41.03-1.fc38.x86_64
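
The check that every installed kernel has a matching kmod can be done mechanically rather than by eye. A rough sketch, assuming the package naming seen in the listing above (`kernel-core-<kver>` and `kmod-nvidia-<kver>-<driver version>`); the function name `kernels_without_kmod` is hypothetical:

```shell
#!/usr/bin/env bash
# Hypothetical helper: read `rpm -qa` output on stdin and print every
# kernel-core version that has no kmod-nvidia package built for it.
kernels_without_kmod() {
    awk '
        # Strip the package name prefix so only the kernel version remains.
        /^kernel-core-/ { sub(/^kernel-core-/, ""); kernels[$0] = 1; next }
        # For kmods, what remains is "<kver>-<driver version>".
        /^kmod-nvidia-/ { sub(/^kmod-nvidia-/, ""); kmods[++n] = $0 }
        END {
            for (k in kernels) {
                found = 0
                for (i = 1; i <= n; i++)
                    if (index(kmods[i], k "-") == 1) found = 1
                if (!found) print k
            }
        }
    '
}
```

Usage would be `rpm -qa | kernels_without_kmod`. For the package list above it should print nothing, which agrees with the `akmods` "Checking kmods exist ... OK" lines in the journal.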



`fpaste --sysinfo` output attached also.

Any ideas on what I'm doing wrong? Thanks
Comment 1 Ankur Sinha "FranciscoD" 2023-05-10 11:31:42 CEST
Looks like the same errors also occur in the F37 kernel, but the driver still loads.

```
$ journalctl -b -1 --no-pager | grep -C5 nvidia
May 10 10:10:03 atlas kernel: microcode: microcode updated early to revision 0x54, date = 2022-03-17
May 10 10:10:03 atlas kernel: Linux version 6.2.12-300.fc38.x86_64 (mockbuild@54604edad16f4e818e702bda973f7473) (gcc (GCC) 13.0.1 20230401 (Red Hat 13.0.1-0), GNU ld version 2.39-9.fc38) #1 SMP PREEMPT_DYNAMIC Thu Apr 20 23:05:25 UTC 2023
May 10 10:10:03 atlas kernel: Command line: BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x008: 'MPX bounds registers'
May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x010: 'MPX CSR'
--
May 10 10:10:03 atlas kernel: pcpu-alloc: s217088 r8192 d28672 u262144 alloc=1*2097152
May 10 10:10:03 atlas kernel: pcpu-alloc: [0] 00 01 02 03 04 05 06 07 [0] 08 09 10 11 12 13 14 15
May 10 10:10:03 atlas kernel: Fallback order for Node 0: 0
May 10 10:10:03 atlas kernel: Built 1 zonelists, mobility grouping on.  Total pages: 16445629
May 10 10:10:03 atlas kernel: Policy zone: Normal
May 10 10:10:03 atlas kernel: Kernel command line: BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target
May 10 10:10:03 atlas kernel: blacklisting initcall simpledrm_platform_driver_init
May 10 10:10:03 atlas kernel: Unknown kernel command line parameters "rhgb BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64", will be passed to user space.
May 10 10:10:03 atlas kernel: Dentry cache hash table entries: 8388608 (order: 14, 67108864 bytes, linear)
May 10 10:10:03 atlas kernel: Inode-cache hash table entries: 4194304 (order: 13, 33554432 bytes, linear)
May 10 10:10:03 atlas kernel: mem auto-init: stack:all(zero), heap alloc:off, heap free:off
--
May 10 10:10:03 atlas systemd[1]: Finished systemd-tmpfiles-setup-dev.service - Create Static Device Nodes in /dev.
May 10 10:10:03 atlas systemd[1]: Finished dracut-cmdline-ask.service - dracut ask for additional cmdline parameters.
May 10 10:10:03 atlas systemd[1]: Finished systemd-tmpfiles-setup.service - Create Volatile Files and Directories.
May 10 10:10:03 atlas systemd[1]: Starting dracut-cmdline.service - dracut cmdline hook...
May 10 10:10:03 atlas dracut-cmdline[356]: dracut-38 (CompNeuro) dracut-059-2.fc38
May 10 10:10:03 atlas dracut-cmdline[356]: Using kernel command line parameters:  rd.driver.pre=btrfs   BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target
May 10 10:10:03 atlas systemd[1]: Finished dracut-cmdline.service - dracut cmdline hook.
May 10 10:10:03 atlas systemd[1]: Starting dracut-pre-udev.service - dracut pre-udev hook...
May 10 10:10:03 atlas systemd[1]: Finished dracut-pre-udev.service - dracut pre-udev hook.
May 10 10:10:03 atlas systemd[1]: Starting systemd-udevd.service - Rule-based Manager for Device Events and Files...
May 10 10:10:03 atlas systemd-udevd[439]: Using default interface naming scheme 'v253'.
--
May 10 10:10:07 atlas systemd[1]: Started mcelog.service - Machine Check Exception Logging Daemon.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas systemd[1]: mdmonitor.service - Software RAID monitoring and management was skipped because of an unmet condition check (ConditionPathExists=/etc/mdadm.conf).
May 10 10:10:07 atlas dbus-broker-launch[888]: Ready
May 10 10:10:07 atlas avahi-daemon[892]: Found user 'avahi' (UID 70) and group 'avahi' (GID 70).
May 10 10:10:07 atlas systemd[1]: Starting nvidia-powerd.service - nvidia-powerd service...
May 10 10:10:07 atlas systemd[1]: Starting polkit.service - Authorization Manager...
May 10 10:10:07 atlas systemd[1]: ssh-host-keys-migration.service - Update OpenSSH host key permissions was skipped because of an unmet condition check (ConditionPathExists=!/var/lib/.ssh-host-keys-migration).
May 10 10:10:07 atlas systemd[1]: sshd-keygen@ecdsa.service - OpenSSH ecdsa Server Key Generation was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: sshd-keygen@ed25519.service - OpenSSH ed25519 Server Key Generation was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: sshd-keygen@rsa.service - OpenSSH rsa Server Key Generation was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: Reached target sshd-keygen.target.
May 10 10:10:07 atlas systemd[1]: sssd.service - System Security Services Daemon was skipped because no trigger condition checks were met.
May 10 10:10:07 atlas systemd[1]: Reached target nss-user-lookup.target - User and Group Name Lookups.
May 10 10:10:07 atlas audit: BPF prog-id=69 op=LOAD
May 10 10:10:07 atlas systemd[1]: Starting systemd-homed.service - Home Area Manager...
May 10 10:10:07 atlas /usr/bin/nvidia-powerd[909]: nvidia-powerd version:1.0(build 1)
May 10 10:10:07 atlas audit: BPF prog-id=70 op=LOAD
May 10 10:10:07 atlas audit: BPF prog-id=71 op=LOAD
May 10 10:10:07 atlas audit: BPF prog-id=72 op=LOAD
May 10 10:10:07 atlas systemd[1]: Starting systemd-logind.service - User Login Management...
May 10 10:10:07 atlas audit: BPF prog-id=73 op=LOAD
--
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=abrt-xorg comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas ModemManager[1077]: <info>  ModemManager (version 1.20.6-1.fc38) starting in system bus...
May 10 10:10:07 atlas kernel: NET: Registered PF_QIPCRTR protocol family
May 10 10:10:07 atlas systemd[1]: Started ModemManager.service - Modem Manager.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=ModemManager comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas kernel: nvidia: loading out-of-tree module taints kernel.
May 10 10:10:07 atlas kernel: nvidia: module license 'NVIDIA' taints kernel.
May 10 10:10:07 atlas kernel: Disabling lock debugging due to kernel taint
May 10 10:10:07 atlas kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel
May 10 10:10:07 atlas kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 511
May 10 10:10:07 atlas kernel:
May 10 10:10:07 atlas kernel: nvidia 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
May 10 10:10:07 atlas systemd[1]: Started firewalld.service - firewalld - dynamic firewall daemon.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=firewalld comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas systemd[1]: Reached target network-pre.target - Preparation for Network.
May 10 10:10:07 atlas systemd[1]: Starting NetworkManager.service - Network Manager...
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.6938] NetworkManager (version 1.42.6-1.fc38) is starting... (boot:96172848-eaa0-4fda-ae78-f772d59a259f)
--
May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786250766+01:00" level=info msg=serving... address=/run/containerd/containerd.sock.ttrpc
May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786284767+01:00" level=info msg=serving... address=/run/containerd/containerd.sock
May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786550091+01:00" level=info msg="containerd successfully booted in 0.013326s"
May 10 10:10:07 atlas systemd[1]: Started containerd.service - containerd container runtime.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=containerd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas (udev-worker)[697]: nvidia: Process '/usr/bin/bash -c '/usr/bin/mknod -Z -m 666 /dev/nvidiactl c $(grep nvidia-frontend /proc/devices | cut -d \  -f 1) 255'' failed with exit code 1.
May 10 10:10:07 atlas audit[1078]: NETFILTER_CFG table=firewalld:2 family=1 entries=1 op=nft_register_table pid=1078 subj=system_u:system_r:firewalld_t:s0 comm="firewalld"
May 10 10:10:07 atlas systemd[1]: Started cups.service - CUPS Scheduler.
May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=cups comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:07 atlas kernel: nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint.
May 10 10:10:07 atlas systemd[1]: Mounting var-lib-nfs-rpc_pipefs.mount - RPC Pipe File System...
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9604] device (lo): state change: disconnected -> prepare (reason 'none', sys-iface-state: 'external')
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9607] device (lo): state change: prepare -> config (reason 'none', sys-iface-state: 'external')
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9608] device (lo): state change: config -> ip-config (reason 'none', sys-iface-state: 'external')
May 10 10:10:07 atlas NetworkManager[1095]: <info>  [1683709807.9618] modem-manager: ModemManager available
--
May 10 10:10:08 atlas systemd[1]: Finished plymouth-quit.service - Terminate Plymouth Boot Screen.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=plymouth-quit comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:08 atlas systemd[1]: Started getty@tty1.service - Getty on tty1.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=getty@tty1 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:08 atlas systemd[1]: Reached target getty.target - Login Prompts.
May 10 10:10:08 atlas kernel: nvidia-uvm: Loaded the UVM driver, major device number 509.
May 10 10:10:08 atlas (udev-worker)[697]: nvidia: Process '/usr/bin/bash -c 'for i in $(cat /proc/driver/nvidia/gpus/*/information | grep Minor | cut -d \  -f 4); do /usr/bin/mknod -Z -m 666 /dev/nvidia${i} c $(grep nvidia-frontend /proc/devices | cut -d \  -f 1) ${i}; done'' failed with exit code 1.
May 10 10:10:08 atlas kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  530.41.03  Thu Mar 16 19:23:04 UTC 2023
May 10 10:10:08 atlas kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
May 10 10:10:08 atlas avahi-daemon[892]: Server startup complete. Host name is atlas.local. Local service cookie is 3093917225.
May 10 10:10:08 atlas akmods[890]: Checking kmods exist for 6.2.12-300.fc38.x86_64[  OK  ]
May 10 10:10:08 atlas akmods[890]: Checking kmods exist for 6.2.14-300.fc38.x86_64[  OK  ]
May 10 10:10:08 atlas systemd[1]: Finished akmods.service - Builds and install new kmods from akmod packages.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=akmods comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
May 10 10:10:08 atlas /usr/bin/nvidia-powerd[909]: No matching GPU found
May 10 10:10:08 atlas /usr/bin/nvidia-powerd[909]: Failed to initialize RM Client
May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Main process exited, code=exited, status=1/FAILURE
May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Failed with result 'exit-code'.
May 10 10:10:08 atlas systemd[1]: Failed to start nvidia-powerd.service - nvidia-powerd service.
May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=nvidia-powerd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Consumed 1.044s CPU time.
May 10 10:10:08 atlas kernel: fbcon: Taking over console
May 10 10:10:09 atlas kernel: [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 1
May 10 10:10:09 atlas systemd[1]: nvidia-fallback.service - Fallback to nouveau as nvidia did not load was skipped because of an unmet condition check (ConditionPathExists=!/sys/module/nvidia).
May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp
May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp
May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp
May 10 10:10:09 atlas thermald[943]: Config file /etc/thermald/thermal-conf.xml does not exist
May 10 10:10:09 atlas thermald[943]: Config file /etc/thermald/thermal-conf.xml does not exist

```
Comment 2 Ankur Sinha "FranciscoD" 2023-05-25 09:44:28 CEST
Seems to have fixed itself: no issues with the latest kernel update. Closing.