| Summary: | No display on reboot: `... mknod ... failed` | ||
|---|---|---|---|
| Product: | Fedora | Reporter: | Ankur Sinha "FranciscoD" <sanjay.ankur> |
| Component: | nvidia-kmod | Assignee: | Nicolas Chauvet <kwizart> |
| Status: | RESOLVED WORKSFORME | ||
| Severity: | enhancement | CC: | leigh123linux |
| Priority: | P1 | ||
| Version: | f38 | ||
| Hardware: | x86_64 | ||
| OS: | GNU/Linux | ||
| namespace: | |||
| Attachments: | fpaste --sysinfo output | ||
|
Description
Ankur Sinha "FranciscoD"
2023-05-10 11:25:11 CEST
Looks like the same errors also occur in the F37 kernel, but the driver still loads.. ``` $ journalctl -b -1 --no-pager | grep -C5 nvidia May 10 10:10:03 atlas kernel: microcode: microcode updated early to revision 0x54, date = 2022-03-17 May 10 10:10:03 atlas kernel: Linux version 6.2.12-300.fc38.x86_64 (mockbuild@54604edad16f4e818e702bda973f7473) (gcc (GCC) 13.0.1 20230401 (Red Hat 13.0.1-0), GNU ld version 2.39-9.fc38) #1 SMP PREEMPT_DYNAMIC Thu Apr 20 23:05:25 UTC 2023 May 10 10:10:03 atlas kernel: Command line: BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers' May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers' May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers' May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x008: 'MPX bounds registers' May 10 10:10:03 atlas kernel: x86/fpu: Supporting XSAVE feature 0x010: 'MPX CSR' -- May 10 10:10:03 atlas kernel: pcpu-alloc: s217088 r8192 d28672 u262144 alloc=1*2097152 May 10 10:10:03 atlas kernel: pcpu-alloc: [0] 00 01 02 03 04 05 06 07 [0] 08 09 10 11 12 13 14 15 May 10 10:10:03 atlas kernel: Fallback order for Node 0: 0 May 10 10:10:03 atlas kernel: Built 1 zonelists, mobility grouping on. Total pages: 16445629 May 10 10:10:03 atlas kernel: Policy zone: Normal May 10 10:10:03 atlas kernel: Kernel command line: BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target May 10 10:10:03 atlas kernel: blacklisting initcall simpledrm_platform_driver_init May 10 10:10:03 atlas kernel: Unknown kernel command line parameters "rhgb BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64", will be passed to user space. May 10 10:10:03 atlas kernel: Dentry cache hash table entries: 8388608 (order: 14, 67108864 bytes, linear) May 10 10:10:03 atlas kernel: Inode-cache hash table entries: 4194304 (order: 13, 33554432 bytes, linear) May 10 10:10:03 atlas kernel: mem auto-init: stack:all(zero), heap alloc:off, heap free:off -- May 10 10:10:03 atlas systemd[1]: Finished systemd-tmpfiles-setup-dev.service - Create Static Device Nodes in /dev. May 10 10:10:03 atlas systemd[1]: Finished dracut-cmdline-ask.service - dracut ask for additional cmdline parameters. May 10 10:10:03 atlas systemd[1]: Finished systemd-tmpfiles-setup.service - Create Volatile Files and Directories. May 10 10:10:03 atlas systemd[1]: Starting dracut-cmdline.service - dracut cmdline hook... May 10 10:10:03 atlas dracut-cmdline[356]: dracut-38 (CompNeuro) dracut-059-2.fc38 May 10 10:10:03 atlas dracut-cmdline[356]: Using kernel command line parameters: rd.driver.pre=btrfs BOOT_IMAGE=(hd0,gpt6)/vmlinuz-6.2.12-300.fc38.x86_64 root=UUID=b90718c5-99d6-4965-aa20-67b6f6224b06 ro rootflags=subvol=root rhgb quiet rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 initcall_blacklist=simpledrm_platform_driver_init systemd.unit=multi-user.target May 10 10:10:03 atlas systemd[1]: Finished dracut-cmdline.service - dracut cmdline hook. May 10 10:10:03 atlas systemd[1]: Starting dracut-pre-udev.service - dracut pre-udev hook... May 10 10:10:03 atlas systemd[1]: Finished dracut-pre-udev.service - dracut pre-udev hook. May 10 10:10:03 atlas systemd[1]: Starting systemd-udevd.service - Rule-based Manager for Device Events and Files... May 10 10:10:03 atlas systemd-udevd[439]: Using default interface naming scheme 'v253'. -- May 10 10:10:07 atlas systemd[1]: Started mcelog.service - Machine Check Exception Logging Daemon. May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:07 atlas systemd[1]: mdmonitor.service - Software RAID monitoring and management was skipped because of an unmet condition check (ConditionPathExists=/etc/mdadm.conf). May 10 10:10:07 atlas dbus-broker-launch[888]: Ready May 10 10:10:07 atlas avahi-daemon[892]: Found user 'avahi' (UID 70) and group 'avahi' (GID 70). May 10 10:10:07 atlas systemd[1]: Starting nvidia-powerd.service - nvidia-powerd service... May 10 10:10:07 atlas systemd[1]: Starting polkit.service - Authorization Manager... May 10 10:10:07 atlas systemd[1]: ssh-host-keys-migration.service - Update OpenSSH host key permissions was skipped because of an unmet condition check (ConditionPathExists=!/var/lib/.ssh-host-keys-migration). May 10 10:10:07 atlas systemd[1]: sshd-keygen@ecdsa.service - OpenSSH ecdsa Server Key Generation was skipped because no trigger condition checks were met. May 10 10:10:07 atlas systemd[1]: sshd-keygen@ed25519.service - OpenSSH ed25519 Server Key Generation was skipped because no trigger condition checks were met. May 10 10:10:07 atlas systemd[1]: sshd-keygen@rsa.service - OpenSSH rsa Server Key Generation was skipped because no trigger condition checks were met. May 10 10:10:07 atlas systemd[1]: Reached target sshd-keygen.target. May 10 10:10:07 atlas systemd[1]: sssd.service - System Security Services Daemon was skipped because no trigger condition checks were met. May 10 10:10:07 atlas systemd[1]: Reached target nss-user-lookup.target - User and Group Name Lookups. May 10 10:10:07 atlas audit: BPF prog-id=69 op=LOAD May 10 10:10:07 atlas systemd[1]: Starting systemd-homed.service - Home Area Manager... May 10 10:10:07 atlas /usr/bin/nvidia-powerd[909]: nvidia-powerd version:1.0(build 1) May 10 10:10:07 atlas audit: BPF prog-id=70 op=LOAD May 10 10:10:07 atlas audit: BPF prog-id=71 op=LOAD May 10 10:10:07 atlas audit: BPF prog-id=72 op=LOAD May 10 10:10:07 atlas systemd[1]: Starting systemd-logind.service - User Login Management... May 10 10:10:07 atlas audit: BPF prog-id=73 op=LOAD -- May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=abrt-xorg comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:07 atlas ModemManager[1077]: <info> ModemManager (version 1.20.6-1.fc38) starting in system bus... May 10 10:10:07 atlas kernel: NET: Registered PF_QIPCRTR protocol family May 10 10:10:07 atlas systemd[1]: Started ModemManager.service - Modem Manager. May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=ModemManager comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:07 atlas kernel: nvidia: loading out-of-tree module taints kernel. May 10 10:10:07 atlas kernel: nvidia: module license 'NVIDIA' taints kernel. May 10 10:10:07 atlas kernel: Disabling lock debugging due to kernel taint May 10 10:10:07 atlas kernel: nvidia: module verification failed: signature and/or required key missing - tainting kernel May 10 10:10:07 atlas kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 511 May 10 10:10:07 atlas kernel: May 10 10:10:07 atlas kernel: nvidia 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none May 10 10:10:07 atlas systemd[1]: Started firewalld.service - firewalld - dynamic firewall daemon. May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=firewalld comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:07 atlas systemd[1]: Reached target network-pre.target - Preparation for Network. May 10 10:10:07 atlas systemd[1]: Starting NetworkManager.service - Network Manager... May 10 10:10:07 atlas NetworkManager[1095]: <info> [1683709807.6938] NetworkManager (version 1.42.6-1.fc38) is starting... (boot:96172848-eaa0-4fda-ae78-f772d59a259f) -- May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786250766+01:00" level=info msg=serving... address=/run/containerd/containerd.sock.ttrpc May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786284767+01:00" level=info msg=serving... address=/run/containerd/containerd.sock May 10 10:10:07 atlas containerd[1121]: time="2023-05-10T10:10:07.786550091+01:00" level=info msg="containerd successfully booted in 0.013326s" May 10 10:10:07 atlas systemd[1]: Started containerd.service - containerd container runtime. May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=containerd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:07 atlas (udev-worker)[697]: nvidia: Process '/usr/bin/bash -c '/usr/bin/mknod -Z -m 666 /dev/nvidiactl c $(grep nvidia-frontend /proc/devices | cut -d \ -f 1) 255'' failed with exit code 1. May 10 10:10:07 atlas audit[1078]: NETFILTER_CFG table=firewalld:2 family=1 entries=1 op=nft_register_table pid=1078 subj=system_u:system_r:firewalld_t:s0 comm="firewalld" May 10 10:10:07 atlas systemd[1]: Started cups.service - CUPS Scheduler. May 10 10:10:07 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=cups comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:07 atlas kernel: nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint. May 10 10:10:07 atlas systemd[1]: Mounting var-lib-nfs-rpc_pipefs.mount - RPC Pipe File System... May 10 10:10:07 atlas NetworkManager[1095]: <info> [1683709807.9604] device (lo): state change: disconnected -> prepare (reason 'none', sys-iface-state: 'external') May 10 10:10:07 atlas NetworkManager[1095]: <info> [1683709807.9607] device (lo): state change: prepare -> config (reason 'none', sys-iface-state: 'external') May 10 10:10:07 atlas NetworkManager[1095]: <info> [1683709807.9608] device (lo): state change: config -> ip-config (reason 'none', sys-iface-state: 'external') May 10 10:10:07 atlas NetworkManager[1095]: <info> [1683709807.9618] modem-manager: ModemManager available -- May 10 10:10:08 atlas systemd[1]: Finished plymouth-quit.service - Terminate Plymouth Boot Screen. May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=plymouth-quit comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:08 atlas systemd[1]: Started getty@tty1.service - Getty on tty1. May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=getty@tty1 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:08 atlas systemd[1]: Reached target getty.target - Login Prompts. May 10 10:10:08 atlas kernel: nvidia-uvm: Loaded the UVM driver, major device number 509. May 10 10:10:08 atlas (udev-worker)[697]: nvidia: Process '/usr/bin/bash -c 'for i in $(cat /proc/driver/nvidia/gpus/*/information | grep Minor | cut -d \ -f 4); do /usr/bin/mknod -Z -m 666 /dev/nvidia${i} c $(grep nvidia-frontend /proc/devices | cut -d \ -f 1) ${i}; done'' failed with exit code 1. May 10 10:10:08 atlas kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 530.41.03 Thu Mar 16 19:23:04 UTC 2023 May 10 10:10:08 atlas kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver May 10 10:10:08 atlas avahi-daemon[892]: Server startup complete. Host name is atlas.local. Local service cookie is 3093917225. May 10 10:10:08 atlas akmods[890]: Checking kmods exist for 6.2.12-300.fc38.x86_64[ OK ] May 10 10:10:08 atlas akmods[890]: Checking kmods exist for 6.2.14-300.fc38.x86_64[ OK ] May 10 10:10:08 atlas systemd[1]: Finished akmods.service - Builds and install new kmods from akmod packages. May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=akmods comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' May 10 10:10:08 atlas /usr/bin/nvidia-powerd[909]: No matching GPU found May 10 10:10:08 atlas /usr/bin/nvidia-powerd[909]: Failed to initialize RM Client May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Main process exited, code=exited, status=1/FAILURE May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Failed with result 'exit-code'. May 10 10:10:08 atlas systemd[1]: Failed to start nvidia-powerd.service - nvidia-powerd service. May 10 10:10:08 atlas audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=nvidia-powerd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed' May 10 10:10:08 atlas systemd[1]: nvidia-powerd.service: Consumed 1.044s CPU time. May 10 10:10:08 atlas kernel: fbcon: Taking over console May 10 10:10:09 atlas kernel: [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 1 May 10 10:10:09 atlas systemd[1]: nvidia-fallback.service - Fallback to nouveau as nvidia did not load was skipped because of an unmet condition check (ConditionPathExists=!/sys/module/nvidia). May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp May 10 10:10:09 atlas thermald[943]: sensor id 10 : No temp sysfs for reading raw temp May 10 10:10:09 atlas thermald[943]: Config file /etc/thermald/thermal-conf.xml does not exist May 10 10:10:09 atlas thermald[943]: Config file /etc/thermald/thermal-conf.xml does not exist ``` Seems to have fixed itself---no issues with the latest kernel update. Closing. |