Bug 1397

Summary: Kernel 2.6.34.6-47 isn't booting because of nvidia driver
Product: Fedora
Reporter: Rafael Louback Ferraz <ferrazrafael>
Component: nvidia-newest-kmod
Assignee: Nicolas Chauvet <kwizart>
Status: RESOLVED WORKSFORME
Severity: major
CC: fedora, s.adam
Priority: P5
Version: 13
Hardware: All
OS: GNU/Linux
Attachments:
  nvidia-bug-report.log
  nvidia-bug-report.log (correct)
  nvidia-bug-report.log, driver from testing (newer version)
  A bug report when the pc doesnt boot properly
  A bug report when the pc boot properly
  log error with official nvidia 256.53 driver

Description Rafael Louback Ferraz 2010-09-06 19:17:02 CEST
This can be tracked in:

https://bugzilla.redhat.com/show_bug.cgi?id=629698
Comment 1 Nicolas Chauvet 2010-09-06 22:05:13 CEST
Can you provide the output of nvidia-bug-report.sh?
That said, you would probably do better to try the driver from
rpmfusion-nonfree-updates-testing once it hits the repository.
Comment 2 Rafael Louback Ferraz 2010-09-06 22:18:09 CEST
Created attachment 480 [details]
nvidia-bug-report.log

This log was generated while running the X server with the vesa driver, because I can't boot up.
Comment 3 Nicolas Chauvet 2010-09-06 22:31:04 CEST
Can you show /var/log/Xorg.0.log.old then?
Comment 4 Rafael Louback Ferraz 2010-09-06 22:36:23 CEST
Created attachment 481 [details]
nvidia-bug-report.log (correct)

I remembered that I could press Ctrl+Alt+Fx to switch terminals. This is a log taken while running the nvidia driver (I enabled the testing repo to install that kmod).
Comment 5 Rafael Louback Ferraz 2010-09-06 22:37:00 CEST
Comment on attachment 480 [details]
nvidia-bug-report.log

wrong log
Comment 6 Nicolas Chauvet 2010-09-07 09:18:55 CEST
Please update to rpmfusion-nonfree-updates-testing version.
If this doesn't solve your problem, please re-open.
Comment 7 Rafael Louback Ferraz 2010-09-07 17:22:44 CEST
(In reply to comment #6)
> Please update to rpmfusion-nonfree-updates-testing version.
> If this doesn't solve your problem, please re-open.
> 

That is what I was running when I made the correct nvidia log!
Comment 8 Rafael Louback Ferraz 2010-09-07 17:25:20 CEST
(In reply to comment #7)
> (In reply to comment #6)
> > Please update to rpmfusion-nonfree-updates-testing version.
> > If this doesn't solve your problem, please re-open.
> > 
> 
> I made this running the correct nvidia log!
> 

Sorry, now I see that there is a new driver version in testing; it wasn't available yesterday.
I will test it...
Comment 9 Rafael Louback Ferraz 2010-09-07 19:46:59 CEST
Created attachment 482 [details]
nvidia-bug-report.log, driver from testing (newer version)

The error persists with the driver from the testing repo. Here is the log.
Comment 10 Nicolas Chauvet 2010-09-07 21:07:17 CEST
Your report is invalid: you need to reboot in order to have a driver that matches your kernel module.
Comment 11 Rafael Louback Ferraz 2010-09-08 03:05:41 CEST
Created attachment 483 [details]
A bug report when the pc doesnt boot properly

The problem persists, but intermittently: sometimes it boots, sometimes it doesn't. Here is the log from when a crash happens.
Comment 12 Rafael Louback Ferraz 2010-09-08 03:07:07 CEST
Created attachment 484 [details]
A bug report when the pc boot properly

This is the report from when Fedora starts correctly.
Comment 13 Rafael Louback Ferraz 2010-09-08 03:07:59 CEST
Problem still there...
Comment 14 Nicolas Chauvet 2010-09-08 09:36:30 CEST
(In reply to comment #13)
> Problem still there...
> 

There is indeed a problem: the nvidia module is loaded into memory but does not show up in /proc/interrupts.
You can try adding "options nvidia NVreg_EnableMSI=1" to /etc/modprobe.d/nvidia.conf
and rebooting.

Please forward this report to nvidia (you can reference this bug) and forward us the answer.
Thx for the report.
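
A minimal sketch of how that suggestion could be checked from a shell, assuming the file path given above. modprobe configuration uses the keyword options (plural), and the parameter name contains an underscore, so a misspelled line would not have the intended effect. The helper names has_msi_option and nvidia_irq_lines are made up for this illustration and are not part of any package.

```shell
# Hypothetical helpers; names are illustrative only.

# has_msi_option FILE: succeed if FILE has a well-formed
# "options nvidia ... NVreg_EnableMSI=1" line.
has_msi_option() {
    grep -Eq '^[[:space:]]*options[[:space:]]+nvidia[[:space:]]+(.*[[:space:]])?NVreg_EnableMSI=1([[:space:]]|$)' "$1"
}

# nvidia_irq_lines [FILE]: print the interrupt lines that mention nvidia
# (FILE defaults to /proc/interrupts); prints nothing if there are none.
nvidia_irq_lines() {
    grep -i 'nvidia' "${1:-/proc/interrupts}"
}

# Example use; guarded so it does nothing on machines without the file.
conf=/etc/modprobe.d/nvidia.conf
if [ -r "$conf" ]; then
    if has_msi_option "$conf"; then
        echo "MSI option present in $conf"
    else
        echo "MSI option missing or misspelled in $conf"
    fi
fi
```

After a reboot, running nvidia_irq_lines with no argument would show whether the driver has registered an interrupt at all.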
Comment 15 Rafael Louback Ferraz 2010-09-10 19:46:16 CEST
Created attachment 488 [details]
log error with official nvidia 256.53 driver

OK, I sent them the error report, but no reply so far...
Comment 16 Rafael Louback Ferraz 2010-09-10 21:10:07 CEST
still reproducible in kernel 2.6.34.6.53
Comment 17 Nicolas Chauvet 2010-09-10 21:42:39 CEST
(In reply to comment #16)
> still reproducible in kernel 2.6.34.6.53
Were you able to reproduce with kernel-2.6.33.8-149.fc13?
http://koji.fedoraproject.org/koji/buildinfo?buildID=190700


Comment 18 Rafael Louback Ferraz 2010-09-10 23:03:59 CEST
(In reply to comment #17)
> (In reply to comment #16)
> > still reproducible in kernel 2.6.34.6.53
> Were you able to reproduce with kernel-2.6.33.8-149.fc13?
> http://koji.fedoraproject.org/koji/buildinfo?buildID=190700
> 

NO!

(I put "option nvidia NVregEnableMSI=1" in /etc/modprobe.d/options.conf, and it seems to be working properly with kernel 2.6.34.6.53.)
Comment 19 Rafael Louback Ferraz 2010-09-12 06:50:14 CEST
It seems that the problem is not solved by the NVreg_EnableMSI=1 option, but it is happening less often.
Comment 20 Rafael Louback Ferraz 2010-09-25 22:43:43 CEST
(In reply to comment #19)
> Seems that the problem is not solved with the NVreg_EnableMSI=1 option, but is
> happening less often
> 

Now, after some Fedora updates, the system doesn't boot at all with the nvidia driver, even with NVreg_EnableMSI=1.
Comment 21 Nicolas Chauvet 2010-09-27 09:47:04 CEST
Did you get an answer from nvidia about this issue?
Did you send the nvidia-bug-report.sh output by email or via nvnews.net?
Comment 22 Rafael Louback Ferraz 2010-09-27 17:35:22 CEST
(In reply to comment #21)
> Did you get an answer from nvidia about this issue?
> Did you send the nvidia-bug-report.sh output by email or via nvnews.net?
> 

I sent them this bug by email and also posted it on nvnews.
No answer so far...
Comment 23 Rafael Louback Ferraz 2010-10-04 14:23:06 CEST
I was digging around on the web and found this thread:

http://ubuntuforums.org/showthread.php?t=1002287&page=2

Looking in the nvidia error log, I found these messages:

vmap allocation for size 16781312 failed: use vmalloc=<size> to increase size.
NVRM: RmInitAdapter failed! (0x26:0xffffffff:1027)

Could this be a vmalloc problem too?
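
To put the first message in perspective, the failed request can be converted from bytes: 16781312 = 16 x 1048576 + 4096, that is 16 MiB plus one 4 KiB page (presumably the guard page the kernel appends to a vmalloc area). A small sketch that extracts such sizes from a saved log follows; the function name vmap_failures is an assumption made for this example.

```shell
# Hypothetical helper; the name is illustrative only.

# vmap_failures FILE: for every "vmap allocation for size N failed" line in
# FILE, print N in bytes and as whole MiB plus the remaining bytes.
vmap_failures() {
    sed -n 's/.*vmap allocation for size \([0-9][0-9]*\) failed.*/\1/p' "$1" |
    while read -r bytes; do
        echo "$bytes bytes = $((bytes / 1048576)) MiB + $((bytes % 1048576)) bytes"
    done
}
```

It could be run against one of the attached reports, for example: vmap_failures nvidia-bug-report.log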
Comment 24 Rafael Louback Ferraz 2010-10-04 15:08:34 CEST
After adding vmalloc=256MB to the kernel line in /etc/grub.conf, it works.

Couldn't Fedora find the right vmalloc size on the fly?
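
For context, a hedged sketch of how the current setting could be inspected. On 32-bit x86 the kernel reserves a fixed vmalloc area (128 MB by default) rather than sizing it per device, and the vmalloc= boot parameter overrides that default; the kernel documentation gives its form as vmalloc=nn[KMG], for example vmalloc=256M. The helper names below are invented for this illustration, while /proc/meminfo and /proc/cmdline are the standard kernel interfaces.

```shell
# Hypothetical helpers; names are illustrative only.

# vmalloc_total_mib [FILE]: print VmallocTotal in whole MiB
# (FILE defaults to /proc/meminfo, which reports the value in kB).
vmalloc_total_mib() {
    awk '/^VmallocTotal:/ { printf "%d\n", $2 / 1024 }' "${1:-/proc/meminfo}"
}

# cmdline_vmalloc [FILE]: print the value of any vmalloc= argument on the
# kernel command line (FILE defaults to /proc/cmdline); prints nothing if unset.
cmdline_vmalloc() {
    tr ' ' '\n' < "${1:-/proc/cmdline}" | sed -n 's/^vmalloc=//p'
}
```

Calling both functions without arguments before and after editing the kernel line would show whether the parameter was picked up and how large the vmalloc area actually is.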
Comment 25 Nicolas Chauvet 2010-10-17 18:50:04 CEST
Does it behave better with the driver from rpmfusion-nonfree-updates-testing?
Comment 26 Rafael Louback Ferraz 2010-10-23 18:19:47 CEST
No, the new nvidia driver (260.12) from testing didn't solve this. I still need to put vmalloc=256MB on the kernel boot line for it to work.

But this new driver does solve a bug that was showing up in OpenOffice:
https://bugzilla.redhat.com/show_bug.cgi?id=643652
Comment 27 Rafael Louback Ferraz 2010-10-23 18:21:41 CEST
How does Fedora determine the right vmalloc size without this flag? Shouldn't Fedora work this out on the fly?
Comment 28 Nicolas Chauvet 2010-10-23 18:29:25 CEST
(In reply to comment #27)
> How does Fedora determine the right vmalloc size without this flag? Shouldn't
> Fedora work this out on the fly?
I don't know. It might be worth asking on nvnews.net.
Do you have an updated BIOS?
Comment 29 Rafael Louback Ferraz 2010-10-23 19:26:31 CEST
(In reply to comment #28)
> I don't know. It might be worth asking on nvnews.net.
> Do you have an updated BIOS?

I updated the motherboard BIOS. The website of my notebook's manufacturer doesn't offer a video card BIOS update.

Comment 30 Nicolas Chauvet 2010-11-11 23:22:46 CET
Can you reproduce (again) with the latest 260.19.21 from rpmfusion-nonfree-updates-testing?
If yes, then please report this to nvnews.net; there is nothing I can do about this as a package maintainer.

Thx
Comment 31 Rafael Louback Ferraz 2010-11-12 02:16:56 CET
(In reply to comment #30)
> Can you reproduce (again) with the latest 260.19.21 from
> rpmfusion-nonfree-updates-testing?
> If yes, then please report this to nvnews.net; there is nothing I can do about
> this as a package maintainer.
> 
> Thx
> 

I think the nvidia guys won't reply to this, because it is a documented bug. It is Fedora that needs to know what the vmalloc size is supposed to be. The nvidia driver broke after a kernel update, so maybe the vmalloc size was changed in the kernel config.
Comment 32 Nicolas Chauvet 2010-11-12 18:48:30 CET
It is still weird, because you are the only one affected by the problem.
Is there a non-default option you have set for one kernel module or another?
Comment 33 Rafael Louback Ferraz 2010-11-13 02:44:29 CET
(In reply to comment #32)
> It is still weird, because you are the only one affected by the problem.
> Is there a non-default option you have set for one kernel module or another?
> 

No, no odd option.

Maybe it is the amount of memory on the video card. My card has 512 MB of video memory. Could that be it? Or my specific card?
Comment 34 Nicolas Chauvet 2010-12-14 23:26:17 CET
I haven't seen in this report that you have forwarded this problem to nvidia for support from their side (as described in the nvidia-bug-report.sh tool). Has that been done already?
Comment 35 Rafael Louback Ferraz 2010-12-15 10:23:43 CET
(In reply to comment #34)
> I haven't seen in this report that you have forwarded this problem to nvidia
> for support from their side (as described in the nvidia-bug-report.sh tool).
> Has that been done already?
> 

Are you asking if I sent this bug to nvidia?

Yes, I did, but I think it won't get an answer, because it is a documented bug. Maybe the problem is on the kernel side, which doesn't get the right vmalloc size for this GPU.
Comment 36 Nicolas Chauvet 2011-01-26 12:01:34 CET
BTW, have you tried using kernel-PAE, since you are on i686 IIRC?

You can even try with the latest nvidia driver:
yum install kmod-nvidia-PAE --enablerepo=rpmfusion-nonfree-updates-testing

Thx for your feedback.
Comment 37 Rafael Louback Ferraz 2011-01-26 17:44:35 CET
(In reply to comment #36)
> BTW, have you tried using kernel-PAE, since you are on i686 IIRC?
> 
> You can even try with the latest nvidia driver:
> yum install kmod-nvidia-PAE --enablerepo=rpmfusion-nonfree-updates-testing
> 
> Thx for your feedback.
> 

I updated to Fedora 14 (x86_64), and the problem doesn't exist anymore. So I think it may be related only to the 32-bit architecture. As I said, it may be a problem on the kernel side.



Comment 38 Nicolas Chauvet 2011-02-13 22:16:28 CET
OK, so closing as worksfor'us'