GPU overheating and shutting down Microserver

CodeMaster

Expert Member
Joined
Dec 4, 2003
Messages
3,512
Reaction score
70
Location
127.0.0.1
For the last week my N40L Microserver has been randomly shutting down.
I originally thought it was just because I need to replace the UPS battery and just thought it was a dip in power causing it to shut down. When I got a gap to start looking into it, I started with the Event Viewer. Bit since it was just spontaneously shutting down with no BSoD, there was nothing in the events list and no dump files either.

This started making me think it may be hardware and not software.
So I installed "HW Monitor" and straight away saw that the Nvidia GT210 card was overheating. At that stage it was hovering around 70°C, but as soon as I fired up Kodi then it started climbing and quickly climbed over 100°C. I checked the specs on the card, and MAX temp is 105°C, so I guess that's the point at which the card shuts down the PC.
I thought that I had put a silent card in the machine, so I suspected that I may have needed to redo the thermal paste, as the card is over 2 yrs old.
When I whipped it out, it is actually a fan cooled card. So I stuck it in my desktop to check it, and it powered up fine. Fan was spinning no problem, with no extra noise or anything.
With the case still open, temps started out at around 42°C. If I stopped the fan, then temps climbed, and if I let it go then temps came down again. I didn't leave it in for a long time, and also didn't install any drivers. Put it back into the Microserver, and the temps were fine again at idle, but when pushed a bit they climbed.
I managed to keep it running overnight to get some downloading done, but only by keeping the front door open and temps this morning were just under 100°C.

I currently have it running in my desktop at work, using Windows 8.1 drivers, and it's running fine since 8am.
HW Monitor reports it as 55°C min and 61°C max. Fan speed seems to be on average 1% per 1°C - so at 50°C it's around 50% rpm, So it definitely looks like the fan is working fine.

Since it is running fine in my desktop, I am thinking it probably isn't the thermal paste, but I will redo it anyway.
My work desktop has 5 case fans, so the airflow is a hell of a lot better than the Microserver, but as you can see in the screenshot below, all the other temps are well within range, so I don't believe it's an airflow problem. Besides the 12cm rear fan in the Microserver is working, and nothing else has changed with the placement of the server.

I am also going to remove all the Nvidia drivers and software tonight, and just install the Windows certified drivers for the card.

Beside that, what else could I try?
Could it realistically be a driver problem that's causing the high temps in the Microserver?

Capture.PNG
 
Install MSI afterburn so the fan RPM can be changed on its own should it get too hot.

And maybe you screwed up the thermal paste, what method did you use.
 
Sounds like your MS needs a clean. might be the airflow in your server, make sure the master fan is rotating, as well that the front panel is clear of durst.
 
Install MSI afterburn so the fan RPM can be changed on its own should it get too hot.

And maybe you screwed up the thermal paste, what method did you use.

Why would he? :erm:
 
Not deliberately, he could have put on too much by mistake or the heatsink isnt on properly.

no but he never said he reseated the heat sink before the problem? He only said: I am thinking it probably isn't the thermal paste, but I will redo it anyway.

Maybe its still factory paste. I have seen them go bleg over time . Had a 580 that was warming up... paste was shot hardish
 
The Microserver has extremely low air flow due to the compact nature of the server. The GT210 is known to cause overheating and shutdown issues due to this. Google it and swap the card out with something a little more appropriate. Ambient temperatures, hard disks, system fans etc all contribute to heat.
 
You didn't read properly... I only did the thermal paste 10 mins ago!
I know how to apply thermal paste, and after applying Arctic Silver 5 the temp has dropped by a couple °C, but not significant enough to make my think there was a problem with the thermal paste.
Also, like I said the fan is fine. It goes up 1% for every 1°C, so it's running at 90-100% when it's in the 90-100°C range, which I think is fine.

I blew the MS out with compresses air a little while ago, so it is pretty clean. Rear case fan is functioning fine. I don't want to start manually controlling fans just to control temps. It's a media centre after all, and noise is factor. Like I said, nothing has changed physically. That's what makes me think it could be software, but is that possible? I have heard of drivers screwing with GPU's, but my Nvidia drivers don't auto-update, so nothing there changed, plus I updated to newest drivers last night and it didn't help.
 
Last edited:
Download process hacker to see what is using your GPU.

Playing media should only use 5% max.
 
It's not that the GPU is being stressed though, if you check the screenshot, the utilization was pretty low yet the temp was up at 100°C.
At the highest recorded temp of 104°C the utilization was high, but that was only for a short time.

Download process hacker to see what is using your GPU.

Playing media should only use 5% max.
 
Funny you should recommend that... I was just looking at the 1GB version at NIVO
https://www.nivo.co.za/#buy~asus.amd.radeon.hd6450.graphics.card.eah6450.silent.di.1gd3.lp.~p41059

Is a passively cooled card gonna be any better though? How do the AMD cards compare to Nvidia from a heat perspective?


The card is running fine in my desktop though, so it must either be a software/driver issue, or it's the ariflow.
It's the only variables that are different.

Capture2.JPG
 
Last edited:
OK, so it looks like replacing the thermal compound has done the trick. Temps are sitting between 62-70°C, which I am OK with considering it is in a Microserver, which is housed in a TV stand cabinet.
I did also un-install all Nvidia software and drivers and revert to default certified Windows drivers, but the temps were stable before I did that anyway.
 
Top
Sign up to the MyBroadband newsletter
X