Monday, June 8, 2009

9800GT strikes again!

It has been a long weekend in NSW (Queens Birthday) and my Ubuntu 9.04 AMD X2 with the 9800GT has been on since Friday morning. Sometime on Sunday the GPU card went mindless and returned 240 SETI work-units with 60 something seconds of work and "The number of results detected exceeds the storage space allocated". It also returned 11 GPUGrid work units with a multitude of "unspecified launch failure" errors.
I thought that this machine is no longer on a UPS due to the GPU's power requirements and maybe a power glitch caused the problem. I have seen a fair number of entries on the UPS logs over weekends. There is, however, a Windows machine with a 9400GT on the same supply and it seems to be unaffected, so I think that ruins my theory.
The 240 trashed units actually moved me to go into the office on a public holiday and re-start the machine. There have been no work units returned to either SETI or GPUGrid since. I'm hoping this just means that there hasn't been a need to report completed units yet ... either that or some serious PC surgery awaits me tomorrow morning!
I'm running out of ideas to get some reliability out of this machine! It isnt overclocked, GPU seems to run around 70'C and the CPU work units seem to have no problems (which in my mind rules out HDD/RAM/Motherboard issues)

No comments:

Post a Comment