WU stuck

Forum rules
davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#1 WU stuck

Unread post by davidBAM » Mon Dec 10, 2018 8:27 pm

Grrrrrr - just aborted a Collatz WU which had become stuck! 18hrs of wasted time on an AMD RX580.

Anyone else seen that kind of problem? I only noticed when I saw that that machine hadn't uploaded to Collatz website in a while
I think this is fool-proof but could you just try it for me please? • There are 10 types of people in the world; those who understand binary, and those who don’t

User avatar
Alez
[ TSBT's Pirate ]
[ TSBT's Pirate ]
Posts: 8953
Joined: Thu Oct 04, 2012 1:22 pm
Location: roaming the planet

#2 Re: WU stuck

Unread post by Alez » Tue Dec 11, 2018 12:10 pm

Some projects are notorious for this issue. Collatz, strangely, is not one of them.
The best form of help from above is a sniper on the rooftop....
Image

davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#3 Re: WU stuck

Unread post by davidBAM » Tue Dec 11, 2018 12:27 pm

I wonder if Boinctasks have the capability (or any plans) to spot this kind of thing? It would seem (relatively) easy to do on Linux clients. Hmmmm
I think this is fool-proof but could you just try it for me please? • There are 10 types of people in the world; those who understand binary, and those who don’t

User avatar
Alez
[ TSBT's Pirate ]
[ TSBT's Pirate ]
Posts: 8953
Joined: Thu Oct 04, 2012 1:22 pm
Location: roaming the planet

#4 Re: WU stuck

Unread post by Alez » Wed Dec 12, 2018 9:01 am

Scole wrote a program to detect and abort stuck units on another project. It's on here somewhere. I'll look for it some time today, if I get a chance. Most of today should hopefully be spent travelling.

davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#5 Re: WU stuck

Unread post by davidBAM » Thu Dec 13, 2018 11:34 pm

davidBAM wrote:
Mon Dec 10, 2018 8:27 pm
Grrrrrr - just aborted a Collatz WU which had become stuck! 18hrs of wasted time on an AMD RX580.

Anyone else seen that kind of problem? I only noticed when I saw that that machine hadn't uploaded to Collatz website in a while
Ditto - also rx580 (on t1500 this time in case it happens again)

User avatar
scole of TSBT
Boinc Brigadier
Boinc Brigadier
Posts: 3928
Joined: Mon Feb 03, 2014 2:38 pm
Location: Goldsboro, (Eastern) North Carolina, USA

#6 Re: WU stuck

Unread post by scole of TSBT » Thu Dec 13, 2018 11:45 pm

Alez wrote:
Wed Dec 12, 2018 9:01 am
Scole wrote a program to detect and abort stuck units on another project. It's on here somewhere. I'll look for it some time today, if I get a chance. Most of today should hopefully be spent travelling.
https://tsbt.co.uk/forum/viewtopic.php?f=172&t=2927
Image

davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#7 Re: WU stuck

Unread post by davidBAM » Fri Dec 14, 2018 7:45 am

It was happening on every Collatz WU on that machine. I have reset the project to see if it helps

davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#8 Re: WU stuck

Unread post by davidBAM » Fri Dec 14, 2018 8:18 am

Hmmm - I think the card may be faulty

User avatar
Alez
[ TSBT's Pirate ]
[ TSBT's Pirate ]
Posts: 8953
Joined: Thu Oct 04, 2012 1:22 pm
Location: roaming the planet

#9 Re: WU stuck

Unread post by Alez » Fri Dec 14, 2018 1:21 pm

It may be or it could be something else. Try running something other than collatz and see.
I have an AMD 7970 that locks up the entire system on moo wrapper almost immediately, but runs everything else fine. I can't find why it won't run moo, it just wont.

davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#10 Re: WU stuck

Unread post by davidBAM » Fri Dec 14, 2018 4:45 pm

It stuck permanently on a PrimeGrid WU as well. Crept up to 100% eventually but then just sat there

davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#11 Re: WU stuck

Unread post by davidBAM » Fri Dec 14, 2018 11:24 pm

I reckon I've got to the bottom of it. My theory is that the RX580 doesn't like running (under Linux) in an x79 motherboard. It works fine in others.

I also note that the optimisation needs to go in 2 places to be effective on all Collatz WU
/var/lib/boinc-client/projects/boinc.thesonntags.com_collatz/collatz_sieve_1.40_x86_64-pc-linux-gnu__opencl_ati.config
/var/lib/boinc-client/projects/boinc.thesonntags.com_collatz/collatz_sieve_1.40_x86_64-pc-linux-gnu__opencl_ati_gpu.config

User avatar
scole of TSBT
Boinc Brigadier
Boinc Brigadier
Posts: 3928
Joined: Mon Feb 03, 2014 2:38 pm
Location: Goldsboro, (Eastern) North Carolina, USA

#12 Re: WU stuck

Unread post by scole of TSBT » Fri Dec 14, 2018 11:54 pm

Is the BIOS up to date?
Image

davidBAM
Boinc Colonel
Boinc Colonel
Posts: 1943
Joined: Wed Aug 15, 2018 1:15 pm
Location: Huntly, Scotland
Contact:

#13 Re: WU stuck

Unread post by davidBAM » Sat Dec 15, 2018 12:04 am

I'll need to check - they were cheap boards off of eBay so I am probably paying the real price now. Mind you - they work fine with nVidia

User avatar
Alez
[ TSBT's Pirate ]
[ TSBT's Pirate ]
Posts: 8953
Joined: Thu Oct 04, 2012 1:22 pm
Location: roaming the planet

#14 Re: WU stuck

Unread post by Alez » Sat Dec 15, 2018 12:49 am

davidBAM wrote:
Sat Dec 15, 2018 12:04 am
I'll need to check - they were cheap boards off of eBay so I am probably paying the real price now. Mind you - they work fine with nVidia
A very common scenario, linux and nVidia seems pretty much fine, AMD not so much.

Post Reply Previous topicNext topic

Return to “Collatz Conjecture”

Who is online

Users browsing this forum: No registered users and 1 guest