Detecting Overclaiming hosts made easy, and the new credit system

Message boards : Number crunching : Detecting Overclaiming hosts made easy, and the new credit system

To post messages, you must log in.

AuthorMessage
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 22030 - Posted: 8 Aug 2006, 16:17:47 UTC
Last modified: 8 Aug 2006, 16:19:44 UTC

The intent of this post is not to start a flame war. Rosetta/Ralph/users are questioning methods for determining what claims are from standard hosts and which are from third party hosts. Note: optimized clients in and of themselves aren't bad, they just need to be properly compiled so that benchmarks are "inline" with the standard client when it comes to system types. To date I know of none who are "inline", although some come closer than others.

What follows is a spreadsheet of all the officially made Boinc CC's, which OS they were made for and release dates:




At the bottom is a list of known third party optimized apps. Weeding out the third party results isn't that tough, since many of the app names don't correspond with the "official list" as it pertains to OS. For Example;
Any result from 5.5.0 can be discarded as the official 5.5.0 was for Linux only and then only a developmental version (never released to the public as a "recommended" version. Also, I've yet to come across one in my studies).
Any result from 5.3.6 and Linux can be discarded since it doesn't officially exist. Neither does 5.2.12 with any OS, or 5.2.14 when used with Linux. Now it gets a bit more complicated, but applying the above filters will eliminate 75% of them. Trux and I believe some versions of Boincstudio have a calibration feature which allow the users to turn on/off the calibration, so a comparison of benchmarks to known standard boinc clients will help determine whether or not overclaiming is happening. The same is true with 5.2.11, 5.2.13, and 4.45, as they have both standard and optimized versions. My earlier study of the hosts of users who posted can be found here and by weeding out the easy ones shown above, what's left is a small sampling of many Hosts which use standard boinc benchmarks and by comparing the suspect one to the others, they can be weeded out.

To make it even easier, just weed out all the host info for all of them,including what might be valid benchmarks and you have a list of reliable benchmarks. Then apply the claimed credit formula and you have a pretty good example of what claimed credit per hour for each individual system should be. Now data needs to be collected on the number of models done per hour by those hosts and a reasonably accurate credit/model/cpu type can be established. Rosetta has access to the DB and is in the best position to do this as they "have all the data at their finger tips".

To determine Project Parity, simply take the remaining hosts, apply the claimed credit score and go with that. My Cross project comparision (an ongoing work) shows most projects grant nearly what the official claimed credit is supposed to be. Note: Seti and Einstien are still trying to figure out the best way to acheive parity and will be changing their credit systems.

[edit]overclaiming can also be done by editing XML files, so a comparison is always handy, but if you apply the above, you'll quickly begin to see what "normal" benchmarks look like and spot differences.
ID: 22030 · Rating: 1 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Keck_Komputers
Avatar

Send message
Joined: 17 Sep 05
Posts: 211
Credit: 4,246,150
RAC: 0
Message 22032 - Posted: 8 Aug 2006, 16:43:25 UTC

Not to worry a new plan for determining credit is already in testing at RALPH.
BOINC WIKI

BOINCing since 2002/12/8
ID: 22032 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 22034 - Posted: 8 Aug 2006, 16:51:45 UTC

That's what I'm talking about John, determining a method for calculating the best cobblestone/model possible and still maintain cross project parity.

I applied my own suggestion to my old data base, by eliminating results from any host that could possibly be overclaiming with software. Here's the result:







See how a range starts to appear for a given CPU? Now imagine if the project did this with the entire data base. I imagine they'd have a pretty good idea of what benchmarks and then claimed credit/hour for a given host should be. Then extract data about how many models/hour each does and make the adjustment.
ID: 22034 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MikeMarsUK

Send message
Joined: 15 Jan 06
Posts: 121
Credit: 2,637,872
RAC: 0
Message 22051 - Posted: 8 Aug 2006, 18:57:36 UTC

Very interesting document, thanks for posting it. I see that see that overclocking by 43% only results in ~35% increase in credits/hour, guess the memory bandwidth is the limiting factor! I'm looking forwards to the new scheme turning up so I can finally compare my two boxes properly. Credit based on models processed is I think the correct way to go.

(Incidentally, I changed the CPU from a 3000 to a X2 4600 (overclocked to 2530MHz) on the 1st, so the most recent figures on the same host ID, 138077, will be a bit confused. As long as you collected the data before that date it should still be accurate).

Boinc version 5.3.19 was the first BBC/CCE version (I think this has been replaced by a 5.4.9-based version now, but don't know the timestamps for either, although you can download it from bbc.cpdn.org). I think both are windows-only but may be wrong.

ID: 22051 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1831
Credit: 119,448,284
RAC: 11,030
Message 22062 - Posted: 8 Aug 2006, 22:02:19 UTC

out of curiosity, how come two of mine are different colours on there? is that to do with them running old BOINC versions? (they're both remotes so i haven't bothered upgrading them).
cheers
Danny
ID: 22062 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 22066 - Posted: 8 Aug 2006, 22:36:45 UTC - in response to Message 22062.  
Last modified: 8 Aug 2006, 22:38:56 UTC

out of curiosity, how come two of mine are different colours on there? is that to do with them running old BOINC versions? (they're both remotes so i haven't bothered upgrading them).
cheers
Danny

Here's all your hosts in my DB:


I organized boinc versions by colors to make them easier to distinquish. You have a mix of Boinc Versions. The Sempron didn't have any completed results that I could check the Result ID on, so I used an "x".

does this help?

tony

this data was extracted roughly 18 days ago, so things may have changed.
ID: 22066 · Rating: -0.99999999999999 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Haltech

Send message
Joined: 4 Jan 06
Posts: 18
Credit: 351,352
RAC: 0
Message 22093 - Posted: 9 Aug 2006, 4:03:56 UTC

Can i pick my own colors? How about BRIGHT lime ALIEN green! I really want to stick out on your spreadsheet!
SETI...Grown Men Looking For ET's That Dont Exist... Do You Dress Up Like Starwars Characters Too?
ID: 22093 · Rating: 9.9920072216264E-15 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 653
Credit: 11,840,739
RAC: 274
Message 22129 - Posted: 9 Aug 2006, 15:12:03 UTC

Odd, in that table, my "Evesham" node, (number 149), is still showing half Gig of RAM - I noticed this in an earlier table Tony posted. I have stripped that machine down, pulled and re-inserted the RAM sticks, and memtest86 assures me that there is 1GB in there, and that it is 100% good.

Perhaps like the processor type, (it is a P-IV Northwood core 2.533GHz), NT4 reports it weird?
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 22129 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 22133 - Posted: 9 Aug 2006, 15:20:56 UTC - in response to Message 22129.  

Odd, in that table, my "Evesham" node, (number 149), is still showing half Gig of RAM - I noticed this in an earlier table Tony posted. I have stripped that machine down, pulled and re-inserted the RAM sticks, and memtest86 assures me that there is 1GB in there, and that it is 100% good.

Perhaps like the processor type, (it is a P-IV Northwood core 2.533GHz), NT4 reports it weird?

Adrian, this is the same data, just mixed different. It's 19 days old now. It shows 1M now.

Memory 1023.11 MB
ID: 22133 · Rating: -1 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 653
Credit: 11,840,739
RAC: 274
Message 22135 - Posted: 9 Aug 2006, 15:29:05 UTC
Last modified: 9 Aug 2006, 15:39:37 UTC

Ah, I assumed it was new data as it was dated yesterday. Cheers Tony.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 22135 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Detecting Overclaiming hosts made easy, and the new credit system



©2024 University of Washington
https://www.bakerlab.org