|
2b6e8416
|
2011-06-29T23:38:16
|
|
Use a buffer of up to 512 * 4 integers when retrieving work from the GPU.
This allows each local thread id to have one slot to put any positive results into, thus making overlapping results far less likely.
Thus races will be much rarer, allowing more threads.
It should also pick up blocks close to each other more reliably and hopefully decrease the number of rejects and opencl errors.
Do the search over the buffer entirely in a separate thread to allow the GPU to stay as busy as possible.
Detach threads from themselves to prevent unlucky even where dereferencing occurs by freeing the data that stores the thread info.
|
|
a45c54aa
|
2011-06-27T11:31:05
|
|
Make postcalc_hash asynchronous as well.
|
|
e1dd27c5
|
2011-06-29T11:19:43
|
|
Ensure that we don't overflow due to 32 bit limitations.
|
|
6b77d850
|
2011-06-17T14:00:41
|
|
Fixes.
|
|
f117675a
|
2011-06-22T10:15:23
|
|
Optimise work loop to make cl calls asynchronous where possible.
|
|
dde70397
|
2011-06-14T10:32:54
|
|
Merge gpumining from oclmine. Unstable.
|