ocl.c


Log

Author Commit Date CI Message
Con Kolivas 7d53fba1 2012-07-21T02:49:50 Reinstate GPU only opencl device detection.
Con Kolivas d13a3f1d 2012-07-21T02:47:27 Decrease lookup gap to 1. Does not seem to help in any way being 2.
Con Kolivas d72add9a 2012-07-20T16:16:18 Send correct values to scrypt kernel to get it finally working.
Con Kolivas 3e61db10 2012-07-18T21:58:27 Create command queue before compiling program in opencl.
Con Kolivas 471daecb 2012-07-16T17:05:08 Initialise mdplatform.
Con Kolivas 428d5e5d 2012-07-16T13:22:35 Limit scrypt to 1 vector.
Con Kolivas a9a0bba1 2012-07-16T11:53:18 Set the correct data for cldata and prepare for pad8 fixes.
Con Kolivas 04edf4bf 2012-07-15T13:40:56 Temporarily set opencl to use all devices to allow debugging of scrypt kernel rapidly.
Con Kolivas 53e9c61c 2012-07-15T13:40:11 Find the gpu platform with the most devices and use that if no platform option is passed.
Con Kolivas 884f83f3 2012-07-15T13:31:03 Allow more platforms to be probed if first does not return GPUs.
Con Kolivas 243d005b 2012-07-14T16:21:27 Set scrypt settings and buffer size in ocl.c code to be future modifiable.
Con Kolivas aabc7233 2012-07-14T00:30:25 Make sure goffset is set for scrypt and drop padbuffer8 to something manageable for now.
Con Kolivas e0296c41 2012-07-13T21:35:25 Set up buffer8 for scrypt.
Con Kolivas 0f43eb5e 2012-07-13T20:35:44 Don't test nonce with sha and various fixes for scrypt.
Con Kolivas b085c338 2012-07-13T20:28:36 Make scrypt buffers and midstate compatible with cgminer.
Con Kolivas dd740caa 2012-07-13T19:02:43 Provide initial support for the scrypt kernel to compile with and mine scrypt with the --scrypt option.
Philip Kaufmann f479be07 2012-04-27T08:29:56 add goffset support for diakgcn with -v 1 and update kernel version
Con Kolivas 9a3ae266 2012-04-27T10:22:53 Add support for latest ATI SDK on windows.
Con Kolivas bb319883 2012-04-25T11:41:35 Detect poorly performing combination of SDK and phatk kernel and add verbose warning at startup.
Con Kolivas 9175e4f2 2012-04-23T17:56:31 Display all OpenCL devices when -n is called as well to allow debugging of differential mapping of OpenCL to ADL.
Con Kolivas 6274fbe7 2012-03-30T09:32:42 Change the preferred vector width to 1 for Tahiti only, not all poclbm kernels.
Con Kolivas 621bcca7 2012-03-27T22:10:17 Use global offset parameter to diablo and poclbm kernel ONLY for 1 vector kernels.
Con Kolivas 39395eb1 2012-03-27T19:58:51 Use poclbm preferentially on Tahiti now regardless of SDK.
Con Kolivas edb070c8 2012-02-24T13:31:29 Fixes.
Con Kolivas fb077c6d 2012-02-24T13:27:15 Pass vectors * worksize to kernel to avoid one op.
Con Kolivas 709c4cd8 2012-02-23T20:24:32 Use diablo kernel on all future SDKs for Tahiti and set preferred vector width to 1 on poclbm kernel only.
ckolivas dfcb98de 2012-02-23T00:45:40 Use the SDK and hardware information to choose good performing default kernels.
ckolivas d3ad87f5 2012-02-22T20:13:23 Allow writing of multiple worksizes to the configuration file.
ckolivas 1b1fa5cd 2012-02-22T20:08:29 Allow writing of multiple vector sizes to the configuration file.
ckolivas 994cd775 2012-02-22T20:01:09 Allow writing of multiple kernels to the configuration file.
ckolivas 93efb726 2012-02-22T19:38:01 Allow multiple different kernels to be chosen per device.
ckolivas a54f7606 2012-02-22T19:00:44 Fix multiple work size entry.
Con Kolivas 26c59fbf 2012-02-22T16:59:28 Allow the worksize to be set per-device.
Con Kolivas deff55c6 2012-02-22T16:54:06 Allow different vectors to be set per device.
Con Kolivas bf3a9f94 2012-02-22T14:42:20 Unintentionally dropped the device name from the binary filenames. Reinstate.
Con Kolivas 5d23d70f 2012-02-22T14:14:26 As all kernels will be new versions it's an opportunity to change the .bin format and make it simpler. Specifying bitalign is redundant and long can be l.
Con Kolivas d1cddf8b 2012-02-21T22:23:07 Update licensing to GPL V3.
Con Kolivas 00290a3e 2012-02-21T21:31:31 Select diablo kernel on all but GCN+SDK 2.6.
Con Kolivas e9c3d730 2012-02-19T18:32:56 Tahiti prefers worksize 64 with poclbm.
Con Kolivas 30936f17 2012-02-18T23:28:41 No need to expressly retain the opencl program now that the zero binary issue is fixed.
Con Kolivas 810ad045 2012-02-18T23:16:08 More copyright updates.
Con Kolivas 22d3034e 2012-02-18T23:13:45 Show error code on any opencl failure status.
Con Kolivas be9db9ce 2012-02-18T23:00:21 Copyright updates.
Con Kolivas 0b6e35cd 2012-02-18T22:49:49 Add detection for version 898.1 SDK as well but only give SDK 2.6 warning once on startup instead of with each device initialisation.
Con Kolivas 67c4ada1 2012-02-16T01:10:11 Provide warning on each startup about sdk 2.6 and decrease poclbm kernel selection to LOG_INFO.
Con Kolivas b4c86ba6 2012-02-16T00:48:34 Give SDK 2.6 warning only on building a kernel for !GCN bitalign devices.
Con Kolivas 728e3d43 2012-02-16T00:43:05 Revert "Automatically choose phatk kernel for bitalign non-gcn ATI cards, and then only select poclbm if SDK2.6 is detected." This reverts commit e7fdadfc8fc388f68772d5a4c2740da60287c889. Broke kernel loading.
ckolivas e7fdadfc 2012-02-15T14:52:29 Automatically choose phatk kernel for bitalign non-gcn ATI cards, and then only select poclbm if SDK2.6 is detected.
ckolivas 6a785946 2012-02-15T14:47:02 Make SDK 2.6 warning and advice big and bold.
Con Kolivas 23c01bc7 2012-02-13T13:19:04 Make output buffer write only as per Diapolo's suggestion.
Con Kolivas b2b5083b 2012-02-13T12:22:35 Microoptimise phatk kernel on return code.
Con Kolivas fd05341a 2012-02-13T10:39:26 Do not loop indefinitely setting poclbm kernel to load a binary.
Con Kolivas d689cfbd 2012-02-13T10:06:26 Try to load a binary if we've defaulted to the poclbm kernel on SDK2.6
Con Kolivas 3057b701 2012-02-13T09:59:29 Use the poclbm kernel on SDK2.6 with bitalign devices only if there is no binary available.
Con Kolivas 2c33f122 2012-02-13T08:34:44 Whitelist ATI SDK 2.6 to use the poclbm kernel by default.
Con Kolivas fb99c8d5 2012-02-12T21:38:45 The longstanding generation of a zero sized binary appears to be due to the OpenCL library putting the binary in a RANDOM SLOT amongst 4 possible binary locations. Iterate over each of them after building from source till the real binary is found and use that.
Con Kolivas 56907db2 2012-02-12T18:21:30 Fix harmless warnings with -Wsign-compare to allow cgminer to build with -W.
Con Kolivas 405a2120 2012-02-11T20:11:18 Remove unnecessary check for opt_debug on every invocation of applog at LOG_DEBUG and place the check in applog().
Con Kolivas 60c70145 2012-02-11T16:41:41 Retain cl program after successfully loading a binary image.
Con Kolivas 55bd031d 2012-02-11T16:38:55 Variable unused after this so remove setting it.
Con Kolivas 1c1b8bec 2012-02-11T15:58:07 BFI INT patching is not necessarily true on binary loading of files and not true on ATI SDK2.6+. Report bitalign instead.
ckolivas f2d5db0c 2012-02-10T16:45:35 Use only working kernels by default.
ckolivas 59d3d011 2012-02-10T14:33:40 Implement diablo kernel support and try to make it work.
ckolivas 95a989da 2012-02-10T13:18:16 Conflicting entries of cl_kernel may have been causing problems, and automatically chosen kernel type was not being passed on. Rename the enum to cl_kernels and store the chosen kernel in each clState.
ckolivas e6cf96ad 2012-02-10T10:28:45 ALlow much longer filenames for kernels to load properly.
ckolivas 4822cca7 2012-02-10T10:23:06 Allow different kernels to be used by different devices and fix the logic fail of overcorrecting on last commit with !strstr.
Con Kolivas 196e8a0f 2012-02-10T09:10:57 Fix kernel selection process and build error.
Philip Kaufmann 47a09cea 2012-02-09T15:15:03 added OpenCL >= 1.1 detection code, in preparation of OpenCL 1.1 global offset parameter support
ckolivas cb7145b1 2012-02-08T13:45:56 Add basic build ability with diakgcn and put all kernel names in configure.ac to avoid changing them in mutliple places.
ckolivas 6776b0ea 2012-02-10T16:45:35 Use only working kernels by default.
ckolivas 2270b4e0 2012-02-10T14:33:40 Implement diablo kernel support and try to make it work.
ckolivas 02c94272 2012-02-10T13:18:16 Conflicting entries of cl_kernel may have been causing problems, and automatically chosen kernel type was not being passed on. Rename the enum to cl_kernels and store the chosen kernel in each clState.
ckolivas 35ea31b1 2012-02-10T10:28:45 ALlow much longer filenames for kernels to load properly.
ckolivas 8af2365e 2012-02-10T10:23:06 Allow different kernels to be used by different devices and fix the logic fail of overcorrecting on last commit with !strstr.
Con Kolivas 2b23805e 2012-02-10T09:10:57 Fix kernel selection process and build error.
Philip Kaufmann ed7210af 2012-02-09T15:15:03 added OpenCL >= 1.1 detection code, in preparation of OpenCL 1.1 global offset parameter support
ckolivas a6c6866a 2012-02-08T13:45:56 Add basic build ability with diakgcn and put all kernel names in configure.ac to avoid changing them in mutliple places.
ckolivas 53c1e9ae 2012-02-04T15:15:57 Allow the OpenCL platform ID to be chosen with --gpu-platform.
ckolivas a4f47812 2012-02-04T14:47:23 Iterate over all platforms displaying their information and number of devices when --ndevs is called.
Con Kolivas ebaa2be1 2012-02-03T18:19:39 Update poclbm kernel for better performance on GCN and new SDKs with bitalign support when not BFI INT patching. Update phatk kernel to work properly for non BFI INT patched kernels, providing support for phatk to run on GCN and non-ATI cards.
Con Kolivas 82af288e 2012-01-29T22:57:29 Revert "Fix various harmless warnings." This reverts commit a4b67f030fc0c7e2b18e79114a441c1e1617d5f8.
Con Kolivas a4b67f03 2012-01-29T21:06:17 Fix various harmless warnings.
Con Kolivas b8f845b4 2012-01-29T16:43:38 Display information about the opencl platform with verbose enabled.
ckolivas 5d5584f8 2012-01-29T16:31:03 Explicitly check for nvidia in opencl platform strings as well.
Con Kolivas a3d90f84 2012-01-29T11:01:17 Default to poclbm kernel on Tahiti (7970) since phatk does not work, even though performance is sub-standard so that at least it will mine successfully by default.
Con Kolivas 31f6e8c7 2012-01-28T17:06:28 Unset prog_built after it is patched because it needs rebuilding.
Con Kolivas 1e503549 2012-01-28T16:29:19 Retain cl program after every possible place we might build the program.
Con Kolivas 25caca90 2012-01-28T16:26:53 Revert "Don't explicitly retain the cl program as it is of no benefit to do so and may lead to problems when trying to release the program." This reverts commit 32910463a3124265b56aca48a6c0fbb107ccfb70. Turns out this does help.
Con Kolivas 32910463 2012-01-26T20:53:35 Don't explicitly retain the cl program as it is of no benefit to do so and may lead to problems when trying to release the program.
Con Kolivas d18d5564 2012-01-26T20:39:35 Do not attempt to build the program that becomes the kernel twice. This could have been leading to failures on initialising cl.
Con Kolivas c87460b3 2012-01-26T19:42:57 Typo.
Con Kolivas 2ecabd85 2012-01-26T19:38:15 Some opencl compilers have issues with no spaces after -D in the compiler options.
Con Kolivas 77e9b1c2 2012-01-26T13:06:39 Use calloced stack memory for CompilerOptions to ensure sprintf writes to the beginning of the char.
Con Kolivas d7aac254 2012-01-26T11:44:42 Whitelist 79x0 cards to prefer no vectors as they perform better without.
Con Kolivas 3d4cfce8 2012-01-24T20:23:44 Instead of using the BFI_INT patching hack on any device reporting cl_amd_media_ops, create a whitelist of devices that need it. This should enable GCN architectures (ATI 79xx cards) to work properly.
Con Kolivas 6442c1ab 2012-01-22T20:36:57 Style police.
Con Kolivas 0719d407 2012-01-22T17:09:06 Clean up on failure to load a binary kernel.
Con Kolivas fb0c580b 2011-10-15T13:29:44 Go to kernel build should we fail to clCreateProgramWithBinary instead of failing on that device. Should fix the windows problems with devices not initialising.
Con Kolivas 2053de6d 2011-09-06T10:11:34 Add the directory name from the arguments cgminer was called from as well to allow it running from a relative pathname.
Con Kolivas 5848c110 2011-08-29T00:16:58 Confusion over the variable name for number of devices was passing a bogus value which likely was causing the zero sized binary issue.