reordered an addition in the kernel, which results in fewer instructions in the GPU ISA code for GCN