I just found that on my (pretty expensive) RadoneRadeon 6970, only cl_amd_fp64
extension is supported. I am getting odd results in some parts of the code (accessing the value of 0.005
actually uses 1.99916e+37
?) when running with cl_amd_fp64
. Using cl_khr_fp64
with Intel SDK on the CPU works just fine. (The input buffers are exactly the same)
The extension page gives very little information.
What are exactly the differences between both?