After much ado, I have set up my new machine to leverage CUDA via DL4J in KNIME, but I noticed that the classifications using the GPU are not working correctly as compared to CPU when I select the GPU option for DL4J.
I have confirmed that that CUDA 8.0 is installed and works. I am running on an Nvidea RTX 2060 GPU, and the task manager shows some GPU utilization during training. The results of the training put all the training set into a single class though.
If I run it on CPU, the examples seem to run correctly. If I monitor the Backprop loss, the coefficients coming off the GPU are all zeros, versus some actual number from the CPU.
Is there something I am missing???