Running into a similar issue on my end. I have created fingerprints of 200K molecules and I am running a distance matrix calculation on these as well. In my case I chose to 'write tables to disk' as the memory policy, problem is KNIME wrote to 120GB of disk space. I understand that each fingerprint is taking up 512 bytes, and I am calculating 200K rows by 300+ columns but I dont think that would result in 120 GB file.
One question I have, is it possible to change the location of where KNIME writes its temp files as I could mount a larger drive and brute force this through?
Well, the distance matrix for 200,000 molecules has 200,000*200,000/2 entries. Given that each entry is a double value with 8 Bytes this gives 149GB. The compression brings it down to 120GB. You can change the temp location in the KNIME preferences (File->Preferences->KNIME).
Thank you for the response Thor, yup you are right about the size I did not multiply by 8 bytes per double. The ability to change the write location of the temp files is key, thank you so much I was getting a bit frustrated!