GC overhead limit issues when running the Topic Extractor Node

Hi All,

I ran into a GC overhead limit error while running the Topic Extractor (Parallel LDA) node on ~600K documents. Each document contains roughly 50 terms. Is there a workaround for this problem?

I have a MacBook Pro with a 2.6 GHz Intel Core i7 and 8GB RAM.

I'd appreciate any suggestions to help optimize performance.

 

Hey mvyas2,

Did you try increasing the heap space in the knime.ini file?

https://tech.knime.org/faq#q4_2
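
For reference, here is a rough sketch of what that change looks like, assuming a standard knime.ini where the maximum Java heap is set by an -Xmx line somewhere after -vmargs. The 4g value is only an illustrative guess for an 8GB machine, not a tested recommendation, and the value already in your file may differ from the one shown:

```
-vmargs
-Xmx4g
```

Only the -Xmx line needs to change; leave the other lines in the file as they are and restart KNIME afterwards so the new limit takes effect.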

Cheers,

Julian

Hi Julian,

Thanks for the tip. Changing the heap space worked.

 

Malhar
