GC overhead limit issues when running the Topic Extractor Node

Hi All,

I ran into a GC overhead limit while running the topic extractor (parallel LDA) node on ~600K documents.  Each document probably contains about 50 terms.  Is there a possible workaround to this problem.  

I have a macbook pro 2.6 GHz Intel Core i7 with 8GB RAM.

Appreciate any suggestions to help optimize the performance.

 

Hey mvyas2,

did you try to increase the heapspace in the knime.ini file?

https://tech.knime.org/faq#q4_2

Cheers,

Julian

Hi Julian,

Thanks for the tip, changing the heapspace worked.

 

Malhar