The NGram node has two options. The first is “Number of maximal parallel processes”, which also appears in some other nodes. By what criteria should its value be chosen for the machine the workflow runs on? Is this option related to a parallel environment such as Hadoop or a data center, or is it related to the cores of an individual CPU?
The second is “Number of documents per process”. By what criteria should that number be chosen?