Question about the sample sentiment analysis workflow

metinergoktas · May 11, 2024, 11:18pm

Hi all,

I am trying to learn the KNIME platform by inspecting sample workflows. One thing is not clear to me in the sample sentiment analysis workflow, 03_Sentiment_Classification – KNIME Community Hub.

Could someone explain the purpose of the “Extract Table Dimension” and “Java Edit Variable” nodes here? A screenshot is below.

Thanks.

thor_landstrom · May 14, 2024, 4:53pm

Hello @metinergoktas,

The Extract Table Dimension node just gives you the number of rows and columns in a table. The Java Edit Variable node then runs a short script on that information to do a simple calculation, which is essentially:

numRows / 100

This value is passed as a flow variable and used to filter out any terms that occur fewer times than that value, since they are considered insignificant. All those two nodes do is calculate the cutoff point for terms based on how often they occur.
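To make the cutoff idea concrete, here is a rough sketch in plain Python (not KNIME code and not the actual script inside the node). The row count of 2000 is an assumption chosen so that the result matches the value 20 mentioned in this thread; the term counts, the function names, the integer division and the inclusive lower bound are also made up for illustration.

```python
def min_occurrence_cutoff(num_rows: int, divisor: int = 100) -> int:
    """Cutoff = numRows / 100 (integer division is an assumption here)."""
    return num_rows // divisor


def filter_terms(term_counts: dict, cutoff: int) -> dict:
    """Keep only the terms whose count reaches the cutoff (inclusive bound assumed)."""
    return {term: count for term, count in term_counts.items() if count >= cutoff}


# Hypothetical input: 2000 rows, so the cutoff comes out as 20.
num_rows = 2000
cutoff = min_occurrence_cutoff(num_rows)

# Made-up term counts, purely for illustration.
term_counts = {"great": 412, "boring": 57, "zeppelin": 3}
kept = filter_terms(term_counts, cutoff)

print(cutoff)  # 20
print(kept)    # {'great': 412, 'boring': 57}
```

With these made-up numbers, the rare term "zeppelin" falls below the cutoff and is dropped, while the two frequent terms are kept.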

If you look at the ‘Row Filter’ node that receives the flow variable from those two nodes, it uses the variable for the ‘lower bound’ field in range checking. Although the dialog shows 20 there, the output would be the same if you typed in, for example, 1000, because the lower bound is overwritten by the value of the flow variable (which is 20). It may look confusing, but the node does not actually use the number entered in the lower bound field unless you disconnect the flow variable that overwrites it.

If we disconnect that flow variable and keep the lower bound of 1000, we get only 2 rows, namely the terms that appear in the documents more than 1000 times (versus the original 1499 rows).
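The override behaviour described above can be pictured with another small Python sketch. This is only a mental model, not how KNIME is implemented: the function names and the term counts are hypothetical, and the inclusive bound is an assumption.

```python
from typing import Optional


def effective_lower_bound(dialog_value: int, flow_variable: Optional[int] = None) -> int:
    """A connected flow variable wins over whatever is typed in the dialog."""
    return dialog_value if flow_variable is None else flow_variable


def row_filter(term_counts: dict, dialog_value: int, flow_variable: Optional[int] = None) -> dict:
    """Keep terms whose count reaches the effective lower bound (inclusive bound assumed)."""
    bound = effective_lower_bound(dialog_value, flow_variable)
    return {term: count for term, count in term_counts.items() if count >= bound}


# Made-up term counts, purely for illustration.
term_counts = {"movie": 1850, "film": 1320, "great": 412, "zeppelin": 3}

# Flow variable connected (value 20): the number in the dialog does not matter.
same_a = row_filter(term_counts, dialog_value=20, flow_variable=20)
same_b = row_filter(term_counts, dialog_value=1000, flow_variable=20)

# Flow variable disconnected: the dialog value of 1000 is actually used.
strict = row_filter(term_counts, dialog_value=1000)

print(same_a == same_b)  # True
print(sorted(strict))    # ['film', 'movie']
```

In this toy example, typing 20 or 1000 in the dialog gives the same three terms while the variable is connected, and only the two very frequent terms survive once it is disconnected.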

Hope this helps,
TL
