I’m replicating the Google NMT benchmarking for English–> Vietnamese, using KNIME example as my guide. I have successfully added the data sets for English and Vietnamese. However, the python script that does the “index encoding and padding” seems to be stuck. (waited for more than 4hrs to finish).
Row count of the table: 132,837
The script seems to run fine inside the KNIME node and in Jupyter Notebook. (See Attached screen shots). KNIME Log does not show much and I’m not sure what is causing this.