I’d like to extract an embedding object from a lot of Word documents. Before doing a loop I try to make a workflow with 1 file. To realize it, I unzip (with "unzip files" node) the .doc or .docx and then I select the file I need. This file is a .bin which have to be unzip to obtain the object I need.
Currently I can isolate the .bin file but when I try to unzip it, the "unzip files" node doesn’t afford the whole files. It appears that I only obtain the starting file with no extension. I tried to move the .bin file and change its extension to .zip but "unzip files" node gave the same result. However if I try to unzip the same file (.zip, .bin or without extension) manually (eg with 7zip) I obtain the files I’m looking for…
Many thanks for your answer. I came to the same conclusion.
It also happens with ".doc" files which can't be unzipped with the node, while it works fine with ".docx".
I was searching to integrate 7zip in the workflow but still had no real solution, so I'm glad somebody offer me an idea, thanks !
I'm wondering how insert it direclty in the workflow, maybe with java ? But don't know java at all...