Filename as column

Lawson · May 23, 2017, 2:48am

Dear sir

I am doing text processing by importing a few hundred text documents via the Flat File Reader node, and I would like to add the filename as one of the columns so that I can join the result with tables output by other Flat File Reader nodes. Can you suggest a way of doing this?

Thanks
Lawson

swebb · May 23, 2017, 9:48am

There are multiple ways to do this. You could use the List Files node to list all the files and then iterate over this table to import them.

Or, if you are manually configuring the reader, you can output the file path via a flow variable by naming it on the Flow Variables tab of the dialog.

Cheers

Sam

Lawson · May 23, 2017, 10:57am

Thanks Sam.

Regarding the flow variable option, does it only provide the file path, not the filename? Or do I need to use another node to extract the filename separately from the file path?

Thanks
Lawson

izaychik63 · May 23, 2017, 3:14pm

The URL to File Path node will extract the name for you.

swebb · May 23, 2017, 3:45pm

Sorry, I didn't pay enough attention earlier. You will need to extract the filename from the path, using for example the URL to File Path node as above, or a regex.
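
For anyone who wants to see the idea outside of a node, here is a minimal Python sketch of both routes: path parsing and a regex. It is only an illustration and does not claim to show what the URL to File Path node does internally; the function names and the example locations are made up.

```python
import re
from pathlib import PurePosixPath
from urllib.parse import unquote, urlparse


def filename_from_url(location: str) -> str:
    """Return the last path component of a file URL or a forward-slash path."""
    # urlparse drops the "file:" scheme; unquote turns %20 back into a space
    path = unquote(urlparse(location).path)
    return PurePosixPath(path).name


def filename_by_regex(location: str) -> str:
    """Regex route: take everything after the last slash or backslash."""
    match = re.search(r"[^/\\]+$", location)
    return match.group(0) if match else ""


# Hypothetical example locations
print(filename_from_url("file:/C:/data/report_2017.csv"))
print(filename_by_regex("C:\\data\\report_2017.csv"))
```

The regex version also copes with Windows-style backslashes, which the path-parsing version above does not attempt to handle.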

armand_ink · May 24, 2017, 6:59pm

Hi Sam,

Thanks for suggesting the URL node. I am new to KNIME and could use some help on the same topic as well.

I have multiple .csv files so I created the following nodes:

List Files > URL to File Path > Table Row To Variable > CSV Reader > Loop End.

I can see the filename as a flow variable. How can I get it to show up as a column header in the data?

Thanks,

Armand

Andi · May 25, 2017, 11:29am

Hi armand_ink,

I don't know if this answers your question, but I solved it this way...

I used Variable to Table Row to extract only the URL, then the URL to File Path node, and after that the Cross Joiner to merge both tables (all inside the loop).

After the Loop End I got a table with the filename/location for every row.

filenames-as-row.png
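
The same pattern (list the files, read each one in a loop, attach the filename to every row, collect everything at the end) can be sketched in plain Python. This is a rough equivalent under assumptions, not the workflow in the screenshot: the function name `read_with_filename`, the column name `filename`, and the `*.csv` pattern are all made up for illustration.

```python
import csv
from pathlib import Path


def read_with_filename(folder, pattern="*.csv"):
    """Read every matching CSV in `folder` and tag each row with its source file."""
    rows = []
    # Listing the files and iterating: the role of List Files plus the loop start
    for path in sorted(Path(folder).glob(pattern)):
        with path.open(newline="", encoding="utf-8") as handle:
            # Reading one file: the role of the CSV Reader
            for row in csv.DictReader(handle):
                # Same value on every row of this file: the role of the Cross Joiner
                row["filename"] = path.name
                rows.append(row)
    # One combined table: the role of the Loop End
    return rows
```

Each row comes back as a dictionary of strings, so a later join on the `filename` key is straightforward.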

armand_ink · May 31, 2017, 11:57pm

Hi Andi,

Perfect, I got it to work with your suggestion.

Thank you so much!!

Armand

TardisPilot · April 26, 2019, 2:05pm

@Andi, I built my workflow based on your image, but I want each file to be saved with the new column, or at least appended to a master table that has the filename column. When I run the loop, it just pulls in the next file, adds the column, dumps it, and reruns, etc.

How should these nodes be configured so that I can add the filename column to the actual .CSV file itself, or at least have a master table in my workflow that I can then export?

Andi · April 27, 2019, 9:11am

@TardisPilot
I took some new pictures of the test workflow. I hope this helps. If you have questions, don’t hesitate to contact me.
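
For comparison, the two outcomes asked about above (a copy of each file with the filename column, and one master table for export) can be sketched in Python. This is only an illustration and not the workflow from the pictures. It assumes all files share the same columns and that the output folder is different from the source folder; `export_with_filename` and `master.csv` are hypothetical names.

```python
import csv
from pathlib import Path


def export_with_filename(src_folder, out_folder, master_name="master.csv"):
    """Write a copy of each CSV with a trailing filename column, plus one combined file."""
    out_dir = Path(out_folder)
    out_dir.mkdir(parents=True, exist_ok=True)
    master_path = out_dir / master_name
    header_written = False
    with master_path.open("w", newline="", encoding="utf-8") as master_handle:
        master = csv.writer(master_handle)
        for src in sorted(Path(src_folder).glob("*.csv")):
            with src.open(newline="", encoding="utf-8") as handle:
                records = list(csv.reader(handle))
            if not records:
                continue  # skip empty files
            header = records[0] + ["filename"]
            body = [record + [src.name] for record in records[1:]]
            # Per-file copy carrying the extra column
            with (out_dir / src.name).open("w", newline="", encoding="utf-8") as out_handle:
                csv.writer(out_handle).writerows([header] + body)
            # Master table: header once, then the rows of every file
            if not header_written:
                master.writerow(header)
                header_written = True
            master.writerows(body)
    return master_path
```

Writing to a separate output folder keeps the original files untouched, which makes it easy to rerun if something goes wrong.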

Karlygash · August 18, 2021, 7:11am

Hi, @Andi !

I have tried to follow your steps, but it seems I am missing something here.
I would like to get the file names without creating a new column for each filename.
I am stuck at the URL to File Path node.
Can you please look at this?
I need to get the filename (here, the month and year from the file name) for each file.
new_file.knwf (1.7 MB)

ipazin · August 18, 2021, 9:45am

Hello there,

With KNIME version 4.4.0 you can use the Append path column option to get a file identifier when reading one or more files into KNIME. See here for more:

Br,
Ivan