How to map one CSV data to other CSV with pre-defined strucuture

prashant · July 29, 2020, 1:14pm

Hi,

I am novice in Knime platform. So, apologies if I am making no sense.

I have a requirement wherein I have to read a CSV file (I figured this part) with some structure, say 50 columns.
Then I will do some manipulation, transformation (I think I can do this part as well) on the column(s).
Ultimately I need to write to CSV file with some structure say 100 columns and output file. Sequence of columns are important for output file.

Is there a way in Knime using which I can do this. I mean any node for data mapping between source CSV to target CSV, or something like that.

Thanks
Prashant

ipazin · July 29, 2020, 1:42pm

Hi there @prashant,

welcome to KNIME Community!

Your requirement is frequent, makes sense and KNIME can do it just as you described it. Read your data, perform manipulations and transformations on it to get it into format you need and finally write data out. As you are novice in KNIME I recommend this (free) E-Learning course where you’ll learn basic concepts around KNIME: https://www.knime.com/knime-introductory-course
In above link you’ll find more learning content that can be useful. To check KNIME examples (&more) you can explore KNIME Hub.

Hope this helps!

Br,
Ivan

prashant · July 29, 2020, 2:40pm

Thank Ivan for quick reply.

I have gone through the course in bits and pieces; and have gathered some knowledge.
To start with I put this in the workflow.

But could not find a node which can help me add extra columns (consider when no data manipulation is required) even if if the records are blank.

Can you point me to some sample workflows that resembles this kind of requirement.

Many Thanks
Prashant

ScottF · July 29, 2020, 3:32pm

Hi @prashant

One way to add blank columns is by using the Constant Value Column node.

prashant · July 30, 2020, 6:08am

Hi @ScottF,

It worked, thanks a ton.

This design leads me to 3 questions:

Is there a simple way if I want to add 100 columns.
How I can map data in these columns. Is there a way I can map data in column values dynamically(may be depending on some values in other column. For example: I want to derive Manager based on EMPNAME).
Is there a way I can reorder columns of the file reader. Ex: From EMPNAME, EMPNO,DEPT to EMPNO,EMPNAME,DEPT.

Thanks in advance,
Prashant

ipazin · July 30, 2020, 9:09am

Hi @prashant,

Don’t think so but on the other hand why would you like to have 100 blank columns?
This depends on mapping. For example if you want to have new column with manager name depending on employee name you can use Rule Engine (have in mind that Rule Engine, as well as other nodes, don’t need a blank column before in order to be populated). And if you have manager name together with employee names in other table you can use Joiner or Cell Replacer to add new column to your table
Use Column Resorter node. A way to find out about nodes is to look for/search them in Node Repository from within KNIME Analytics Platform or on already mentioned KNIME Hub

Here is basic ETL workflow example which you can check:

Br,
Ivan

prashant · July 30, 2020, 12:48pm

Thanks @ipazin

I will need blank columns to match with the output file structure, as it has a predefined structure.
I think this information will be sufficient for me for now.

Regards
Prashant

ipazin · July 30, 2020, 12:57pm

Hi @prashant,

in that case use Table Validator (Reference) node. Among other things it has option to insert columns that are missing based on reference table.

Br,
Ivan

ipazin · July 30, 2020, 1:13pm

Hi @prashant,

Actually yes. Use Table Creator node and add as many column as you need with proper names and types. Then use Column Appender node to add them to your input data. You’ll have missing values but that should be just fine.

Br,
Ivan

prashant · July 30, 2020, 1:36pm

Hi @ipazin,

That’s awesome.
I will try this node.

Thanks a lot
Prashant

docminus2 · August 5, 2020, 9:30am

Once you have all your columns established and they should for some reasons have the wrong order (your wrote that this seems important), you can use the column sorter node.

system · February 3, 2021, 9:31pm

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.