This week let’s wear our data engineer’s hat for our new Just KNIME It! challenge. You are reorganizing your company’s data warehouse and has a list of folders that should be created in the process. The filesystem you are using is smart, and if you ask it to create, for example, a folder2 inside a folder1 and none of them exist, it creates both in the right order without raising errors. How can you leverage this feature to reduce the list of folders you actively have to create?
Here is the challenge. Let’s use this thread to post our solutions to it, which should be uploaded to your public KNIME Hub spaces with tag JKISeason3-8.
Need help with tags? To add tag JKISeason3-8 to your workflow, go to the description panel in KNIME Analytics Platform, click the pencil to edit it, and you will see the option for adding tags right there. Let us know if you have any problems!
Hello everyone,
I am impressed with @Tofusa 's workflow! I only changed the part of his workflow where he gets the parent folders. I used Regex Split instead of String Manipulation.
My workflow is similar to @Tofusa 's solution.
I learned from his solution that the GroupBy process (to remove duplicated column) is not needed.
Congrats!!
Hello JKIers,
Wow! Very amazing and efficient solutions. I just went straight away to test my first idea, based in ‘Recursive Loop’ plus regexMatcher(). And it worked
Thank you @alinebessa for this simple yet interesting challenge. I’m hoping I understood the problem right. My solution is slightly convoluted but has some decent logic that also provides some intermediate statistics. You can find my solution here