Solutions to "Just KNIME It!" Challenge 3

Victor_G · February 11, 2022, 12:02pm

Link updated : KNIME_Challenge-3 – KNIME Hub

duristef · February 11, 2022, 12:29pm

The dataset is messed up. In fact, “Prostate” is the most frequent male cancer site in 2017, but the whole “Male Genital System” section is not present in the dataset, so “Prostate” does not rank in the top 5 sites.

FrankColumbo · February 11, 2022, 1:30pm

Here is my solution. I get a lot out of this (KNIME it) by 1) make my own solution 2) look at other solutions 3) see the differences and learn.

ersy · February 11, 2022, 3:20pm

Hi everyone,
Here is my solution.

Just Knime it - 3

gonhaddock · February 11, 2022, 10:11pm

Hello KNIMErs,

Here is my solution to #justknimeit-3 :

KNIME Hub > gonhaddock > Spaces > Just_KNIME_It > Just KNIME It _ Challenge 003

Female: GREEN
Men: RED

BR

duristef · February 12, 2022, 12:35am

I’ve tried to correct the dataset. This is my solution
KNIME_challenge_3.knwf (69.6 KB)

elimisael · February 12, 2022, 12:39am

a 2nd versión, working on database in-memory

In-memory database

Adrix · February 12, 2022, 5:35pm

After reading some of other solutions I noticed that I missed it out that the data were mixed with the totals , I think now it is working fine

We should exclude the true in bellow

gonhaddock · February 12, 2022, 8:28pm

Hello,
I did some updates and modifications to the workflow. Some literature has been added order to clarify my results as well:

An error on 'Cancer Site Code’s Rule Engine has been amended.
It has been connected the exclusion of ‘All invasive Cancer Sites Combined’ site type, aiming to avoid bias in final results.

Female: GREEN
Male: RED

BR

duristef · February 13, 2022, 4:48pm

My solution on Knime Hub KNIME_challenge_3 – KNIME Hub

cf_123 · February 14, 2022, 7:19am

Hi,
here my solution: knime://My-KNIME-Hub/Users/cf_123/Public/jKi3

emilio_s · February 14, 2022, 10:47am

I’m (finally) not the author of this week’s challenge, so I can participate and share my proposed solution!

Thank you guys for pointing out your assumptions and concerns in this thread. I have also included mine (i.e. I excluded some cancer sites since they are “aggregations” of other counts).

I also got inspired by the picture in the challenge page and came up with a Choropleth map showing the incidence rate by state.

To do that, I used the Choropleth World Map component available on the KNIME Hub and modified the JavaScript code adding the option resolution:“provinces”.

elisrich · February 14, 2022, 12:30pm

here is my solution for this week’s challenge

I grouped the different breast cancer types distinguishing between male and female into one category and estimated the missing values as group means, based on gender & age to get an accurate estimate

paci · February 14, 2022, 3:13pm

Hi all,

Here is my solution to this challenge and my very first contribution on the KNIME Hub!

alinebessa · February 14, 2022, 9:59pm

Hi @duristef,

Thanks for your contribution!

Yes, as you’re hinting at, the results are going to be as good as the data is. It turns out that the data has problems that are going to end up reflecting on the solution. Our main goal here, however, is on how to answer the frequency questions by processing the data. How to clean the data, or how to improve its quality, are secondary (and more advanced) goals.

Cleaning and fixing datasets is definitely paramount for serious data science, and although we had not set out to focus on this here, we are pleased to see that our community is paying attention to it.

alinebessa · February 15, 2022, 3:07pm

Here’s our team’s solution to the challenge!

As mentioned yesterday, results may vary depending on how you process or interpret the data. A detailed explanation to our solution is here: Just KNIME It! | KNIME

We’re very happy with the diversity in the answers! Big shout-out to everybody who participated and see you tomorrow with a new challenge.

#justknimeit #justknimeit-3

anabelvelazque · May 21, 2022, 9:03am

Hi! Here is my solution

Challenge 3 - Just KNIME It!