Solutions to “Just KNIME It!” Challenge 20 - Season 4

:sun_with_face: New Wednesday, new Just KNIME It! challenge! :sunny:

:house_with_garden: You work for a real estate company and want to evaluate if machine learning can help you determine median housing prices better. Which models would you select first to start studying and comparing techniques? :money_with_wings:

Here is the challenge. Let’s use this thread to post our solutions to it, which should be uploaded to your public KNIME Hub spaces with tag JKISeason4-20 .

:sos: Need help with tags? To add tag JKISeason4-20 to your workflow, go to the description panel in KNIME Analytics Platform, click the pencil to edit it, and you will see the option for adding tags right there. :blush: Let us know if you have any problems!

2 Likes

Mapped the data points geographically and tagged with county location for filtering.

Correct predictions = green & incorrect= red.
Just ran a couple of models, accuracy not great.

Tested whether running predictions for particular counties would be better and it definitely is, for some.

3 Likes

Here’s my solution. Rather than reinventing the wheel, I used the AutoML component.

1 Like

My solution to the challenge:

I also used the AutoML component (which I think is the goat in topics like this, really love it). I trained it for every model it could offer and for me the deep learning was the winner (with R2 of 0.7155).

1 Like

Greetings team,

Here is my solution to this week’s challenge:

JKISeason4-20

I initially started to use the Random Forest model to test/compare the models and not sure if my results are correct for the Random Forest model which seem to indicate strong correlations between the training and test dataset:

But from viewing the others, it seems the AutoML component is the way to go and does alot of the tedious/hard work for us and provides a more comprehensive comparison analysis to other models.

@jproudfoot111 provided a good idea to include geospatial context with this challenge and I thought I follow suit but I just included the lat/long co-ordinates and ignored county information (but in real life that is definitely information you need in Real Estate business). I don’t work with geospatial data that often so this is a good exercise to get that practice. The geospatial view definitely provides excellent context to understand your real estate market versus just looking at statistical figures.

Cheers

1 Like

Find my submission -https://hub.knime.com/s/e32ejs0s00FPz58z

2 Likes

Hi all,

Here my solution: https://hub.knime.com/s/oHJZlrO_RDP1jWtC

I did some simple data cleaning to explore the data. Followed by 2 models training to compare. Then, decided to use the AutoML to compare the results.

Finally, a little use of the geospatial extension to see the possibilities.

Cheers all

Jerome

2 Likes

:sun_with_face: Our solution to last week’s Just KNIME It! challenge is out! :sunflower:

:robot: It was nice to see you playing with the AutoML verified component we have for regression – a great way of testing several models in parallel and getting a final ranking with minimal overhead!

:pencil: Stay tuned for tomorrow’s new challenge! We will explore our very own KNIME Forum. Let’s experiment with its content to hone our text processing skills, especially text summarization! :bar_chart: You can even go one step further and visualize these summaries!