Solutions to “Just KNIME It!” Challenge 22 - Season 4

:sun_with_face: Happy Wednesday, everybody! This week we’re back with a medium-level Just KNIME It! challenge. :brain: And remember: even if you can’t solve an entire challenge, there are always easier objectives for you to play with. We are all here to learn! :woman_student:

:notes: This week you’ll explore what drives global listening trends. Using a dataset of most streamed songs up to 2024, your goal is to uncover which artists dominated the charts. :trophy: Which artists released the most music? And who ranked highest in average track score? Discover which artists make the cut on both popularity and consistency.

Here is the challenge. Let’s use this thread to post our solutions to it, which should be uploaded to your public KNIME Hub spaces with tag JKISeason4-22 .

:sos: Need help with tags? To add tag JKISeason4-22 to your workflow, go to the description panel in KNIME Analytics Platform, click the pencil to edit it, and you will see the option for adding tags right there. :blush: Let us know if you have any problems!

1 Like

2 Likes

Here’s my solution. Thank god no JSON.

1 Like

Here’s my solution. Thank god no JSON. Sorry couldn’t figure out how to save single link in the revised Hub.

Hi all,

My solution here: JKISeason 4-22 - Analyzing Top Streaming Artists – KNIME Community Hub

I noticed some strange track name formatting, most likely to a change in the encoding. This is not handled here, but should be done for a prod workflow. Same with duplicates, nothing is checked in this current workflow, but should be done - also some inconsistency checking (dates, etc.)

Enjoy all

Cheers

Jerome

1 Like

Find my Submission : JKISeason4-22 – KNIME Community Hub

2 Likes

After many weeks where time was limited, I’m very happy to be back with a fun challenge. Here is my solution.

1 Like

My solution to the challenge:

My thinking:

  1. Getting the data
  2. just filter for the needed columns and convert them to the needed data types (filtered out all the missing values)
  3. Aggregate the data to year and visualize it in a bar chart
  4. Get the TOP10 track score and TOP10 total streams and check if there is some overlap between the two lists
    1. I started with the assumption with the minimum 10 tracks just refer to the track score (I understood from the task description like that) –> In that way, there was no overlap between the two lists
    2. From that I thought that is not the right approach, so I applied the minimum ten tracks for both track score and total streams, and in that way, there was four overlaps (MUSIC LAB JPN, Lil Nas X, Ed Sheeran, Harry Styles)
    3. As I’m not sure which is the correct approach I kept both in the workflow

My workflow:

I tried to keep the documentation of the workflow to the maximum :slight_smile: I’m really interested which approach is the correct one. If there is no overlap, or there is four. I think there will be answer in the solution on Tuesday :slight_smile:

1 Like

Greetings Knimers

Here is my solution to this week’s Just KNIME It challenge:

JKISeason4-22

I had a difficult time getting a dataset where both artists had highest mean track score and number of total streams and looked at @trj workflow to understand how to get artists in both data sets and sometimes the problems lies in understanding the question and order of operations. I misunderstood that the count of tracks needs to be considered when determining highest mean track score and streams for the first 10 tracks. When using the count function I was then able to get a combined dataset for specific artists with both highest mean track score and highest total streams.

I also considered Spotify Streams, Pandora Streams, and Soundcloud Streams as my total streams.

Cheers

1 Like

Hi!

My solution here: JKI_4_Challenge22

Let me know if you have any suggestions to improve my workflow :slight_smile:

1 Like

Well done @garcbcpa - Happy to read that the workflow helped you.