Solutions to "Just KNIME It!" Challenge 06 - Season 2

MY take on the challenge 6 … The dataset itself took a toll on the problem … Tried the approach to classify in Binary instead of multinomial … and the xgboost gives good results in binary… certainly not the good way … to discard the given option.
Didnt try much on text preprocessing and enrichment.
xg multi
xg binary

2 Likes

Here is my solution:

It was quite interesting to experiment with different methods (BERT, Spacy) and I trained a weaker model based on BERT embedings. As a bonus I decided to include conformal prediction to analyze model prediction certainty.


9 Likes

Hi, KNIMEr :partying_face: :partying_face: :partying_face:

Here is mine:

image

Just use LLMs :joy:

I also did some error analysis if you are interested in reading

4 Likes

Very interesting approaches to this problem!! The conformal prediction and error analysis parts (thanks @Artem and @HaveF) make me wonder if you folks would be interested in challenges touching on XAI. Let us know!

2 Likes

@alinebessa Sure, XAI seems to be the hot topic nowadays.

2 Likes

Hello KNIMErs

Here is my take to the JKI S02 CH06. I had no time to code by myself, but found some interesting Py blogs about how to do it; and learning many concepts about Sentiment Analysis as well.

What I’ve worked out from scratch, it has been the pre-process, quite supported by regex. I hadn’t time either to play with the parameters, but I got a considerable accuracy at my first approach, so I’m happy with that.

The modelling part has been ‘inspired by’ Sentiment Analysis (Classification) of Documents – KNIME Community Hub :upside_down_face:

BR

2 Likes

@alinebessa Of course! Btw, guided analytics is also an interesting topic!

1 Like

:boom: As always on Tuesdays, here’s the solution to last week’s challenge on sentiment analysis :grin: :sob:!

Naturally, our solution relies on the components that were created for it – but we use an additional one for AutoML purposes! We did not optimize the solution very much to increase the accuracy or F-measure, but this would be a great step in order to make this solution more relevant for negative reviews (which are the ones that “matter” the most).

You folks once again really impressed us with your solutions. Very creative! We hope to see you tomorrow for a new challenge.

2 Likes

Out of competition, but I’d like to share my method of automatization of learning model with different parameters to choose best and compare efficiency of different models using Box Plot and Loops
knime://My-KNIME-Hub/Users/chuvak56/Public/Challenge%2006

Hi Everyone :slight_smile:

A little bit late with this one but here is my solution for challenge 6:

The workflow uses the -Enrichment and Preprocessing- component, followed by the -Document Vectorization- component in order to prepare the data for training. An -X-Partitioner- node is used to start a cross validation loop and inside the loop are the -Logisitic Regression Learner- and the -Logistic Regression Predictor- nodes using “Class” as the target column and “neutral” as the reference category. The -Loop End- node concatenates the results from each iteration.

The -Scorer- calculates the accuracy at around 75%, which is acceptable for something that is not crucial and for the simplicity of the workflow used to create the model. However, it is also important to assess the model’s performance. In this solution there is high specificity for all 3 classes and reasonable to high sensitivity for the positive and negative classes, where as the neutral class has a sensitivity around 0.5 and is therefore pretty much random.

You can find my workflow on the hub here:

Best wishes
Heather

3 Likes

Hi Crazy KNIMErs.

Tarde, pero aquí estoy.

Os comparto mi solución.

He querido centrarme en este reto en usar distintas posiblidades mas que en sacar un buen accuracy.


2-023-05-10 233815

Vamos a por el siguiente!!! :mechanical_arm:

2 Likes

A lot late, but here is my solution: JKISeason2-6 – KNIME Community Hub

2 Likes

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.