Machine learning Classification (Decision Tree)

Haroon_954 · October 16, 2021, 6:22pm

Hello Everyone,
Hope everyone is good and doing well, I have developed a workflow for Decision Tree, to check the accuracy of Decsion tree model using Accuracy statistics (Scorer Node) and through ROC but here I am feeling difficulty how to plot ROC curve for this workflow, Please anyone can help, that will be great for me.

Thanking in Anticipation

aworker Thank you very much for your help in SVM workflow.
Hope you will help in this regard too

workflow is attached here
knime://LOCAL/MS%20Research%20October%202021/Decision%20Tree

elsamuel · October 16, 2021, 7:36pm

What difficulty are you having exactly, and where is the workflow?

Haroon_954 · October 17, 2021, 9:32am

Thank you for your response, workflow is attached here. when i draw the ROC curve it lies below the line (negative) as shown in the future

Decision Tree.knwf (184.0 KB)

aworker · October 17, 2021, 11:11am

Hi @Haroon_954

Thanks for uploading the workflow and glad to help here too. I have two comments about it and a recommendation:

In the case of Classifiers based on Decision Trees and ensembles made of Decision Trees such as Random Forest, etc., you do not need to normalize the descriptors. This is because they treat every variable separately when deciding which one to use and how. This is not the case for neural networks (FFNN, SVM, RBF, etc.), where variables are all used together to compute distances and hence you need to normalize the variables to avoid bias towards the highly scaled ones. To summarize, it doesn’t harm to normalize but it is not needed.
The Decision Tree Predictor- node provides too with a Probability (Normalize Class Distribution) which you could use to generate your ROC curve:

Hope this helps.

Best

Ael

Haroon_954 · October 17, 2021, 1:41pm

Hi aworker, Thank you so much Sir for your kind response. Yes got it and the issue solved.
Thanks & Respect

Haroon_954 · October 17, 2021, 1:45pm

Hi aworker, Sir I need a suggestion, I need to learn concepts of machine learning algorithms like (SVM, RF, ANN, Decision Tree, Logistic Regression, etc). what would you suggest from where I should learn concepts of these classifications as I am using these techniques un KNIME.

Please Help
Thanks in Advance

aworker · October 18, 2021, 10:11am

Hi @Haroon_954

How to answer your question quite depends on your current level of knowledge on Machine Learning and on your background (Mathematics-Statistics / Informatics / Humanities / etc.). It would help if you could tell us a bit about your background. To start with a simple answer (assuming here a beginner’s level) , I would recommend to start with the KNIME documents available at the following link:

Many of them are provided free of charge and are very practical.

Hope this helps.

Best,

Ael

Haroon_954 · October 19, 2021, 7:38am

Hi aworker,

Thank you for your recommendations. Actually, I am doing MS in Transportation Engineering having Bachelors’s degree in Civil Engineering, and currently doing my Research in Machine learning classification techniques for accidents crash data. I am new to this field so your recommended books will help me. I hope you got some idea about my background. Actually, I am not developing some codes but checking the performance of different classification models using KNIME and WEKA software.

Please if you can give me some more recommendations or suggestions. I will start learning this book you mentioned

Thank you so much for your responses and guidance.
Respect

aworker · October 19, 2021, 4:15pm

Hi @Haroon_954

Thanks for your kind comments and for having validated the solution here too.

In this case, since you are familiar with mathematical notation, I would definitely recommend to follow from A to Z the extremely good tutorials by Jason Brownlee at his blog:

He is one of my preferred blog authors on Machine Learning. He is skilled, rigorous and very didactic. His books are really affordable in PDF format with plenty of examples and all of them with exercises are covered on his many blog articles as well. I would definitely recommend him to a MS Engineering student to start learning Machine Learning from scratch but with a good formal mathematical teaching.

His examples are not based on KNIME but on Python. Having said this, I believe it would be an excellent exercise for you to take these examples and try to translate them into KNIME workflows. Besides this, you could also take his Python code and directly integrate it into the KNIME Python nodes.

But as I said before, I would start first with the KNIME Press books to get really familiar with the KNIME environment and then the rest.

Hope this helps.

Good luck and best wishes,

Ael

mlauber71 · October 19, 2021, 4:22pm

I would also recommend his blog and books. And with KNIME you would not have to choose - you can have it all

Haroon_954 · October 21, 2021, 5:16am

Hi Sir aworker

Thank you so much for your recommendations. Yes, definitely I will see all his lectures and books to learn as much as possible.
This will help a lot

Again Thanking you
More Power and Respec Sir

system · October 28, 2021, 5:17am

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.