How to clean and improve a dataset

Hi guys! I need your help to get the most out of a dataset.
I found this dataset on Kaggle: https://www.kaggle.com/austinreese/craigslist-carstrucks-data
I would like some suggestions on how to handle the missing values (delete the rows, or fill them with the mode or the mean?), what new variables I could create (car lifetime, odometer reading per year?), and which kinds of analysis best fit this kind of data.
Thank you very much for your help!
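
To make the "new variables" idea concrete, here is a minimal sketch in plain Python. It assumes each listing is a dict with `year` and `odometer` keys (check the actual column names in the Kaggle file); the function name `add_derived_features` and the `reference_year` parameter are made up for illustration.

```python
def add_derived_features(row: dict, reference_year: int) -> dict:
    """Return a copy of `row` with 'age' and 'odometer_per_year' added.

    Assumes `row` may hold 'year' and 'odometer' (None when missing).
    """
    out = dict(row)
    year = row.get("year")
    odometer = row.get("odometer")

    # Car age in years; a model year after the reference year is treated as unknown.
    age = None
    if year is not None and reference_year - year >= 0:
        age = reference_year - year
    out["age"] = age

    # Average distance per year; cars less than a year old count as one year
    # to avoid dividing by zero.
    if age is None or odometer is None:
        out["odometer_per_year"] = None
    else:
        out["odometer_per_year"] = odometer / max(age, 1)
    return out


# Hypothetical sample rows, not taken from the real dataset.
listings = [
    {"year": 2010, "odometer": 120000.0},
    {"year": 2020, "odometer": 5000.0},
    {"year": None, "odometer": 80000.0},
]
features = [add_derived_features(r, reference_year=2020) for r in listings]
```

Rows with a missing year or odometer keep `None` in the derived columns, so the missing-value decision stays a separate step.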

I hope to get some help!

Yes, of course! That would be great!

For missing-value management, use the Missing Value node.


Thanks for the answer. How does it work?

UP! Please guys, united we can win!

See here


You may find this video useful. Info on the Missing Value node starts at 3:00, but there are other useful concepts as well:


Thanks for the reply! Did you see the dataset I linked? For several variables I'm not sure whether to fill in the missing values with that node or to run some bivariate analysis first to check the correlations.
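
One way to decide between the two options is to look at how much is missing and whether the missingness depends on another variable before filling anything in. Below is a rough sketch in plain Python, outside any particular tool. The helper names (`missing_share`, `missing_share_by`, `impute`) and the sample rows are hypothetical, and `odometer` and `fuel` are assumed column names.

```python
from collections import Counter
from statistics import median


def missing_share(rows, column):
    """Fraction of rows where `column` is missing (None)."""
    return sum(r.get(column) is None for r in rows) / len(rows)


def missing_share_by(rows, column, by):
    """Fraction of missing `column` values within each group of `by`.

    Very different shares between groups hint that the values are not
    missing at random, so a single global fill value may be misleading.
    """
    totals, missing = Counter(), Counter()
    for r in rows:
        key = r.get(by)
        totals[key] += 1
        missing[key] += r.get(column) is None
    return {k: missing[k] / totals[k] for k in totals}


def impute(rows, column, strategy):
    """Return new rows with missing `column` values filled in.

    strategy: "median" for numeric columns, "mode" for categorical ones.
    """
    observed = [r[column] for r in rows if r.get(column) is not None]
    if not observed:
        return [dict(r) for r in rows]  # nothing to learn a fill value from
    if strategy == "median":
        fill = median(observed)
    elif strategy == "mode":
        fill = Counter(observed).most_common(1)[0][0]
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return [{**r, column: fill if r.get(column) is None else r[column]} for r in rows]


# Hypothetical sample rows, not taken from the real dataset.
rows = [
    {"odometer": 50000.0, "fuel": "gas"},
    {"odometer": None, "fuel": "gas"},
    {"odometer": 150000.0, "fuel": None},
    {"odometer": 90000.0, "fuel": "diesel"},
]
cleaned = impute(impute(rows, "odometer", "median"), "fuel", "mode")
```

The median is used instead of the mean here because odometer and price values in scraped listings tend to contain extreme outliers; that is a judgment call, not a rule.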

Hi guys! The dataset contains URLs to photos of the cars. Is it possible to find out the colour of a car from these pictures?
Thanks
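
A very rough way to approach this is to take the most frequent colour among the pixels and map it to the nearest named colour. The sketch below is plain Python and assumes the pixels are already available as `(r, g, b)` tuples, for example after downloading the photo and decoding it with an imaging library. The palette `NAMED_COLOURS` and the function names are made up for illustration, and it only works if the car fills most of the frame.

```python
from collections import Counter

# Hypothetical small palette; extend it with whatever colour labels you need.
NAMED_COLOURS = {
    "black": (0, 0, 0),
    "white": (255, 255, 255),
    "grey": (128, 128, 128),
    "red": (200, 0, 0),
    "green": (0, 128, 0),
    "blue": (0, 0, 200),
    "yellow": (230, 200, 0),
}


def nearest_colour_name(rgb):
    """Name of the palette colour closest to `rgb` (squared Euclidean distance)."""
    return min(
        NAMED_COLOURS,
        key=lambda name: sum((a - b) ** 2 for a, b in zip(rgb, NAMED_COLOURS[name])),
    )


def dominant_colour(pixels, bucket=32):
    """Name of the most frequent colour among `pixels`.

    Pixels are grouped into coarse buckets first so that small shading
    differences on the car body count as the same colour.
    """
    counts = Counter(
        tuple((c // bucket) * bucket + bucket // 2 for c in p) for p in pixels
    )
    centre = counts.most_common(1)[0][0]
    return nearest_colour_name(centre)


# Hypothetical image: 60 reddish car pixels and 40 near-white background pixels.
sample_pixels = [(210, 10, 10)] * 60 + [(250, 250, 250)] * 40
colour = dominant_colour(sample_pixels)
```

On real photos the background (road, sky, buildings) often dominates, so cropping to the centre of the image first, or using a proper object-detection model, would probably be needed for usable results.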

