Mix and Match Predictive Approach: From Hive through In-Database Processing to KNIME Analytics Model Training.

This workflow reads CENSUS data from a Hive database in HDInsight, performs some in-database processing on Hive, and finally trains a KNIME decision tree model to predict COW (class of worker) values from all other attributes.

The data for this example come from the new CENSUS dataset, which is publicly available and can be downloaded from: http://www.census.gov/programs-surveys/acs/data/pums.html

A full explanation of all attributes can be found in: http://www2.census.gov/programs-surveys/acs/tech_docs/pums/data_dict/PUMSDataDict15.pdf
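For readers who want to see the in-database idea outside KNIME, here is a minimal sketch of it. It is not the workflow itself. It assumes Python's built-in sqlite3 module as a stand-in for the Hive connection, and the table name `pums`, the helper `fetch_training_rows`, and the sample rows are all made up for illustration. Only the column names (AGEP, SCHL, WAGP, COW) are taken from the PUMS data dictionary. The point is that the filtering runs inside the database, so only rows usable for training are transferred to the modelling step.

```python
import sqlite3

# Stand-in for the Hive connection: an in-memory SQLite database.
# The table name "pums" and the sample rows below are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE pums (AGEP INTEGER, SCHL INTEGER, WAGP INTEGER, COW INTEGER)"
)
conn.executemany(
    "INSERT INTO pums VALUES (?, ?, ?, ?)",
    [
        (34, 21, 52000, 1),
        (51, 16, 31000, 3),
        (12, 8, None, None),  # row without a COW value, unusable as a training target
        (45, 22, 88000, 1),
        (29, 19, 27000, 6),
    ],
)


def fetch_training_rows(connection):
    """Filter inside the database so only rows with a known COW are transferred."""
    query = (
        "SELECT AGEP, SCHL, WAGP, COW FROM pums "
        "WHERE COW IS NOT NULL ORDER BY AGEP"
    )
    return connection.execute(query).fetchall()


rows = fetch_training_rows(conn)

# Split into predictor columns and the target column (COW) for a learner.
features = [row[:-1] for row in rows]
target = [row[-1] for row in rows]
```

The resulting `features` and `target` lists would then be handed to whatever learner is used; in the workflow that role is played by the KNIME decision tree learner.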


This is a companion discussion topic for the original entry at https://kni.me/w/4g4ff_GnlKuGDRMM