The coloured bars above each column name indicate the quality of the data. If you hover over a bar with the mouse, you'll see the percentage of the dataset affected by:
- null values
- missing values
- leading or trailing whitespace
- multirow formatting
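
To make the percentages concrete, here is a minimal sketch of how such per-column figures could be computed with pandas. It is not how Alteryx does it internally. The function name `quality_percentages` is hypothetical, and it assumes that "missing" means an empty string and that "multirow" means a value containing a line break.

```python
import pandas as pd


def quality_percentages(df: pd.DataFrame) -> pd.DataFrame:
    """Return, for each column, the % of rows in each quality category."""
    n = len(df) or 1  # avoid dividing by zero on an empty table
    result = {}
    for name in df.columns:
        col = df[name]
        # Only string cells can be empty, padded, or multi-line.
        strings = [v for v in col if isinstance(v, str)]
        result[name] = {
            "null": 100 * int(col.isna().sum()) / n,
            # Assumption: "missing" is read as an empty string.
            "empty": 100 * sum(1 for s in strings if s == "") / n,
            "padded": 100 * sum(1 for s in strings if s != s.strip()) / n,
            # Assumption: "multirow" is read as an embedded line break.
            "multiline": 100 * sum(1 for s in strings if "\n" in s) / n,
        }
    return pd.DataFrame.from_dict(result, orient="index")


if __name__ == "__main__":
    sample = pd.DataFrame(
        {
            "name": ["Ann", " Bob", "", None],
            "note": ["ok", "line1\nline2", "ok", "ok"],
        }
    )
    print(quality_percentages(sample))
```

Each row of the result corresponds to one column of the input, which is roughly the information a bar plus its hover tooltip would carry.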
In Alteryx you can inspect each node's output as you can in KNIME, but you only see a portion of it, so these data quality checks refer only to that subset. If you want to perform a full data quality check and inspection (as in KNIME), you have to drag in a Browse tool.
Where to implement this functionality? Well, not only in the node monitor, but also in the data viewer, when you are just analyzing the output of a node.
How can you improve this? Maybe by adding more complete profiling, such as the number of distinct values for categorical columns, and the mean, median, min and max for numerical ones.
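
As a rough illustration of that extra profiling, the sketch below uses pandas to report the distinct count for non-numeric columns and the mean, median, min and max for numeric ones. The function name `profile_columns` is hypothetical, and treating every non-numeric column as categorical is a simplifying assumption.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype


def profile_columns(df: pd.DataFrame) -> dict:
    """Return a small profile per column, keyed by column name."""
    profile = {}
    for name in df.columns:
        col = df[name]
        if is_numeric_dtype(col):
            # Numeric column: basic summary statistics (nulls are skipped).
            profile[name] = {
                "mean": col.mean(),
                "median": col.median(),
                "min": col.min(),
                "max": col.max(),
            }
        else:
            # Assumption: anything non-numeric is treated as categorical.
            profile[name] = {"distinct": col.nunique()}
    return profile


if __name__ == "__main__":
    sample = pd.DataFrame(
        {"city": ["Rome", "Milan", "Rome"], "price": [10.0, 20.0, 60.0]}
    )
    for column, stats in profile_columns(sample).items():
        print(column, stats)
```

Note that `nunique()` ignores nulls by default, so the distinct count and the null percentage stay separate pieces of information.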