Hello! :)
I’d like to know if it is possible to use Hadoop distributed capabilities and cluster resources to predict in parallel when using PMML Predictor.
For instance, suppose I connect to Hadoop in some way and also load a pmml model. What I want to do is to send a process request for Hadoop to score the data (on a Hive table, for example) using the cluster resources, taking advantage of the map phase to work in parallel.
Is it clear?
Thank you!