Deployment_workflow

In this use case, we use the NYC taxi dataset and a Random Forest to train a simple time series model that predicts taxi demand for the next hour from the demand observed in the preceding hours. For better scalability, we train and test the model on a Spark cluster.
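
To illustrate what "based on data from past hours" can mean in practice, here is a minimal Python sketch of turning an hourly demand series into lagged training rows. It is not taken from the workflow itself: the function name `make_lagged_rows`, the choice of three lags, and the sample counts are all assumptions made for illustration, and the workflow may build its features differently.

```python
from typing import List, Tuple


def make_lagged_rows(
    hourly_counts: List[int], n_lags: int
) -> Tuple[List[List[int]], List[int]]:
    """Turn an hourly demand series into (features, target) pairs.

    Each feature row holds the demand of the previous `n_lags` hours,
    and the target is the demand of the hour that follows them.
    """
    if n_lags < 1:
        raise ValueError("n_lags must be at least 1")

    features: List[List[int]] = []
    targets: List[int] = []
    # Start at n_lags so every row has a full window of past hours.
    for t in range(n_lags, len(hourly_counts)):
        features.append(hourly_counts[t - n_lags:t])
        targets.append(hourly_counts[t])
    return features, targets


# Hypothetical hourly trip counts, made up for illustration only.
demand = [120, 135, 160, 210, 190, 175]
X, y = make_lagged_rows(demand, n_lags=3)

for row, target in zip(X, y):
    print(row, "->", target)
```

Rows of this shape (past hours as input columns, the next hour as the target column) are what a regression learner such as a Random Forest would then be trained on.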


This is a companion discussion topic for the original entry at https://kni.me/w/NFHwoCcRUb55Ql2v