If you want to see this approach in action in a big data environment, you could check out this collection of workflows. It demonstrates how to develop and deploy H2O.ai automated machine learning (AutoML) models with the help of the Sparkling Water node.
There is also a presentation on YouTube showing how it is done:
(the presentation is in German, the slides are in English)