Have a look at this post from Vincenzo about a year ago on a similar topic. He talks a little about SMOTE and its reference paper.
As you mentioned, the Equal Size Sampling node may be a good approach as well.