Well, the workflow is huge (it’s like Inception, but with metanodes). It’s mostly a database export from a forum’s MariaDB, and I’m applying a learner and a predictor to the dataset to predict user gender (I already have a column with a gender label, but not all users have one and I need it for all of them).
It works fine until I have to learn from the forum messages. I sampled the messages down to 50K and it works now, but it really brings that computer to its knees.
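For anyone who thinks better in code, here is a rough Python sketch of the idea. It is not the actual KNIME workflow: the column names (`gender`, `message`) and the helper `split_and_sample` are made up for illustration, and the only number taken from above is the 50K cap. It just splits the rows into those with a gender label (for the learner) and those without (for the predictor), then samples the labeled rows down if there are too many.

```python
import random

def split_and_sample(rows, label_key="gender", cap=50_000, seed=0):
    """Split rows into labeled / unlabeled and cap the labeled set.

    `rows` is a list of dicts; `label_key` is a hypothetical column name.
    """
    # Rows with a non-empty label go to the learner.
    labeled = [r for r in rows if r.get(label_key)]
    # Rows without a label are the ones the predictor has to fill in.
    unlabeled = [r for r in rows if not r.get(label_key)]
    # Sample down only when the labeled set exceeds the cap.
    if len(labeled) > cap:
        rng = random.Random(seed)  # fixed seed so the sample is repeatable
        labeled = rng.sample(labeled, cap)
    return labeled, unlabeled

if __name__ == "__main__":
    # Tiny made-up example: every other user has a label.
    rows = [
        {"user": i, "gender": "f" if i % 2 == 0 else None, "message": "hello"}
        for i in range(10)
    ]
    train, to_predict = split_and_sample(rows, cap=3)
    print(len(train), len(to_predict))
```

Sampling with a fixed seed at least makes the reduced training set repeatable between runs, which helps when comparing results.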