A short question: does anybody know a freely available classification dataset on which models perform only moderately out of the box, but where parameter optimization can push the accuracy to a high level?
Most of the datasets I have used allow only a small improvement through optimization, but for demonstration purposes I would like a dataset where optimization raises the accuracy substantially.
I like to demonstrate parameter optimization with decision trees, because you can easily overfit them, i.e. make them perform poorly on data points that were not part of the training partition.
Instead of focusing on how much you can improve on a model trained with default parameters (they are chosen for a reason, likely because they are supposed to work OK in many cases), you can do a grid search and compare the best and worst parameter combinations to show the possible impact of hyperparameter optimization.
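To make the best-versus-worst comparison concrete, here is a minimal sketch in Python with scikit-learn. The choice of library, the bundled digits dataset, and the grid values are my own assumptions for illustration, not something from this thread; any tool with a grid search over tree parameters should work the same way.

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.tree import DecisionTreeClassifier

# Assumption: the digits dataset bundled with scikit-learn stands in for
# "some classification dataset"; swap in your own data here.
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

# Illustrative grid: very shallow trees underfit, fully grown trees
# with tiny leaves tend to overfit.
param_grid = {
    "max_depth": [1, 2, 4, 8, 16, None],
    "min_samples_leaf": [1, 5, 20, 50],
}

search = GridSearchCV(
    DecisionTreeClassifier(random_state=0),
    param_grid,
    cv=5,
    scoring="accuracy",
)
search.fit(X_train, y_train)

# Cross-validated accuracy for every parameter combination in the grid.
scores = search.cv_results_["mean_test_score"]
params = search.cv_results_["params"]
best_idx = scores.argmax()
worst_idx = scores.argmin()

print("best :", params[best_idx], round(float(scores[best_idx]), 3))
print("worst:", params[worst_idx], round(float(scores[worst_idx]), 3))

# Accuracy of the best combination on the held-out partition.
test_acc = search.best_estimator_.score(X_test, y_test)
print("held-out accuracy of best model:", round(float(test_acc), 3))
```

The gap between the two printed cross-validation scores is what I would show in a demo, rather than the gain over the defaults.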
I mention this only because I just stumbled over it: there is a nice example of how to optimize a regression here, which points to some data where I'd assume that optimization actually allows you to improve the regression: