difficult to debug this without workflow. Could you attach your workflow with some sample data, so that I can have a look?
Best,
Philipp
PS: We provide a node for content extraction from web pages (ContentExtrator), you might consider using this one instead of Boilerpipe. It doesn't require any web API access and during our evaluations it was more reliable than the Boilerpipe algorithm.