Ka-Boom! Palladian 2.0

:birthday: Happy Birthday Palladian – Here’s v2.4

We have just celebrated the 10th birthday of the Palladian plugin for KNIME (useless fact: the first commit was 2011-02-01, 17:59 CET), so this seems like a good opportunity to release this update.

Please find the detailed change log below. We have upgraded the wrapped Palladian library to fix several issues, and we have added some new nodes and functionality.

There’s a new OAuth Connector [BETA] node, which lets you build browser-based OAuth flows in your KNIME workflow.

This way you can easily authenticate with the following API services: 500px, Asana, AWeber, Box, Dataporten, Digg, Discord, Dropbox, Etsy, Facebook, Fitbit, Flickr, Foursquare, Freelancer, Genius, GitHub, Google, HeadHunter ХэдХантер, HiOrg-Server, Imgur, Kaixin 开心网, Kakao, Keycloak, LinkedIn, Mail.Ru, MediaWiki, Meetup, Microsoft Azure Active Directory (Azure AD), Microsoft Azure Active Directory (Azure AD) 2.0, Microsoft Live, NAVER, Odnoklassniki Одноклассники, Pinterest, Polar, Renren, Salesforce, Sina, Skyrock, Slack, StackExchange, Trello, Tumblr, TUT.BY, Twitter, uCoz, Viadeo, VK ВКонтакте, Xero, XING, and Yahoo.

Admittedly, this is a node for advanced users who are familiar with REST APIs, and we still consider it BETA. Because there has been quite some demand recently, we decided to roll it out at this early stage, and we’re looking for your feedback. To install it, you must explicitly enable “Palladian for KNIME: OAuth Nodes”.
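For readers less familiar with what a browser-based flow involves, here is a minimal Python sketch of the two client-side steps of a generic OAuth 2.0 authorization-code flow: building the URL that gets opened in the browser, and reading the authorization code from the redirect. This is not how the OAuth Connector node is implemented; the endpoint, redirect URI, client ID, and function names are hypothetical placeholders.

```python
import secrets
from urllib.parse import parse_qs, urlencode, urlparse

# Hypothetical values for illustration only; a real provider documents its own.
AUTHORIZE_ENDPOINT = "https://auth.example.com/oauth/authorize"
REDIRECT_URI = "http://localhost:8080/callback"


def build_authorization_url(client_id: str, scope: str, state: str) -> str:
    """Build the URL the user opens in a browser to grant access."""
    params = {
        "response_type": "code",  # ask for an authorization code
        "client_id": client_id,
        "redirect_uri": REDIRECT_URI,
        "scope": scope,
        "state": state,  # random value echoed back by the provider
    }
    return f"{AUTHORIZE_ENDPOINT}?{urlencode(params)}"


def extract_code(callback_url: str, expected_state: str) -> str:
    """Read the authorization code from the redirect and check the state."""
    query = parse_qs(urlparse(callback_url).query)
    if query.get("state", [None])[0] != expected_state:
        raise ValueError("state mismatch, possible forged redirect")
    if "code" not in query:
        raise ValueError("no authorization code in redirect")
    return query["code"][0]


if __name__ == "__main__":
    state = secrets.token_urlsafe(16)
    print(build_authorization_url("my-client-id", "read", state))
```

The returned code would then be exchanged for an access token at the provider's token endpoint, which is the part that differs most between services.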

Changes since 2.3 in detail (see also here):

  • (Info) Requires at least KNIME 4.1 (please make sure you’re using an update site URL corresponding to your KNIME version)
  • (Add) Trim Image Whitespace: Node to remove whitespace surrounding a PNG image
  • (Add) HTTP Retriever: Allow overriding the default proxy configuration in the “Proxy” tab (see here)
  • (Add) HTTP Retriever: Store redirected location in HTTP Results
  • (Add) HTTP Result Data Extractor: Add setting “Append redirected locations”
  • (Add) HTTP Retriever: Allow PATCH
  • (Add) N-Gram Extractor: Allow specifying the output column name
  • (Add) N-Gram Extractor: Allow dropping the input column
  • (Add) Base64 Encoder, Base64 Decoder: New nodes for encoding/decoding Base64
  • (Add) OAuth Connector: New node for connecting to 50+ OAuth-based APIs. This node is currently labeled as “BETA”: it may contain bugs, and later versions may change its functionality. If you have feedback or bug reports, please do reach out!
  • (Change) Use version 2.0 of Palladian Toolkit library
  • (Change) HTTP Retriever: Show execution warnings on the node in addition to logging them (e.g. when the HTTP method is missing or invalid, when the URL contains whitespace, or in case of network errors)
  • (Change) HTTP Retriever: Automatically trim whitespace around URLs (see here)
  • (Change) AP Calculator: Make node streamable
  • (Change) Coordinate to Latitude/Longitude: Make node streamable
  • (Change) Form Encoded HTTP Entity Creator: Make node streamable
  • (Change) Hash Calculator: Make node streamable
  • (Change) HTML Parser: Make node streamable
  • (Change) Latitude/Longitude to Coordinate: Make node streamable
  • (Change) Multipart Encoded HTTP Entity Creator: Make node streamable
  • (Change) String Similarity: Make node streamable
  • (Change) Trim Image Whitespace: Make node streamable
  • (Change) URL Domain Extractor: Make node streamable
  • (Change) URL Normalizer: Make node streamable
  • (Change) URL Resolver: Make node streamable
  • (Change) Web Page Content Extractor: Make node streamable
  • (Remove) Ranking Services: Remove obsolete Compete, Delicious, DMOZ
  • (Fix) HTTP Retriever: Required validation for User Agent input in dialog
  • (Fix) HTTP Retriever: Prevent entering negative values for Socket Timeout
  • (Fix) Text Classifier Learner, Text Classifier Predictor: Fix link to press release in node documentation (kudos to @armingrudd)
  • (Fix) TF-IDF Similarity: Fix NaN values (see here)
  • (Fix) HTML Parser: Fix absolute URLs on redirected requests
  • (Fix) HTTP Retriever: Properly handle URLs with ? which are not query params (see here)
  • (Fix) HTTP Retriever: Make parsing of cookie “expires” attribute more lenient and behave more like a web browser (see here)
  • (Fix) HTTP Retriever: Automatically strip away URL #fragments (see here)
  • (Fix) HTTP Retriever: Properly honor the given “Maximum file size” limit, even when below 1024 bytes (see here)
  • (Fix) GeoIP2 Extractor: Improve error message when DB file cannot be accessed
  • (Fix) Hash Calculator: Correctly honor “Remove input column” setting
  • (Fix) HTTP Retriever: Catch potential NullPointerException in SSL-related code (see here)
  • (Fix) Fix potential version conflicts with NodePit licensing plugin
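To illustrate two of the HTTP Retriever entries above (trimming whitespace around URLs and stripping `#fragments`), here is a small Python sketch of that kind of cleanup. It only approximates the described behavior with Python's standard library and is not the plugin's actual code; `clean_url` is a hypothetical helper name.

```python
from urllib.parse import urldefrag


def clean_url(raw: str) -> str:
    """Trim surrounding whitespace and drop the #fragment, if any."""
    trimmed = raw.strip()
    # urldefrag splits off the fragment and leaves the rest untouched,
    # including any "?" part.
    without_fragment, _fragment = urldefrag(trimmed)
    return without_fragment


if __name__ == "__main__":
    print(clean_url("  https://example.com/page?x=1#section-2 \n"))
```

Fragments are only meaningful to the client, so dropping them before sending a request mirrors what a web browser does.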

Anything missing? Any question? Any feedback? Get in touch!
