In addition to the PDF Parser, you can also use the Tika Parser node to extract data from PDFs. After that, the next step is usually to use a Strings to Document node to prepare the data for text processing. Please have a look at these links for some inspiration for what you can do for text processing in KNIME: