DATASET help

Hi everyone,

I urgently need a dataset that contains the following attributes:

  • url - Document url
  • title - Document title
  • top_tags - 10 most frequent tags attached to the document
  • keywords - Keywords the author attached to the document
  • publication
  • editor
  • year
  • discipline
  • citation_count
  • citation

These attributes can belong to either a research paper or a web page.
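
To make the request more concrete, one record of such a dataset could be sketched roughly as below. This is only an illustration: the class name `DocumentRecord`, the field types, and the choice of which fields are optional are assumptions, not part of any existing dataset, and the sample values are made up.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DocumentRecord:
    """Hypothetical record layout for the requested dataset."""
    url: str                                            # Document url
    title: str                                          # Document title
    top_tags: List[str] = field(default_factory=list)   # most frequent tags
    keywords: List[str] = field(default_factory=list)   # author keywords
    publication: Optional[str] = None
    editor: Optional[str] = None
    year: Optional[int] = None
    discipline: Optional[str] = None
    citation_count: Optional[int] = None
    citation: Optional[str] = None

    def __post_init__(self):
        # Keep at most 10 tags, matching the "10 most frequent tags" attribute.
        self.top_tags = list(self.top_tags)[:10]


# Made-up example values, for illustration only.
record = DocumentRecord(
    url="http://example.org/doc/1",
    title="An example document",
    top_tags=["tag%d" % i for i in range(12)],
    keywords=["text mining", "tagging"],
    discipline="Computer Science",
)
```

Fields other than `url` and `title` default to empty, since a web page may not have an editor, a publication, or citation data.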

I have searched a lot but couldn't find such a dataset.

The most important attributes are:

  • title - Document title
  • top_tags - 10 most frequent tags attached to the document
  • keywords - Keywords the author attached to the document
  • discipline
  • citation

Thanks in advance.

Have you already taken a look at the Node Guide? It has a lot of text-processing workflows, and all of them include data that is very similar to what you are looking for.

For example: https://www.knime.org/nodeguide/other-analytics-types/text-processing/document-classification