...

The Pipelet is configurable within the Pipeline Editor.

Input Fields

Which fields to consider for further analysis.

  • fields_to_consider: Comma-separated list of fields (default: title,body)
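
As a minimal sketch of how a comma-separated value such as fields_to_consider could be turned into a list of field names, consider the following. The function name parse_fields and the whitespace handling are assumptions made for illustration; the pipelet's actual parsing is not documented here.

```python
def parse_fields(value="title,body"):
    """Split a comma-separated field list into clean field names.

    Hypothetical helper: the default mirrors the documented default
    (title,body); stripping whitespace and skipping empty entries are
    assumptions, not confirmed pipelet behaviour.
    """
    return [name.strip() for name in value.split(",") if name.strip()]


fields = parse_fields(" title , body,,abstract ")
print(fields)
```

Trimming each entry keeps a value like "title, body" from producing a field named " body" that would never match an item field.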

PDF Approximation

To reduce the processing time of large PDFs, only a subset of pages is considered.

...

  • tag_phrases: Enable / disable key-phrase tagging

  • tag_top_k_phrases: Number of phrases to tag

    • dynamic: Total number of phrases is selected relative to document size (between 20 and 70)

    • 10: Take the N highest-ranked phrases, where N is the specified number

  • tag_topics: Enable simple topic-tagging based on key-phrases
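
The two modes of tag_top_k_phrases can be sketched as follows. Only the 20 to 70 range comes from the description above; the function name resolve_top_k and the scaling rule (one phrase per 100 words) are hypothetical, since the actual formula used by the pipelet is not stated.

```python
def resolve_top_k(setting, word_count, words_per_phrase=100):
    """Return how many phrases to tag for one document.

    setting          -- "dynamic" or a fixed number N (int or numeric string)
    word_count       -- document size in words (assumed size measure)
    words_per_phrase -- hypothetical scaling factor, NOT taken from the pipelet
    """
    if str(setting).strip().lower() == "dynamic":
        # Scale with document size, then clamp to the documented 20-70 range.
        return max(20, min(70, word_count // words_per_phrase))
    # Fixed mode: take exactly the N highest-ranked phrases.
    return int(setting)


for setting, words in [("dynamic", 500), ("dynamic", 5000), ("10", 5000)]:
    print(setting, words, resolve_top_k(setting, words))
```

Clamping keeps very short documents from ending up with too few tags and very long ones from being flooded with low-ranked phrases.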

Enrichment

Key phrases are stored in the nlp_tag__phrases facet.
The item's title is also added as an nlp_tag__phrases facet value (as-is, without processing).
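
The enrichment described above might look like the sketch below. It assumes that an item is a plain dictionary and that facet values live in a "keywords" mapping; both the layout and the function name enrich are assumptions for illustration, not the confirmed item format.

```python
def enrich(item, phrases):
    """Store key phrases and the raw title in the nlp_tag__phrases facet.

    Assumed item layout (hypothetical): {"title": ..., "keywords": {facet: [values]}}.
    """
    keywords = item.setdefault("keywords", {})
    values = keywords.setdefault("nlp_tag__phrases", [])
    title = item.get("title")
    # The title is appended as-is, without any processing.
    candidates = list(phrases) + ([title] if title else [])
    for value in candidates:
        if value not in values:  # avoid duplicate facet values
            values.append(value)
    return item


item = {"title": "Quarterly Report", "body": "..."}
enrich(item, ["revenue growth", "operating margin"])
print(item["keywords"]["nlp_tag__phrases"])
```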

Application

...