...
The Pipelet is configurable within the pipeline Editor.
Input Fields
What fields should be considered for further analysis.
fields_to_consider :
Comma separated list of fields (default:title,body
)
PDF Approximation
To reduce processing time of big PDFs, consider only a subset of pages.
...
tag_phrases
: Enable / Disable key-phrase taggingtag_top_k_phrases
: Amount of phrases to tagtag_phrases
: Enable / Disable key-phrase taggingtag_top_k_phrases
: Amount of phrases to tagdynamic
: Total amount of phrases selected relative to document size (between 20 - 70)10
: Take N highest ranked phrases as specified
tag_topics
: Enable simple topic-tagging based on key-phrases
Enrichment
Key phrases are stored within the nlp_tag__phrases
facet.
The item’s Title
is also added as nlp_tag__phrases
facet value (as-is, without processing).
Application
Content based auto-completion (type-ahead)
Significant-terms aggregation on search results
...