Built-in enrichments are configured through the processing config. It can be used to enable or disable individual enrichments and to supply additional configuration for a step.

There are two places this configuration can be specified:

...

To set up a processing configuration, specify the processing field in a source's config. The value of that field is itself a dictionary, with the enrichment names as keys.

Enrichments which can be specified include:

Processing Step            Documentation Link
unshorten-link             Unshorten Link
deduplication              Duplicate Detection
content-augmentation       Content Augmentation
content-conversion         Content Conversion
language-detection         Language Detection
boilerplate-removal        Boilerplate Removal
nearduplicate-detection    Near-Duplicate Detection
webshot                    Webshot
filtering                  Filtering
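
The structure described above (a processing dictionary inside a source's config, keyed by enrichment name) can be sketched in Python as follows. This is only an illustration of the shape: the helper build_source_config and the "enabled" setting are assumptions made for this sketch and are not confirmed by this page; consult each step's documentation for the settings it actually accepts.

```python
# Enrichment names taken from the table above.
KNOWN_STEPS = {
    "unshorten-link",
    "deduplication",
    "content-augmentation",
    "content-conversion",
    "language-detection",
    "boilerplate-removal",
    "nearduplicate-detection",
    "webshot",
    "filtering",
}


def build_source_config(step_settings, base_config=None):
    """Return a source config dict with a 'processing' field.

    Hypothetical helper: `step_settings` maps enrichment names to
    per-step setting dictionaries. Unknown names are rejected so that
    typos surface early.
    """
    unknown = set(step_settings) - KNOWN_STEPS
    if unknown:
        raise ValueError("Unknown enrichment(s): " + ", ".join(sorted(unknown)))

    config = dict(base_config or {})
    # The processing field is a dictionary with enrichment names as keys.
    config["processing"] = {
        name: dict(settings) for name, settings in step_settings.items()
    }
    return config


# "enabled" is an assumed setting name, used here only for illustration.
source_config = build_source_config({"webshot": {"enabled": False}})
print(source_config)
```

Validating the keys against the list of known steps is a design choice of this sketch, not documented behavior; it simply keeps a misspelled enrichment name from being passed along unnoticed.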


For example, to set up a Twitter source with duplicate detection disabled, the following configuration would be used:

...