The content cleanup enrichment cleans up incoming text and removes potentially malicious content from the HTML body.
Table of Contents
Overview
The Content Standardization step is used to clean content as it comes in. From text fields, such as title
or summary
, any HTML tags are removed. From the HTML field body
potentially harmful tags and attributes are removed, such as script tags.
When dealing with data from untrusted sources, this step should always be used.
Configuration
This step does not take any configuration.