Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Info

As of Squirro 3.4.5, you can optimize the execution of multiple, similar ML workflows in the enrichment pipeline.

Overview

Machine Learning (ML) workflows, like the AI Studio ML models, consist of multiple steps from libNLP. The steps within a workflow typically include:

  • data loading

  • data cleaning

  • pre-processing

  • classification

  • generation of entities

In a pipeline with similar ML workflows, it is likely that identical steps are contained in multiple workflows. For example, the identical sentence splitting or PDF sentence tokenization step can appear in all or multiple ML workflows present in the pipeline.

To avoid multiple executions of identical steps and increase the performance of the processing pipeline, you can enable the optimization of ML workflow execution.

Configuration

In the Server space under Configuration set the machinelearning.optimize.workflows to true:

...

This page can now be found at ML Enrichments for Pipeline Workflows on the Squirro Docs site.