Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Squirro 3.7.0 was released .

Table of Contents

What’s New

  • You can now install and run docker on production instances.

  • The PDF-OCR step now includes optional Confidence scoring, which can be enabled at the cost of performance.

  • Added the option within the Data Loader to delete nargs entry (multi-value fields).

  • Added a new libNLP step to call spaCy running as a Squirro NLP service.

  • Created a Binary Documents pipeline for new projects.

  • Added a HFQuestionAnswering processor that can run Hugging Face question-answering pipelines for inference.

...

  • Improved the performance of PDF document ingestion for pipeline workflows which include ML models.

  • [Search] Productize search service. For now exposes endpoints to help debugging the output of the squirro-query-syntax parsing pipeline.

  • Added information about sender and recipients to the attachments items in the Exchange data loader plugin.

  • Added bulk_labeling functions to handle this types of operation.

  • React widgets will now fetch data only if they are in a visible dashboard section layer.

  • Moves all steps calling external services.

  • Added endpoint for creating bulk labeling.

  • Upgraded React to v18

  • Clicking on a community typeahead suggestion will now redirect the user to the selected community.

  • Implemented an NLP step for bulk labeling.

  • Added handling config for the bulk labeling step.

  • The “PDF Cannot Be Displayed” error message is more generic and tries better to show a working file link.

  • Increased the clickable area for the subscribe button in the Communities List widget.

  • Added creation date for proximity rule.

  • Added a link to the Squirro Monitoring space to the Squirro Spaces popover with current project being selected in the dashboard filter.

  • Squirro now uses tika-pdf-sentences by default for speedier PDF Sentences Tokenization.

  • Search Query-Processing: Analyzed query tokens that might contain valid sub-tokens are now additionally re-written to perform exact phrase matching. NewYork => ("NewYork"~0 OR NewYork). This enables sub-word matching on New, and York individually relying on the configured SearchAnalyzer (subword-delimiter) - but will additionally match the exact phrase NewYork as well.

  • Implemented a ML endpoint for creating ml job which automatically creates labels.

...