Squirro 3.7.0 was released .
What’s New
You can now install and run docker on production instances.
The PDF-OCR step now includes optional Confidence scoring, which can be enabled at the cost of performance.
Added the option within the Data Loader to delete nargs entry (multi-value fields).
Added a new libNLP step to call spaCy running as a Squirro NLP service.
Created a Binary Documents pipeline for new projects.
Added a HFQuestionAnswering processor that can run Hugging Face question-answering pipelines for inference.
Improvements
Improved the performance of PDF document ingestion for pipeline workflows which include ML models.
[Search] Productize search service. For now exposes endpoints to help debugging the output of the squirro-query-syntax parsing pipeline.
Added information about sender and recipients to the attachments items in the Exchange data loader plugin.
Added bulk_labeling functions to handle this types of operation.
React widgets will now fetch data only if they are in a visible dashboard section layer.
Moves all steps calling external services.
Added endpoint for creating bulk labeling.
Upgraded React to v18
Clicking on a community typeahead suggestion will now redirect the user to the selected community.
Implemented an NLP step for bulk labeling.
Added handling config for the bulk labeling step.
The “PDF Cannot Be Displayed” error message is more generic and tries better to show a working file link.
Increased the clickable area for the subscribe button in the Communities List widget.
Added creation date for proximity rule.
Added a link to the Squirro Monitoring space to the Squirro Spaces popover with current project being selected in the dashboard filter.
Squirro now uses tika-pdf-sentences by default for speedier PDF Sentences Tokenization.
Search Query-Processing: Analyzed query tokens that might contain valid sub-tokens are now additionally re-written to perform exact phrase matching. NewYork => ("NewYork"~0 OR NewYork). This enables sub-word matching on New, and York individually relying on the configured SearchAnalyzer (subword-delimiter) - but will additionally match the exact phrase NewYork as well.
Implemented a ML endpoint for creating ml job which will create automatically gt labels.
Bug Fixes
Monitoring Project: Improved explicit error handling when a new monitoring-project is created from template, but errors out because a monitoring project exists already (only one monitoring project per server allowed)
GlobalSearch disappeared after searching or dashboard switch
Fixes distorted thumbnail in Communities Item widget.
Fix model templates demo view not working
Similarly to the rerun from index, now rerun from raw data will not remove existing items from the index if a step fails during a rerun of those items. The items will be replaced in the index only when the rerun is successful, otherwise the existing ones will remain in the index.
Fixed several issues with entities and queries loading inside of the items widget side panel
Fix QA answer highlight in item detail overlay for multiple answers from same item
SQ-18648
Fixed issues with apiOverrides not working for custom React widgets
Fix ResetFilters widget: same selection added multiple times
Fixes a bug where the dashboard menu icon disappears after pressing exit while editing dashboards.
Add validation for pipelet's sections order in pipeline workflows
Fix wrong position of KEE pipelet in pipeline workflows
fix tika-server systemd service
Fixed a bug where project theme and guide are not fetched after importing a project.
Deleting a project will check if any other existing project uses any of the indices of the to-be-deleted project, and it will refrain from deleting any shared index in order not to break the existing project.
fix rare issue when dataloader plugins with the same file names got mixed up
Fixes a bug where autosize stops working when toggling edit mode.
Fix the error (psycopg2.DatabaseError) error with status PGRES_TUPLES_OK and no message from the libpq (appearing some times in ingester's logs when executing pipelet steps) in PostgreSQL installations.
Fixed Pie chart not reacting to config and cutom query changes
Fix for Squirro space button not shown after new project has been created
Fix Aggregated Communities missing query
Fixed a bug where the concept search params would not disappear from the url.
Fixed entities and matching queries not working inside communities headlines item detail
fixed entities list not re-rendered when unselecting the chip
fix PDF-OCR step failing on mime-type detection
Community List widget breaks after subscribing to more than 15 communities
Fix dashboard store to URL serialization for persistentLinks feature
Breaking Changes
webshot generated thumbnail file is not added to Item files list to ensure webshot removal when item gets deleted
Installation and Upgrade
You will have to resolve at least the following config files when upgrading from Squirro 3.3.0
/etc/nginx/conf.d/ssl.inc
/etc/squirro/common.ini
For new installations, please follow the Setup on Linux instructions.
To upgrade an existing installation, please consult the Upgrades for Squirro 3.5.3 and later guide.