Features
Install and run docker on production instances
Improve the performance of ingesting PDF documents with pipeline workflows which include ML models.
[Search] Productize search service. For now exposes endpoints to help debugging the output of the squirro-query-syntax parsing pipeline.
Data Loader: add option to delete nargs entry (multi value fields).
PDF-OCR step now includes optional Confidence scoring, which can be enabled at cost of performance.
Add information about recipients and sender to the attachments items in the Exchange dataloader plugin.
Add bulk_labeling functions to handle this types of operation
React widgets will fetch data only if they are in a visible dashboard section layer
Moves all steps calling external services
Add endpoint for creating bulk labeling
Adds a new libNLP step to call spaCy running as a Squirro NLP service.
Upgrade React to v18
Click on community typeahead suggestion will redirect user to the selected community
Implement NLP step for bulk labeling
Add handling config for the bulk labeling step
Create Binary Documents pipeline for new projects
adds HFQuestionAnswering processor that can run Hugging Face question-answering pipelines for inference
PDF Cannot be displayed error message is more generic and tries better to show a working file link
Increase clickable area for subscribe button in Communities List widget
Add creation date for proximity rule
Add link to Squirro Monitoring space to Squirro Spaces popover with current project being selected in the dashboard filter
use tika-pdf-sentences by default for speedier PDF Sentences Tokenization
[Search] Query-Processing: Analyzed query tokens that might contain valid sub-tokens are additionally re-written to perform exact phrase matching. NewYork => ("NewYork"~0 OR NewYork). This enables to perform sub-word matching on New, and York individually relying on the configured SearchAnalyzer (subword-delimiter) - but will additionally match the exact phrase NewYork as well
Implement ML endpoint for creating ml job which will create automatically gt labels
Bug Fixes
Monitoring Project: Improved explicit error handling when a new monitoring-project is created from template, but errors out because a monitoring project exists already (only one monitoring project per server allowed)
GlobalSearch disappeared after searching or dashboard switch
Fixes distorted thumbnail in Communities Item widget.
Fix model templates demo view not working
Similarly to the rerun from index, now rerun from raw data will not remove existing items from the index if a step fails during a rerun of those items. The items will be replaced in the index only when the rerun is successful, otherwise the existing ones will remain in the index.
Fixed several issues with entities and queries loading inside of the items widget side panel
Fix QA answer highlight in item detail overlay for multiple answers from same item
SQ-18648
Fixed issues with apiOverrides not working for custom React widgets
Fix ResetFilters widget: same selection added multiple times
Fixes a bug where the dashboard menu icon disappears after pressing exit while editing dashboards.
Add validation for pipelet's sections order in pipeline workflows
Fix wrong position of KEE pipelet in pipeline workflows
fix tika-server systemd service
Fixed a bug where project theme and guide are not fetched after importing a project.
Deleting a project will check if any other existing project uses any of the indices of the to-be-deleted project, and it will refrain from deleting any shared index in order not to break the existing project.
fix rare issue when dataloader plugins with the same file names got mixed up
Fixes a bug where autosize stops working when toggling edit mode.
Fix the error (psycopg2.DatabaseError) error with status PGRES_TUPLES_OK and no message from the libpq (appearing some times in ingester's logs when executing pipelet steps) in PostgreSQL installations.
Fixed Pie chart not reacting to config and cutom query changes
Fix for Squirro space button not shown after new project has been created
Fix Aggregated Communities missing query
Fixed a bug where the concept search params would not disappear from the url.
Fixed entities and matching queries not working inside communities headlines item detail
fixed entities list not re-rendered when unselecting the chip
fix PDF-OCR step failing on mime-type detection
Community List widget breaks after subscribing to more than 15 communities
Fix dashboard store to URL serialization for persistentLinks feature
Breaking Changes
webshot generated thumbnail file is not added to Item files list to ensure webshot removal when item gets deleted
Installation and Upgrade
You will have to resolve at least the following config files when upgrading from Squirro 3.3.0
/etc/nginx/conf.d/ssl.inc
/etc/squirro/common.ini
For new installations, please follow the Setup on Linux instructions.
To upgrade an existing installation, please consult the Upgrades for Squirro 3.5.3 and later guide.