Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

When indexing data from other data sources into Squirro, the data is transformed into the Squirro item format. In this process keywords are used to add structured and semi-structured information to the items. ExcerptThis section talks about the considerations when thinking of the keywords to use on Squirro items.

...

page

...

Table of Contents

Table of Contents
outlinetrue
excludeTable of Contents

Considerations

Performance

Performance considerations come into play when working with facets. Every facet that's maintained adds a bit of overhead, especially memory consumption. When returning the facet selection list in the search screen, Squirro and the underlying Elasticsearch server, need to look at every result and count the occurrences of each facet value.

...

To improve performance, the following changes can be made:

...

now

...

Completeness

It's tempting to initially import any and every field from the source data and add them as a facet. While this is often a sensible approach in a PoC or exploratory phase, for production this should be avoided.

Only facets should be imported that are actually used in dashboards, filtering and search. If a facet needs to be added at a later stage that is not a problem and can always be done.

Usability

Generally it's best to use facet names internally that don't have spaces and are lower case. The display name is then used to give them a user-friendly name.

Also group facets into user-friendly groups and make use of the provided formatting options, such as date format.be found at Data Modeling on the Squirro Docs site.