Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

The convert.ini config file, located at /etc/squirro/convert.ini configures the content conversion.

Table of Contents

Table of Contents
outlinetrue
excludeTable of Contents

Memory Limit

By default, Squirro uses an auto-spawned Tika server under Java VM to do content conversion. The memory is limited to avoid running into out-of-memory exceptions when converting. For large documents the default setting may be too low. When this happens, there will be aborted items in the index with SQ-05105 processing errors.

In the apache-tike section use the vmargs option.

...

Key

...

Usage

...

Default

...

Example

...

vmargs

...

Java VM parameters. See Oracle's Java HotSpot VM Options reference for the options.

...

-Xmx512M,-Xms64M

...

Code Block
[apache-tika]
# comma-separated list of additional Java Virtual Machine command-line options
# to use
vmargs = -Xmx512M,-Xms64M

Using External Tika Service

The Tika web service can also be manually set up if more fain grain control over its config is needed.

Then change the convert.ini config file to define the tika-url.

...

Key

...

Usage

...

Default

...

Example

...

tika-url

...

Point to a Tika server, instead of using an embedded Tika Java VM.

...

-

...

This page can now be found at convert.ini on the Squirro Docs site.