The convert.ini
config file, located at /etc/squirro/convert.ini
configures the content conversion.
Table of Contents
Memory Limit
By default, Squirro uses an auto-spawned Tika server under Java VM to do content conversion. The memory is limited to avoid running into out-of-memory exceptions when converting. For large documents the default setting may be too low. When this happens, there will be aborted items in the index with SQ-05105 processing errors.
In the apache-tike
section use the vmargs
option.
Key | Usage | Default | Example |
---|---|---|---|
vmargs | Java VM parameters. See Oracle's Java HotSpot VM Options reference for the options. |
| [apache-tika] # comma-separated list of additional Java Virtual Machine command-line options # to use vmargs = -Xmx512M,-Xms64M |
Using External Tika Service
The Tika web service can also be manually set up if more fain grain control over its config is needed.
Then change the convert.ini
config file to define the tika-url
.
Key | Usage | Default | Example |
---|---|---|---|
tika-url | Point to a Tika server, instead of using an embedded Tika Java VM. | - | [apache-tika] tika-url = http://localhost:9998 |