Introduction
In this brief tutorial, we will go through the process of setting up, configuring, and deploying a simple Known Entity Extraction (KEE) project into a Squirro project.
Each KEE project is a tool which analyzes all incoming documents to a Squirro project, and identifies each instance of a known entity (such as a company, product, person, etc.) by tagging the document that contains the known entity with a specific metadata tag. Once known entities are identified, documents can easily be filtered and grouped within a Squirro project based on the known entities that they contain.
A more detailed overview of KEE as a whole can be found here: LINK
A simple example
As an easy example, let's take a CSV file that includes a list of salespeople employed by a company.
For each salesperson, their name, email address, and manager are provided. The basic layout of the CSV file is shown below:
id, name, email, position, manager 1, John Smith, jsmith@company.com, District Manager, David Cole 2, Jane Doe, jdoe@company.com, District Manager, David Cole 3, Adrian Fox, afox@company.com, District Representative, Jane Doe ...