Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


Excerpt

Combined time and relevant sorting is the "magic solution" to ranking results which we believe will deliver the best results, especially for projects where the most recent items are usually more important than older inforrmation


Sorting by time

...

and relevance

The results of a given query are sorted by relevance score. Relevance score is computed based on some factors like term frequency, document frequency, where the querying terms matched in the document (body, title, summary...). The limitation of sorting purely by relevance is it ignores the important of up-to-date content such as news articles.

Using the combination of time and relevance in your project definition, Squirro will also sort the result list by using the created_at field of each item i.e. the most recent documents are put first in the result list. The limitation of of sorting purely by time field is that the most recent documents are often poor matches for the original query.

...

  • relevance_score: original relevance score of document wrt the query
  • base: parameter use to adjust the impact of relevance. Higher this value, higher impact of relevance score (if you set this value very high, e.g 100, you can see impact of time factor is very minor, relevance score contributes mostly in final score)
  • range: range of decay, higher this value, longer decay range (if range = 0 you will see final score depends only on relevance score)
  • decay: decay speed, higher this value, quicker decay value (if decay = 0 you will see final score depends only on relevance score)
  • age_days: number of days from now since created_at time of documents.

...