Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Processing config documentation improvement

...

FieldDescription
verticalSet to News.
query

The query to search on Bing. The query can contain any valid query text that the Bing Engine supports.

market

The news market in which to find news. Example markets:

Market IDLanguageCountry/Region
de-DEGermanGermany
de-CHGermanSwitzerland
en-UKEnglishUnited Kingdom
en-USEnglishUnited States

For the full list of available markets refer to the Bing API Schema Guide.

news_category

Limit search results to a category. This is optional.

Available categories:

  • default
  • rt_Business
  • rt_Entertainment
  • rt_Health
  • rt_Politics
  • rt_Sports
  • rt_US
  • rt_World
  • rt_ScienceAndTechnology
processing

default:

 

Code Block
languagebash
processing_config = {
    'content-augmentation': {
        'enabled': True,
        'fetch_link_content': True,
    },
}

A maximum number of 15 results can be retrieved via the Bing API. 

...

processing_config = {     'content-augmentation': {         'enabled': True,         'fetch_link_content': False,     }, }

FieldDescription
verticalSet to Web.
query

The query to search on Bing. The query can contain any valid query text that the Bing Engine supports.

market

The web market in which to find results. Optional.

processing

default:

Code Block
languagebash

Configuration Example

This is an example configuration for creating a news tracking configuration.

...

Code Block
python
python
client = SquirroClient(None, None, cluster='https://next.squirro.net/')
client.authenticate(refresh_token='293d…a13b')
client.new_subscription(project_id, object_id='default', provider='bing',
    config={'vertical': 'News', 'query': 'Squirro', 'market': 'en-US'})

Item Format

...

No special keywords are added to items created by this provider.

Item Filtering

For results in the News vertical the Bing provider fetches the story from its original site. For this, the story is retrieved and any ads, side columns, etc. are removed (see Boilerplate Removal).

After this boilerplate removal the news story is then compared against the original query. If at this point it doesn't match the query anymore, the story is discarded. This additional step avoids Bing's many false positives for news search. This can result in fewer stories being delivered to Squirro than you would see in Bing's search engine directly.

To enable content fetching for web search, set the Content Augmentation enrichment's fetch_link_content and enable the Noise Removal removal enrichment (see Built-in enrichments for details).

Licensing

The Bing provider needs to be licensed separately. Because this is a polling provider, the cost have to be considered based on number of configurations and the retry interval.