Bing Provider

The bing provider uses APIs from the Bing search engine to gather news, web and video search results.

Provider namebing
TypePolling provider

Table of Contents

Configuration

The bing provider supports three verticals: Web, News and Video. The configuration is different for each one.

News

FieldDescription
verticalSet to News.
query

The query to search on Bing. The query can contain any valid query text that the Bing Engine supports.

market

The news market in which to find news. Example markets:

Market IDLanguageCountry/Region
de-DEGermanGermany
de-CHGermanSwitzerland
en-UKEnglishUnited Kingdom
en-USEnglishUnited States

For the full list of available markets refer to the Bing API Schema Guide.

news_category

Limit search results to a category. This is optional.

Available categories:

  • default
  • rt_Business
  • rt_Entertainment
  • rt_Health
  • rt_Politics
  • rt_Sports
  • rt_US
  • rt_World
  • rt_ScienceAndTechnology

A maximum number of 15 results can be retrieved via the Bing API. 

Video

FieldDescription
verticalSet to Video.
query

The query to search on Bing. The query can contain any valid query text that the Bing Engine supports.

market

The video market in which to find results. Optional.

Web

Web search results can also be added to Squirro with the bing provider.  But unlike the News and Video verticals, Bing's Web vertical doesn't have support for ordering results by date. As a result Squirro fetches the best 50 results (based on Bing's result score; max 50 results are allowed by the Bing API for retrieval) and checks if that result set contains any previously unseen links. If so, those new links are added to Squirro as items.

FieldDescription
verticalSet to Web.
query

The query to search on Bing. The query can contain any valid query text that the Bing Engine supports.

market

The web market in which to find results. Optional.

Configuration Example

This is an example configuration for creating a news tracking configuration.

{
    "vertical": "News",
    "query": "Squirro",
    "market": "en-US"
}

Using the Python SDK a subscription for this could be created with the following code snippet:

client = SquirroClient(None, None, cluster='https://next.squirro.net/')
client.authenticate(refresh_token='293d…a13b')
client.new_subscription(project_id, object_id='default', provider='bing',
    config={'vertical': 'News', 'query': 'Squirro', 'market': 'en-US'})

Item Format

No special keywords are added to items created by this provider.

Item Filtering

For results in the News vertical the Bing provider fetches the story from its original site. For this, the story is retrieved and any ads, side columns, etc. are removed.

After this boilerplate removal the news story is then compared against the original query. If at this point it doesn't match the query anymore, the story is discarded. This additional step avoids Bing's many false positives for news search. This can result in fewer stories being delivered to Squirro than you would see in Bing's search engine directly.

To enable content fetching for web search, set the Content Augmentation enrichment's fetch_link_content and enable the Noise Removal removal enrichment (see Built-in enrichments for details).

Licensing

The Bing provider needs to be licensed separately. Because this is a polling provider, the cost have to be considered based on number of configurations and the retry interval.