THIS IS ARCHIVED DOCUMENTATION

Configuring an External Source

It’s possible to index external content in your Sitecore instance using Coveo Enterprise Search (CES). This section explains how you can use crawlers to index an external source in CES.

In this example, the Web Pages crawler is used to index some pages from the Sitecore Experience Platform website.

  1. Open the CES Administration Tool.

  2. Under Index > Sources and Collections, in the Collections section on the left, click Add to add a new collection.

  3. Name your collection, and then click Save. For this example, you can name it External Content.

  4. Under Sources, click Add to add a source to your new collection.

  5. Set the parameters of your external source.

    1. Name: Give your source a name. For this example, use Sitecore Experience Platform.

    2. Source Type: Select Web Pages.

    3. Addresses: Type the URL from which to start crawling. For this example, you want to index the Sitecore Experience Platform section of the site, so enter the following address.

      http://www.sitecore.net/en/products/sitecore-experience-platform/

      You can specify more than one starting address per source. In that case, enter only one address per line.

  6. Click Save. Don’t start building your source yet; you still need to check its filters and permissions.

  7. Verify that your source includes all pages under your main URL.

    1. Select the Filters tab.

    2. Under Filters, make sure that you see the following inclusion filter.

      http://www.sitecore.net/en/products/sitecore-experience-platform/*

  8. Select the Permissions tab.

  9. Under Custom Permissions, next to Allowed Users, click Add.

  10. Enter the following information to allow the content to be browsed in your Sitecore instance.

    1. Security Provider: Choose your Sitecore instance.

    2. Type: Choose Group.

    3. Name: Enter Everyone.

  11. Click Add, then click Apply Changes at the bottom of the page, and then click Start at the top to build the source.

  12. Once your source has finished building, go to Content > Index Browser, and validate that your documents are properly indexed.
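
The inclusion filter checked in step 7 ends with a wildcard. As a rough illustration of what such a filter lets through, here is a minimal Python sketch. It assumes that `*` matches any run of characters; the product's actual matching rules may differ. The `matches_filter` helper and the two sample paths marked in the comments are hypothetical and are not part of CES.

```python
import re


def matches_filter(url: str, pattern: str) -> bool:
    """Return True if url matches pattern, where '*' stands for any run of characters.

    Illustration only: this is not the matcher the indexing product uses.
    """
    # Escape the literal parts and join them with '.*' so that only '*' is a wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.fullmatch(regex, url) is not None


INCLUSION_FILTER = "http://www.sitecore.net/en/products/sitecore-experience-platform/*"

# The first URL is the starting address from the steps above; the other two
# paths are made up for the example.
CANDIDATES = [
    "http://www.sitecore.net/en/products/sitecore-experience-platform/",
    "http://www.sitecore.net/en/products/sitecore-experience-platform/example-page",
    "http://www.sitecore.net/en/example-other-section/",
]

if __name__ == "__main__":
    for url in CANDIDATES:
        verdict = "included" if matches_filter(url, INCLUSION_FILTER) else "excluded"
        print(f"{verdict}: {url}")
```

Under that assumption, the starting address and anything beneath it are included, while pages in other sections of the site are left out.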

It’s possible to index external content in your Sitecore instance using the Coveo Platform. This section explains how you can use crawlers to index an external source in the Cloud Platform itself.

In this example, the Web Pages crawler is used to index some pages from the Sitecore Experience Platform website.

  1. In the Sitecore Control Panel, under Coveo Search, click Cloud admin UI.

  2. In Content > Sources, click Add Source.

  3. Choose the type of source you want to index. For this example, use the Web source.

  4. Enter the information of the source you want to add.

    1. Source Name: Type in a source name. For this example, you can use the name Sitecore Experience Platform.

    2. Site URL: Type the URL of the site you want to index. You can add several URLs to be crawled under the same Source Name. For this example, enter the address of the Sitecore Experience Platform section. Your items will be created from links on this page.

      http://www.sitecore.net/en/products/sitecore-experience-platform/

      You can configure more settings. For the full list and an explanation of each one, see Add or edit a Web source.

    3. Inclusion filters: Because you want to limit indexing to a specific section of the site, add the following inclusion filter.

      http://www.sitecore.net/en/products/sitecore-experience-platform/*

  5. Click Add and Build. The dialog box should close, and you should see your Web source being created.

  6. Once the source has been properly built, validate that the documents are indexed by clicking Content > Content Browser.
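
In the steps above, the inclusion filter is the Site URL with a trailing `*`. If you plan to index several sections, it can help to write the values down before typing them into the dialog. The following Python sketch does that under stated assumptions: `WebSourceSettings` and `section_filter` are hypothetical names used only for this example, and nothing here calls a Coveo API or mirrors its data model.

```python
from dataclasses import dataclass
from typing import List


def section_filter(section_url: str) -> str:
    """Build a wildcard filter covering every URL under section_url."""
    # Normalise to exactly one trailing slash, then append the wildcard.
    return section_url.rstrip("/") + "/*"


@dataclass
class WebSourceSettings:
    """Hypothetical container for the values typed into the Add Source dialog."""
    source_name: str
    site_urls: List[str]

    def inclusion_filters(self) -> List[str]:
        # One filter per site URL, in the same order.
        return [section_filter(url) for url in self.site_urls]


settings = WebSourceSettings(
    source_name="Sitecore Experience Platform",
    site_urls=["http://www.sitecore.net/en/products/sitecore-experience-platform/"],
)

if __name__ == "__main__":
    print("Source Name:", settings.source_name)
    for url, flt in zip(settings.site_urls, settings.inclusion_filters()):
        print("Site URL:", url)
        print("Inclusion filter:", flt)
```

For the single Site URL used in this example, the derived filter is the same string entered in the Inclusion filters field.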

What’s Next?

You can now proceed to Enabling the External Source in Sitecore.