Configure crawl settings for a Cornerstone external content connector
Specify the organization units you want your Cornerstone external content connector to crawl. Define inclusion or exclusion filters to dictate the types of content the crawl retrieves and feeds to AI Search for indexing.
Vorbereitungen
A connector admin must have already created the Cornerstone external content connector that you want to configure crawl settings for. To learn about this procedure, see Create a Cornerstone external content connector.
Role required: sn_ext_conn.xcc_admin
Warum und wann dieser Vorgang ausgeführt wird
- Inclusion or exclusion filters for the organization units to retrieve content from when running content crawls
- Inclusion or exclusion filters for the types of Catalogue and Learning objects to retrieve content from when running content crawls
Content is only retrieved from the source system if it passes all of your configured crawl setting filters. If any crawl setting filter excludes a content item, the external content connector doesn't retrieve it.
By default, each external content connector can index up to oneten million (1,000,00010,000,000) content items from its source system. When a connector exceeds this limit, it continues to crawl the source system, but only sends content item deletions and updates to AI Search for indexing, ignoring new content items. The connector logs an error message for every 10,000 content items it crawls beyond the indexing limit.
When a connector's indexed content item count exceeds 800,000, a warning message appears in the connector's UI to indicate that it's approaching the indexing limit. If the connector reaches the indexing limit, an error message appears in its UI.
External content connectors that support user permissions crawls can retrieve up to five hundred thousand (500,000) users.
If one of your connectors reaches the content indexing limit, you can update its crawl settings and file inclusion/exclusion filters to reduce the number of content items it retrieves. Alternately, if you need a connector to index more than 1,000,00010,000,000 content items or to retrieve more than 500,000 users, you can create a Customer Service and Support case at https://support.servicenow.com/now to request a limit increase for the connector.
Prozedur
- In the Connectors list, select the record for the Cornerstone external content connector whose settings you want to modify.
- In the connector editor's Settings tab, select Crawl settings.
-
Select one of the following Organization units options:
- To crawl all organization units from the source system, select Crawl all organization units.
-
To crawl only a specified set of organization units from the source system, select Include only these organization units, then use the Add organization units to include field and Add button to enter names for organization units you want the connector to include when crawling.
As an example, you might enter Marketing and Sales to only retrieve searchable content from the specified organization units.
-
To crawl all but a specified set of organization units from the source system, select Exclude only these organization units, then use the Add organization units to exclude field and Add button to enter names for organization units you want the connector to exclude when crawling.
As an example, you might enter Internal to exclude searchable content from the specified organization unit.
- In the Catalog/trainings and User/learner sections, select the options for the Cornerstone LMS content types that you want the connector to retrieve when running content crawls.
- Wahlweise: In the Captions section, select the Multimodal captions option if you want AI Search to [[[ automatically generate captions for images retrieved in content crawls ]]].
- Select Save and validate.
Ergebnisse
The Cornerstone external content connector is updated with your modified crawl settings.
Nächste Maßnahme
To retrieve content from your Cornerstone source system using your modified crawl settings, create and run a one-time content crawl for your Cornerstone external content connector. To learn about creating and running one-time content crawls, see Create a content crawl for an external content connector.