Configure crawl settings for the Google Drive external content connector
Specify the shared drives you want your Google Drive external content connector to crawl. Define inclusion or exclusion filters for file extensions to dictate the types of documents the crawl retrieves and feeds to AI Search for indexing.
Before you begin
Role required: ais_admin
About this task
- Crawl only a specified set of shared drives from the source system
- Include only documents with specific file extensions when crawling the source system
- Exclude documents with specific file extensions when crawling the source system
By default, an external content connector can index up to one million (1,000,000) documents from its source system. When a connector exceeds this limit, it continues to crawl the source system, but only sends document deletions and updates to AI Search for indexing, ignoring new documents. The connector logs an error message for every 10,000 documents it crawls beyond the indexing limit.
When a connector's indexed document count exceeds 800,000, a warning message appears in the connector's UI to indicate that it's approaching the indexing limit. If the connector reaches the indexing limit, an error message appears in its UI.
If one of your connectors reaches the indexing limit, you can update its crawl settings and file inclusion/exclusion filters to reduce the number of documents it retrieves. Alternately, if you need to index more than 1,000,000 documents, you can create a Customer Service and Support case at https://support.servicenow.com/now to request a limit increase for the connector.
Procedure
Result
The Google Drive external content connector is updated with your crawl scope and file extension filter settings.
What to do next
Now that you've configured the crawl for your Google Drive external content connector, you can schedule crawls to run on a recurring basis, or you can run one-time crawls on demand. For details on scheduling crawls, see Define a crawl schedule for an external content connector. To learn how to run one-time crawls on demand, see Run a one-time full or partial document crawl for an external content connector and Run a one-time user mapping crawl for an external content connector.