Configure crawl settings for the Google Drive external content connector

  • Release version: Xanadu
  • Updated April 16, 2025
  • 3 minutes to read
  • Specify the shared drives you want your Google Drive external content connector to crawl. Define inclusion or exclusion filters for file extensions to dictate the types of documents the crawl retrieves and feeds to AI Search for indexing.

    Before you begin

    Role required: ais_admin

    About this task

    This task is optional. By default, the Google Drive external content connector crawls all shared drives from its specified source system and sends documents with all supported file extensions to AI Search for indexing. You only need to perform this task if you want the connector to use any of the following non-default settings:
    • Crawl only a specified set of shared drives from the source system
    • Include only documents with specific file extensions when crawling the source system
    • Exclude documents with specific file extensions when crawling the source system
    Important:

    By default, an external content connector can index up to one million (1,000,000) documents from its source system. When a connector exceeds this limit, it continues to crawl the source system, but only sends document deletions and updates to AI Search for indexing, ignoring new documents. The connector logs an error message for every 10,000 documents it crawls beyond the indexing limit.

    When a connector's indexed document count exceeds 800,000, a warning message appears in the connector's UI to indicate that it's approaching the indexing limit. If the connector reaches the indexing limit, an error message appears in its UI.

    If one of your connectors reaches the indexing limit, you can update its crawl settings and file inclusion/exclusion filters to reduce the number of documents it retrieves. Alternately, if you need to index more than 1,000,000 documents, you can create a Customer Service and Support case at https://support.servicenow.com/now to request a limit increase for the connector.

    Procedure

    1. Navigate to All > External Content Connectors > External Content Admin Home.
    2. In the Connectors list, select the record for the Google Drive external content connector whose settings you want to modify.
    3. In the connector editor's Settings tab, select Crawl settings.
    4. Specify the connector's crawl scope by selecting one of the following Drives to crawl options:
      • To crawl all shared drives from the source system, select All, then optionally select Exclude drives and use the Add URL field and Add button to enter URLs for any shared drives that you want to exclude from the crawl.
      • To crawl only a specified set of shared drives from the source system, select Specify, then use the Add URL field and Add button to enter URLs for the shared drives that you want to include in the crawl.
      Shared drive URLs have the format https://drive.google.com/drive/folders/<shared-drive-ID>, where <shared-drive-ID> is the identifier for the shared drive.
      Note:
      The Google Drive external content connector only crawls shared drives from your source system. It doesn't crawl user drives from your source system.
    5. Optional: To apply inclusion or exclusion filters to a crawl based on file extensions, perform the following steps:
      1. Select Filter by file extension.
      2. To specify the type of filter, select Include or Exclude.
        Select Include if you want the crawl to only retrieve documents that have one of the specified file extensions. Select Exclude if you want the crawl to retrieve all documents except those that have one of the specified file extensions.
      3. In the File extension field, select the file extensions that you want to include or exclude.
        For details on the supported file extensions, see File extensions supported in External Content Connectors.
    6. Select Save.

    Result

    The Google Drive external content connector is updated with your crawl scope and file extension filter settings.

    What to do next

    Now that you've configured the crawl for your Google Drive external content connector, you can schedule crawls to run on a recurring basis, or you can run one-time crawls on demand. For details on scheduling crawls, see Define a crawl schedule for an external content connector. To learn how to run one-time crawls on demand, see Run a one-time full or partial document crawl for an external content connector and Run a one-time user mapping crawl for an external content connector.