Train the similarity solution for Enterprise Architecture to categorize applications while registering

  • Release version: Xanadu
  • Updated August 1, 2024
  • 2 minutes to read
  • Train the business application similarity definition included within the Predictive Intelligence for Enterprise Architecture to suggest a category for a business application when it is being registered or on-boarded.

    Before you begin

    Ensure that the Enterprise Architecture – Predictive Intelligence plugin (com.snc.apm.predictive_intelligence) is activated.

    The Business Application Similarity solution uses textual similarity to compare business application records. It analyzes the Name and Description fields of existing business applications and compares them with new or updated records. During training, the system builds a model using words and phrases from existing application records. When a new business application is registered or on‑boarded, the system compares its text against previously trained records to identify applications with similar terminology and context. If the new application closely matches existing applications, the system suggests the most common category used by those similar applications.

    Role required: ml_admin

    Procedure

    1. Navigate to All > Predictive Intelligence > Similarity > Solution Definitions.
    2. In the Similarity Definitions [ML view], click the Business Application Similarity (ml_sn_sn_apm_ml_global_ba_similarity) label.
    3. On the Similarity Definition Business Application Similarity [ML view] form, verify the default values for business application similarity.

      For more information on the Similarity Definition form fields, see Create and train a similarity solution.

      Note:
      Set the application scope to Enterprise Architecture – Predictive Intelligence to edit the form. Click the word here at the end of the warning message that appears.
      Table 1. Similarity Definition form
      Field Definition
      Label Unique name for your similarity definition.
      Word Corpus Collection of words and phrases related to the name and description of the business application that functions as the vocabulary the system uses to compare your instance records based on their textual similarity.
      Processing Language Dominant language of the dataset that you are training on the solution definition. If the dataset language is Italian, choose Italian.
      Note:
      English processing is applied to all datasets by default.
      Stopwords Existing word corpus that is relevant to your solution. You can also add stopwords to the list, for example, words like Application.
      Training Frequency Option to retrain from once daily or every 30 days in three months increments up to 180 days.
      Update Frequency Frequency at which you want to refresh the data you use to retrieve your similarity results.
    4. Click Update & Retrain.

    What to do next

    You can create a similarity solution with words and phrases related to the name and description of the business application that triggers a prediction. You can also set a training frequency for your machine-learning solution to collect and compare existing records with new records for a similarity definition.

    Use the similarity solution to categorize an application while it is on-boarded.