Configure prompt injection attack protection

  • Release version: Yokohama
  • Updated July 31, 2025
  • 1 minute to read
  • Activate or deactivate prompt injection attack detection to protect all generative AI applications and AI-generated text and conversations.

    Before you begin

    Role required: sn_generative_ai.nsa_admin

    About this task

    Prompt injection attacks are a type of cybersecurity attack where someone tries to override the initial instructions of an LLM to cause unintended behaviors. Now Assist Guardian detect and log these prompt injection attack attempts across all generative AI applications and features. You can also configure the prompt injection detection guardrail to block the AI-generated response when an attack is detected in addition to logging it.

    You can export logs for review. For more information, see Export Now Assist Guardian logs.

    Procedure

    1. Navigate to All > Now Assist Admin > Settings.
    2. In the side panel, go to Now Assist Guardian > Prompt Injection.
    3. Select the toggle to activate or deactivate prompt injection attack detection.
    4. Optional: Under Detection impact, select the options icon (Options icon.) and then select Edit to change how detected attacks are handled.

      You can choose whether prompt injection attacks are blocked as well as logged.

      Prompt injection protection detection impact selection card with an option to choose the detection impact.

    Result

    Prompt injection detection is configured on your instance. When enabled, you see a standard error message when an attack is detected.