Configure prompt injection attack protection

Yokohama Enable AI

Release

yokohama

ft:locale

en-US

ft:publication_title

Yokohama Enable AI

ft:clusterId

platai

bundleId

platai

workflow

Platform

Configure prompt injection attack protection

Release version: Yokohama

Updated July 31, 2025

1 minute to read

Activate or deactivate prompt injection attack detection to protect all generative AI applications and AI-generated text and conversations.

Before you begin

Role required: sn_generative_ai.nsa_admin

About this task

Prompt injection attacks are a type of cybersecurity attack where someone tries to override the initial instructions of an LLM to cause unintended behaviors. Now Assist Guardian detect and log these prompt injection attack attempts across all generative AI applications and features. You can also configure the prompt injection detection guardrail to block the AI-generated response when an attack is detected in addition to logging it.

You can export logs for review. For more information, see Export Now Assist Guardian logs.

Procedure

Navigate to All > Now Assist Admin > Settings.
In the side panel, go to Now Assist Guardian > Prompt Injection.
Select the toggle to activate or deactivate prompt injection attack detection.
Optional: Under Detection impact, select the options icon () and then select Edit to change how detected attacks are handled.

You can choose whether prompt injection attacks are blocked as well as logged.

Result

Prompt injection detection is configured on your instance. When enabled, you see a standard error message when an attack is detected.