Why the same input passes Guardrails in Agent Builder but fails in ChatKit

Sercan_Socialityio · February 12, 2026, 12:40pm

Hi everyone,

We’re seeing inconsistent Guardrails behavior between Agent Builder (workflow testing mode) and our ChatKit integration.

When we provide the exact same user input, it passes Guardrails in Agent Builder but fails in ChatKit, even though:

The Guardrails configuration is the same.
The workflow logic is the same.
The input string is identical.

This makes us suspect that Guardrails may be evaluated under different conditions depending on the environment.

Some questions:

Does ChatKit inject additional system context before Guardrails evaluation?
Are Guardrails run on the raw user input in Agent Builder but on the full assembled prompt in ChatKit?
Is there a difference in execution order between environments?
Could model configuration or safety layers differ implicitly?
Is there any way to inspect Guardrails evaluation logs?
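
Until evaluation logs are available, one way to rule out a difference on the client side would be to fingerprint the text captured in each environment and compare the results. The sketch below is only an illustration: `fingerprint`, `same_input`, and the sample strings are hypothetical names and values, it uses only the Python standard library, and it does not call any Agent Builder or ChatKit API. It assumes the text that reaches each path can be captured somehow (for example, from application logs).

```python
import hashlib
import unicodedata


def fingerprint(text: str) -> dict:
    """Summarize a string so two captures can be compared exactly."""
    raw = text.encode("utf-8")
    return {
        # Any byte-level difference changes the hash.
        "sha256": hashlib.sha256(raw).hexdigest(),
        "length": len(text),
        # True if the text is not already in Unicode NFC form.
        "nfc_changed": unicodedata.normalize("NFC", text) != text,
        # Format/control characters (zero-width space, newline, ...) that
        # are easy to miss when eyeballing the string in a UI.
        "invisible": [
            f"U+{ord(c):04X}"
            for c in text
            if unicodedata.category(c) in ("Cf", "Cc")
        ],
    }


def same_input(a: str, b: str) -> bool:
    """Return True only if both captures are identical at the byte level."""
    return fingerprint(a) == fingerprint(b)


# Hypothetical captures: the second carries a trailing zero-width space.
builder_input = "Tell me about your pricing"
chatkit_input = "Tell me about your pricing\u200b"

print(same_input(builder_input, chatkit_input))
print(fingerprint(chatkit_input)["invisible"])
```

If the fingerprints match, the difference is more likely in what surrounds the input (added context, execution order, model configuration) than in the input string itself.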

We’re trying to understand whether this is expected behavior or a configuration issue on our side.

Any insights would be appreciated.
