Client-side secret redaction for LLM prompts (LeakGuard MVP)

petritbahtiri24 · April 22, 2026, 1:25pm

I’ve been working on a Chrome extension that acts as a client-side privacy layer for LLM usage.

The idea:
Detect likely secrets in the prompt before it’s sent, replace them with local placeholders (e.g. [PWM_1]), and ensure only redacted data leaves the browser.

What’s currently working:

deterministic mapping (same secret → same placeholder)
idempotent behavior (already-redacted input stays unchanged)
mixed input handling (raw + placeholder in same prompt)
detection of common patterns (API keys, tokens, JWTs, connection strings, etc.)
verified via DevTools that outbound payloads contain only placeholders

This is not meant to be “perfect security,” but a safety layer to reduce accidental leakage during day-to-day LLM usage.

What I’m looking for:

where would you try to break this?
what edge cases am I missing?
how would you approach unknown secret detection (entropy vs context)?

Repo: you can find it in github with name petritbahtiri123/LeakGuard

Topic		Replies	Views
Prompt Enhancing and managing chrome extension Prompting prompt , prompt-engineering , prompting	0	924	November 1, 2024
Unveiling Hidden Instructions in Chatbots Bugs bug , risks	19	11834	December 28, 2025
Security around client data Community	3	887	July 30, 2021
Challenge: Hack this prompt! API	15	6043	December 28, 2025
Chrome extension and API Key Security API	1	688	December 9, 2024

Client-side secret redaction for LLM prompts (LeakGuard MVP)

Related topics