OpenAI Privacy Filter for detecting and masking PII in text

vb · April 22, 2026, 5:39pm

OpenAI has published Privacy Filter, a small model for detecting and masking personally identifiable information (PII) in text. It runs locally, supports a 128k-token context window, and comes with tools for redaction, evaluation, and fine-tuning. It looks especially useful for teams that need fast, on-premises privacy filtering with control over the precision/recall tradeoff.

Highlights:

Permissive Apache 2.0 license: ideal for experimentation, customization, and commercial deployment.
Small size: runs in a web browser or on a laptop, with 1.5B total parameters and 50M active parameters.
Fine-tunable: adapt the model to specific data distributions through easy, data-efficient fine-tuning.
Long context: a 128,000-token context window enables processing long text with high throughput and no chunking.
Runtime control: configure precision/recall tradeoffs and detected span lengths through preset operating points.
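
To make the masking and precision/recall idea concrete, here is a minimal, generic sketch in plain Python. It does not call Privacy Filter and is not its API: the `Span` type, the label names, the scores, and the `threshold` parameter are all hypothetical stand-ins for whatever a detector returns. The threshold is only an analogy for an operating point: raising it masks fewer, more certain spans (higher precision), and lowering it masks more (higher recall).

```python
from typing import NamedTuple


class Span(NamedTuple):
    start: int    # inclusive character offset
    end: int      # exclusive character offset
    label: str    # hypothetical label name, e.g. "EMAIL"
    score: float  # hypothetical detector confidence in [0, 1]


def mask(text: str, spans: list[Span], threshold: float = 0.5) -> str:
    """Replace each span scoring at least `threshold` with a [LABEL] placeholder."""
    kept = sorted((s for s in spans if s.score >= threshold), key=lambda s: s.start)
    out, cursor = [], 0
    for s in kept:
        if s.start < cursor:  # skip a span that overlaps one already masked
            continue
        out.append(text[cursor:s.start])
        out.append(f"[{s.label}]")
        cursor = s.end
    out.append(text[cursor:])
    return "".join(out)


# Hand-written example spans; a real detector would produce these.
text = "Contact Jane Doe at jane@example.com."
spans = [
    Span(8, 16, "NAME", 0.62),
    Span(20, 36, "EMAIL", 0.97),
]

print(mask(text, spans, threshold=0.5))  # recall-leaning: masks both spans
print(mask(text, spans, threshold=0.9))  # precision-leaning: masks only the email
```

With the lower threshold both the name and the email are replaced; with the higher one only the email is, which is the kind of tradeoff the preset operating points are described as controlling.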

EricGT · April 22, 2026, 6:54pm

Personally Identifiable Information (PII)

Topic	Category / tags	Replies	Views	Date
Security around client data	Community	3	879	July 30, 2021
Using ChatGPT to detect harmful behavior without losing access due to violating OpenAI's content policies	API	1	761	May 24, 2023
RAG on private dataset via LangChain, does OpenAI / ChatGPT get access to the documents?	API	15	19936	February 6, 2024
Integrating LLMs with Sensitive Data in Production: OpenAI API vs Azure OpenAI?	API, lost-user	2	151	March 19, 2026
Introducing gpt-oss-safeguard: Open Safety Reasoning Models with Custom Policies	Open Models, oss-safeguard	0	417	October 29, 2025
