Reverse engineering GPTs and grabbing knowledge files

0xeb · December 1, 2023, 6:21pm

Please check this out: https://youtu.be/HEAPCyet2XM?si=yefmnvsPkVIG1a1G

Now there are protection prompts but still they can be bypassed.

An official / robust protection mechanism can be handy especially;y when monetization is in effect.

callum.bradbury · December 1, 2023, 7:09pm

The GPT is not there to help you, or protect you. Once the user opens it, it’s there to serve them to the best of its ability. The only way to really safeguard against the user seeing things you don’t want them to, is to not let the GPT see anything you wouldn’t show the user.

curt.kennedy · December 1, 2023, 7:30pm

Unless some serious security is implemented, the prompts and knowledge can be extracted from any GPT. I think prompts will be easier to extract, and the knowledge will be harder, especially if there is a lot of knowledge uploaded to the GPT. But the attacker could get the gist of the knowledge.

Without any protection from attack, this will make monetization murky, because the worth of the GPT is now close to nothing because the GPT can have its information extracted, and it can be cloned.

So it will be interesting watching this.

anon10827405 · December 1, 2023, 8:13pm

Let’s compare this comment to websites.

I bet people had this same argument. “How can I obfuscate my site”, “What’s the purpose if someone else can copy/paste it”. We have minifying which to an extent works, but with effort can be reverse-engineered.

GPTs, like websites have front-end components: Retrieval, instructions. So by themself they can be considered “worth close to nothing” like a simple website with no back-end functionality and only contains aggregated relevant PDFs and text.

The important component, the “moat” is the back-end service. The actions. The function-calling. This is what will define each GPT. Truly, as an Assistant to convert unstructured semantics into powerful API calls, and continue a conversation with the retrieved/updated information.

Lastly, if my instructions and retrieval documents also relate to the action results then what’s the point in copying them? Why would I bother reverse-engineering the interface of ChatGPT when most of its functionality is intertwined with the back-end?

I mean, my lord. The instructions are completely public and we can discover the file names and reverse-engineer the data. This is not the intended philosophy.

Screenshot from 2023-12-01 15-39-55

(I have no idea why this file is double-uploaded, this was when GPTs were first released lol)

TL;DR: Stop worrying about protecting your prompts and your knowledge files (for public-facing GPTs). Assume that everything can be extracted and is rightfully “public-facing”.

N2U · December 1, 2023, 8:56pm

100% agree, if you want to only allow specific information for specific people, then the solution is Oauth and actions.

0xeb · December 5, 2023, 6:14am

I agree with you Curt.

I made a post about that: How to protect your GPTs against instruction leakage or "cracking"

I also made, perhaps, the most comprehensive protection instructions list here: GitHub - 0xeb/gpt-analyst: GPT-Analyst: A GPT for GPT analysis and reverse engineering

Jacek · December 11, 2023, 11:02pm

GPT White hat hack GPT White hat hack. Custom GPTs Marketplace is going to be… | by Jacek Wojcieszyński | Nov, 2023 | Medium

Topic		Replies	Views
Discussion how to secure your GPTs in different levels with my examples Community cybersecurity , gpts	3	1109	February 21, 2024
Custom GPTs, GPT Store and instructions protection GPT builders custom-instructions , gpt , protection	2	1853	March 27, 2025
Someone stealing my GPT listed Plugins / Actions builders plugin-development	11	1933	October 25, 2024
(this post was deleted by the author) Community chatgpt , gpts , mygpts	12	5679	November 13, 2023
😱 Concerns About File Information Extraction from GPTs Uploads Community gpts	14	4822	February 16, 2024

Reverse engineering GPTs and grabbing knowledge files

Related topics