Ok, an idea. Given Anthropic's paper on monosemanticity, where neuron activations get decomposed into interpretable features, I'd posit that detecting generated text becomes more feasible. That said, I suspect each model would need its own interpretability methodology, since every training run reshuffles the weights and activations. Who knows, but if we can get mech-interp systems working well, we might be able to figure this out. Rough sketch of what I mean below.
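
To make the idea concrete: a minimal, hypothetical sketch of probing a model's internal activations to classify text as generated vs. human. It uses raw hidden states from gpt2 as a stand-in for the monosemantic features a sparse autoencoder would give you, and the four labeled samples are made up; nothing here is from the Anthropic paper itself.

```python
# Sketch only: raw hidden states stand in for SAE features, and the
# tiny labeled dataset below is purely illustrative.
import torch
from transformers import AutoTokenizer, AutoModel
from sklearn.linear_model import LogisticRegression

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2")
model.eval()

def pooled_activations(text: str, layer: int = 6) -> torch.Tensor:
    """Mean-pool one layer's hidden states as a crude per-text feature vector."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        out = model(**inputs, output_hidden_states=True)
    return out.hidden_states[layer].mean(dim=1).squeeze(0)

# Hypothetical labels: 1 = model-generated, 0 = human-written.
samples = [
    ("She grabbed her coat and ran out before the rain started.", 0),
    ("As an AI language model, I can certainly help with that request.", 1),
    ("The committee met Tuesday to argue about the budget again.", 0),
    ("In conclusion, there are many factors to consider in this matter.", 1),
]

X = torch.stack([pooled_activations(t) for t, _ in samples]).numpy()
y = [label for _, label in samples]

clf = LogisticRegression(max_iter=1000).fit(X, y)
print(clf.predict(X))  # sanity check on the (tiny) training set itself
```

The point of the sketch is just the shape of the pipeline: activations in, simple classifier on top. If the monosemanticity work holds up, you'd swap the pooled hidden states for SAE feature activations, which is also why the probe would likely have to be retrained per model, since those features won't transfer across different weights.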