Okay I’ve come back to this project with AutoMuse3 with some new ideas!
- Rather than a finetuning project, this is more of a cognitive architecture project (prompt engineering).
- I use lessons from my “simulation microservice” to run a text-based environment simulation
- I use lesson from cognitive architecture to run a very lightweight character model for each character
- It runs recursively, summarizing the whole story, adding a scene, character, and plot events
- After simulating the setting/character/plot it tries to convert those logs into entertaining prose
It’s pretty rough right now. It tends to really go off the rails. One time, it blew up the coffee shop, killing all the characters. Another time, one of the characters left to go to work and it got stuck in an infinite loop having everyone saying goodbye. So there’s still work to do! But I think I’m onto something here. Just a little bit more sophistication and dynamic control, and it will be able to simulate several characters for an arbitrary length of time.
The prose-writing part is probably going to require a lot of prompt engineering or finetuning. Well, all of this could benefit from finetuning, but it’s off to a great start. Here’s the code:
https://github.com/daveshap/AutoMuse3
And here’s a companion video explaining it and demonstrating it: