_j
I provided the same analysis.
- Supplying a prediction always costs more than leaving it out
- It should be costing OpenAI less
- Token billing overlaps between prediction hits and misses
- Generation is slower unless the prediction matches the majority of the output
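
To make the cost point concrete, here is a minimal sketch of the arithmetic. It assumes that predicted tokens which are rejected get billed at the output rate on top of the tokens actually generated. The function name and the token counts are hypothetical, and it does not model the hit/miss billing overlap mentioned above.

```python
def billed_output_tokens(generated_tokens: int, rejected_prediction_tokens: int) -> int:
    """Output tokens billed, assuming rejected prediction tokens
    are charged in addition to the tokens in the final completion."""
    return generated_tokens + rejected_prediction_tokens


# Hypothetical request: 1000 tokens in the final completion.
without_prediction = billed_output_tokens(1000, 0)
# Same completion, but 300 predicted tokens did not match and were rejected.
with_prediction = billed_output_tokens(1000, 300)

# Under this assumption the prediction can never lower the bill.
extra_tokens = with_prediction - without_prediction
print(f"extra billed output tokens: {extra_tokens}")
```

Under that assumption the bill with a prediction is never lower than without one, and it grows with every predicted token that misses.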
Conclusion: if you don't want to always pay more for a dubious speed benefit, whether in an arbitrary or even a targeted application, don't use it.