It would be a lot easier to diagnose and evaluate the chance that this tool is emitted as a function name instead of used properly if all logprobs weren’t blocked to developers after the AI produces an initial un-reweightable tool token…
It would be a lot easier to diagnose and evaluate the chance that this tool is emitted as a function name instead of used properly if all logprobs weren’t blocked to developers after the AI produces an initial un-reweightable tool token…