Hi! We’re using agents SDK and sending traces via API - we’re able to manually run evals on recent workflows and evaluate them however it doesn’t seem too automated or scalable.
Is there a recommended workflow for automating the upload of responses / traces for evaluation and running that evaluation via the API either in real-time or as scheduled?