How to run SWE-bench Lite evaluation with Claude Agent SDK agents?