Multi-step agent workflows are fragile. Each step requires a new LLM call, and every call is another chance to hallucinate, pick the wrong tool, or lose context. py-code-mode takes a different approach: agents write Python code.
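
To make the contrast concrete, here is a minimal sketch of the kind of script an agent might write in place of several separate tool-calling turns. The tool functions `fetch_orders` and `send_summary` and the sample data are hypothetical stand-ins invented for illustration; they are not py-code-mode's API.

```python
# Hypothetical stand-ins for tools an agent could call.
# These are NOT py-code-mode's API; they exist only to make the sketch runnable.
def fetch_orders(customer_id: str) -> list[dict]:
    # Illustrative sample data, not real results.
    return [
        {"id": "A1", "total": 40.0, "status": "shipped"},
        {"id": "A2", "total": 15.5, "status": "pending"},
        {"id": "A3", "total": 60.0, "status": "shipped"},
    ]


def send_summary(recipient: str, body: str) -> dict:
    # Pretend to send a message and report what was sent.
    return {"to": recipient, "body": body, "sent": True}


def agent_script(customer_id: str, recipient: str) -> dict:
    # In a step-by-step workflow, each line below would be its own LLM call.
    # Written as code, the intermediate values stay in ordinary variables.
    orders = fetch_orders(customer_id)
    shipped = [o for o in orders if o["status"] == "shipped"]
    total = sum(o["total"] for o in shipped)
    body = f"{len(shipped)} shipped orders, total {total:.2f}"
    return send_summary(recipient, body)


result = agent_script("c-42", "ops@example.com")
print(result["body"])
```

The fetch, filter, aggregate, and send steps run as one program, so the filtering and arithmetic are done by the interpreter and do not depend on the model carrying values across turns.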