While PromptRunner is the standard interface for LLM calls, scenarios like fast research only require the LLM for query understanding and parameter decomposition.
In these cases, the final "Generation" step is redundant. Moreover, the "Long-In, Long-Out" nature of these tasks causes significant latency.
Could Embabel support User-Controlled Tool Execution, similar to Spring AI: User-Controlled Tool Execution?
This would allow PromptRunner to terminate immediately after tool execution, bypassing the final generation round to minimize latency and cost while preserving the Agent's intent-parsing capabilities.