Fix handling of multiple tool calls in single LLM response #241

finbarr · 2025-06-12T03:59:09Z

Summary

Fixed Anthropic provider to properly handle multiple tool calls in a single response
Added test coverage for multiple tool calls scenario

Context

This PR addresses issue #236 where the Anthropic provider was only processing the first tool call when multiple tool calls were returned in a single LLM response.

Changes

Modified parse_tool_calls method to handle arrays of tool blocks
Updated format_tool_call to iterate through all tool calls
Changed tool block selection to use select instead of find to get all tool_use blocks
Added comprehensive test cases for multiple tool call handling

Test plan

Added new test cases for multiple tool calls
All existing tests pass
Verified fix works with Anthropic Claude models

🤖 Generated with Claude Code

Previously, the library only processed the first tool call when an LLM returned multiple tool_use blocks in a single response. This limitation prevented parallel function execution, a key feature of modern LLMs. Changes: - Update Anthropic provider to extract ALL tool_use blocks from responses - Modify tool parser to handle arrays of content blocks - Ensure all tools execute before continuing the conversation - Add comprehensive tests for multi-tool scenarios across all providers This enables use cases like: - Executing multiple independent operations in parallel - Rolling dice multiple times in one request - Fetching data from multiple sources simultaneously The implementation maintains backward compatibility while extending support for advanced parallel tool calling capabilities. Fixes the limitation where only the first tool call was processed when multiple were requested.

finbarr · 2025-06-12T04:00:21Z

One note here is that I couldn't get bedrock credentials to work, try as I might. So the bedrock VCR is faked for the multi tool calls. Every other VCR is real.

tpaulshippy

I just tested this against Bedrock. Looks good.

tpaulshippy · 2025-06-12T05:02:39Z

One note here is that I couldn't get bedrock credentials to work, try as I might. So the bedrock VCR is faked for the multi tool calls. Every other VCR is real.

chat_function_calling_bedrock_anthropic_claude-3-5-haiku-20241022-v1_0_can_handle_multiple_tool_calls_in_a_single_response.yml.zip
Here's a real one...

finbarr · 2025-06-12T16:32:15Z

Thank you @tpaulshippy I added the real cassette.

…).

lib/ruby_llm/providers/anthropic/chat.rb

tpaulshippy · 2025-07-21T23:59:29Z

Merging this into my fork to see if it helps with intermittent "text content blocks must be non-empty" errors.

tpaulshippy · 2025-07-22T00:00:25Z

@finbarr Do you have time to address the feedback? If not, I can start a separate PR.

finbarr · 2025-07-22T08:57:15Z

@tpaulshippy feel free to go ahead. Thank you!

finbarr · 2025-07-22T09:08:22Z

@tpaulshippy I made a change. Feel free to modify further as needed.

- Updated test skip logic to check if provider is local using provider.local? method - Consolidated separate skip statements for Ollama and GPUStack into single check - Both providers have local? => true and share similar limitations with tool usage

codecov · 2025-07-22T20:52:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 87.72%. Comparing base (c80b1e3) to head (919929c).
Report is 3 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #241      +/-   ##
==========================================
+ Coverage   87.67%   87.72%   +0.04%     
==========================================
  Files          78       78              
  Lines        3001     3005       +4     
  Branches      564      567       +3     
==========================================
+ Hits         2631     2636       +5     
+ Misses        370      369       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

crmne · 2025-07-22T20:53:59Z

@finbarr great PR, thank you! Merging

This was referenced Jun 12, 2025

[BUG] Multiple simultaneous tool calls behaves unexpectedly and causes error #236

Closed

Fix handling of multiple tool calls in single LLM response #239

Closed

tpaulshippy reviewed Jun 12, 2025

View reviewed changes

Added real bedrock VCR cassette for multiple tool use (h/t @tpaulshippy…

7debc4c

…).

finbarr force-pushed the multi branch from 2de42f1 to 7debc4c Compare June 12, 2025 16:37

crmne linked an issue Jun 24, 2025 that may be closed by this pull request

[BUG] Multiple simultaneous tool calls behaves unexpectedly and causes error #236

Closed

3 tasks

crmne requested changes Jul 16, 2025

View reviewed changes

lib/ruby_llm/providers/anthropic/chat.rb Outdated Show resolved Hide resolved

crmne added the bug Something isn't working label Jul 16, 2025

Changed find_tool_use to find_tool_uses.

cfd9f25

finbarr requested a review from crmne July 22, 2025 18:55

crmne approved these changes Jul 22, 2025

View reviewed changes

crmne added 4 commits July 22, 2025 22:14

Merge branch 'main' into multi

b9ba91d

Merge branch 'main' into multi

14b90ea

Merge branch 'main' into multi

919929c

crmne merged commit 0ae5770 into crmne:main Jul 22, 2025
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix handling of multiple tool calls in single LLM response #241

Fix handling of multiple tool calls in single LLM response #241

Uh oh!

finbarr commented Jun 12, 2025 •

edited

Loading

Uh oh!

finbarr commented Jun 12, 2025

Uh oh!

tpaulshippy left a comment

Uh oh!

tpaulshippy commented Jun 12, 2025 •

edited

Loading

Uh oh!

finbarr commented Jun 12, 2025

Uh oh!

Uh oh!

tpaulshippy commented Jul 21, 2025

Uh oh!

tpaulshippy commented Jul 22, 2025

Uh oh!

finbarr commented Jul 22, 2025

Uh oh!

finbarr commented Jul 22, 2025

Uh oh!

codecov bot commented Jul 22, 2025 •

edited

Loading

Uh oh!

crmne commented Jul 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Fix handling of multiple tool calls in single LLM response #241

Fix handling of multiple tool calls in single LLM response #241

Uh oh!

Conversation

finbarr commented Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Context

Changes

Test plan

Uh oh!

finbarr commented Jun 12, 2025

Uh oh!

tpaulshippy left a comment

Choose a reason for hiding this comment

Uh oh!

tpaulshippy commented Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

finbarr commented Jun 12, 2025

Uh oh!

Uh oh!

tpaulshippy commented Jul 21, 2025

Uh oh!

tpaulshippy commented Jul 22, 2025

Uh oh!

finbarr commented Jul 22, 2025

Uh oh!

finbarr commented Jul 22, 2025

Uh oh!

codecov bot commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

crmne commented Jul 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

finbarr commented Jun 12, 2025 •

edited

Loading

tpaulshippy commented Jun 12, 2025 •

edited

Loading

codecov bot commented Jul 22, 2025 •

edited

Loading