docs: sync OpenHands LLMs list with Agent SDK VERIFIED_OPENHANDS_MODELS #89

enyst · 2025-11-06T21:31:21Z

This PR updates the OpenHands LLMs documentation to match the source of truth in the Agent SDK.

Source of truth:

agent-sdk path: openhands-sdk/openhands/sdk/llm/utils/verified_models.py
list: VERIFIED_OPENHANDS_MODELS

Changes:

Added models: claude-haiku-4-5-20251001, gpt-5-codex, claude-opus-4-1-20250805, kimi-k2-0711-preview
Removed model: devstral-small-2505
Kept all other models and aligned order with the verified list
Clarified pricing table with N/A where provider pricing/limits are not documented

Why:

Ensure docs reflect the exact set of models that are verified to work with OpenHands via the Agent SDK
Avoid drift between docs and implementation

Files changed:

openhands/usage/llms/openhands-llms.mdx

Co-authored-by: openhands [email protected]

@enyst can click here to continue refining the PR

…LS\n\nSource of truth: openhands-sdk/openhands/sdk/llm/utils/verified_models.py\n- Add: claude-haiku-4-5-20251001, gpt-5-codex, claude-opus-4-1-20250805, kimi-k2-0711-preview\n- Remove: devstral-small-2505\n- Align order with VERIFIED_OPENHANDS_MODELS\n\nCo-authored-by: openhands <[email protected]>

mamoodi · 2025-11-06T21:36:38Z

I've removed myself and asked Xingyao for a look. I don't know how correct the changes are.

… N/A; add source note\n\nSource: litellm model_prices_and_context_window_backup.json; Verified list remains source-of-truth for models.\n\nCo-authored-by: openhands <[email protected]>

…50514 1M input tokens)\n\nCo-authored-by: openhands <[email protected]>

…ored-by: openhands <[email protected]>

…\n\nCo-authored-by: openhands <[email protected]>

…p LiteLLM source note\n\nCo-authored-by: openhands <[email protected]>

…red-by: openhands <[email protected]>

…nCo-authored-by: openhands <[email protected]>

openhands/usage/llms/openhands-llms.mdx

enyst · 2025-11-06T22:36:28Z

Yup! I verified some all over the place. In general, the list didn't change much:

I added more models from verified openhands list in agent-sdk
found prices for them in litellm's JSON
double checked Claudes and GPTs
and some extra "looks the same except for x" checks.

xingyaoww

Seems good to me, irrc there was a test case checking this table agaisnt litellm's model_price JSON, can we port that over to here as well? 🤔

…e JSON)\n\n- Skips models not present or intentionally N/A\n- Compares input/cached/output costs per 1M and token limits when available\n\nCo-authored-by: openhands <[email protected]>

…Rs\n\nCo-authored-by: openhands <[email protected]>

…ints in validator\n\nCo-authored-by: openhands <[email protected]>

…oq, cloudflare)\n\nCo-authored-by: openhands <[email protected]>

enyst · 2025-11-07T01:06:23Z

.github/scripts/validate_llm_pricing.py

+                add_fail(f"input_cost mismatch: mdx={mdx_input_cost} vs litellm={exp_input_cost}")
+
+        # Cached input cost
+        if exp_cached_cost is not None or mdx_cached_cost is not None:


GPT-5 explanation for why it cares about None on cached input, but not on input or output:

Cached input cost: We treat it as both a price and a capability signal (prompt caching support). So the validator enforces presence parity and numeric accuracy:

Both None → OK

LiteLLM None, MDX number → fail (docs claim caching where provider doesn’t report it)

LiteLLM number, MDX None → fail (docs missing a provider-reported caching price)

Both numbers → compare within tolerance

Input/output costs: These are fundamental but occasionally missing in LiteLLM for preview/edge cases. To avoid false failures due to incomplete upstream data, we only compare when both sides provide numbers; if either is None, we skip strict enforcement.

It makes sense to me... WDYT?

xingyaoww

LGTM

enyst requested a review from mamoodi as a code owner November 6, 2025 21:31

mintlify bot deployed to staging November 6, 2025 21:32 View deployment

mamoodi requested review from xingyaoww and removed request for mamoodi November 6, 2025 21:36

docs: populate OpenHands LLM prices from LiteLLM DB; keep qwen3-coder…

d219bb5

… N/A; add source note\n\nSource: litellm model_prices_and_context_window_backup.json; Verified list remains source-of-truth for models.\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 22:03 View deployment

docs: align pricing/limits with LiteLLM DB (incl. claude-sonnet-4-202…

0497069

…50514 1M input tokens)\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 22:05 View deployment

docs: fix o4-mini cached read to bash.275 per 1M (LiteLLM)\n\nCo-auth…

66ca58a

…ored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 22:06 View deployment

docs: correct gemini-2.5-pro cached read to bash.125 per 1M (LiteLLM)…

b319d32

…\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 22:07 View deployment

docs: reorder Anthropic models to top; remove note about em dash; kee…

60576f5

…p LiteLLM source note\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 22:18 View deployment

docs: move devstral-small-2507 below devstral-medium-2507\n\nCo-autho…

f5ee940

…red-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 22:24 View deployment

docs: clarify pricing note—provider rates with no OpenHands markup\n\…

daed252

…nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 22:30 View deployment

enyst commented Nov 6, 2025

View reviewed changes

openhands/usage/llms/openhands-llms.mdx Outdated Show resolved Hide resolved

Update openhands/usage/llms/openhands-llms.mdx

406edba

mintlify bot deployed to staging November 6, 2025 22:31 View deployment

xingyaoww reviewed Nov 6, 2025

View reviewed changes

test: add validator to compare MDX pricing vs LiteLLM price DB (remot…

db3aecd

…e JSON)\n\n- Skips models not present or intentionally N/A\n- Compares input/cached/output costs per 1M and token limits when available\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 23:45 View deployment

ci: add GH Action to validate MDX LLM pricing against LiteLLM DB on P…

4ac6079

…Rs\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 6, 2025 23:57 View deployment

chore(test): drop typing imports; use Python 3.12 builtins for type h…

e520183

…ints in validator\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 7, 2025 00:00 View deployment

chore(test): restrict provider fallbacks (drop vertex_ai, bedrock, gr…

79c7e8f

…oq, cloudflare)\n\nCo-authored-by: openhands <[email protected]>

mintlify bot deployed to staging November 7, 2025 00:51 View deployment

enyst commented Nov 7, 2025

View reviewed changes

xingyaoww approved these changes Nov 7, 2025

View reviewed changes

xingyaoww merged commit dce8b12 into main Nov 7, 2025
3 checks passed

xingyaoww deleted the sync-verified-openhands-models branch November 7, 2025 15:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: sync OpenHands LLMs list with Agent SDK VERIFIED_OPENHANDS_MODELS #89

docs: sync OpenHands LLMs list with Agent SDK VERIFIED_OPENHANDS_MODELS #89

enyst commented Nov 6, 2025 •

edited

Loading

Uh oh!

mamoodi commented Nov 6, 2025

Uh oh!

Uh oh!

enyst commented Nov 6, 2025

Uh oh!

xingyaoww left a comment

Uh oh!

enyst Nov 7, 2025 •

edited

Loading

Uh oh!

enyst Nov 7, 2025

Uh oh!

xingyaoww left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

docs: sync OpenHands LLMs list with Agent SDK VERIFIED_OPENHANDS_MODELS #89

docs: sync OpenHands LLMs list with Agent SDK VERIFIED_OPENHANDS_MODELS #89

Conversation

enyst commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mamoodi commented Nov 6, 2025

Uh oh!

Uh oh!

enyst commented Nov 6, 2025

Uh oh!

xingyaoww left a comment

Choose a reason for hiding this comment

Uh oh!

enyst Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

enyst Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

xingyaoww left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

enyst commented Nov 6, 2025 •

edited

Loading

enyst Nov 7, 2025 •

edited

Loading