You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| GenSelect |`genselect`| Generative Solution Selection - generates multiple candidates and selects the best based on quality criteria |
162
163
| Web Search |`web_search`| Performs Google searches using Chrome automation (Selenium) to gather search results and URLs |
163
164
|[Deep Research](optillm/plugins/deep_research)|`deep_research`| Implements Test-Time Diffusion Deep Researcher (TTD-DR) for comprehensive research reports using iterative refinement |
165
+
|[Proxy](optillm/plugins/proxy)|`proxy`| Load balancing and failover across multiple LLM providers with health monitoring and round-robin routing |
164
166
165
167
We support all major LLM providers and models for inference. You need to set the correct environment variable and the proxy will pick the corresponding client.
166
168
@@ -703,6 +705,7 @@ python -m pytest tests/
703
705
## References
704
706
-[Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques](https://arxiv.org/abs/2506.08060)
705
707
-[AutoThink: efficient inference for reasoning LLMs](https://dx.doi.org/10.2139/ssrn.5253327) - [Implementation](optillm/autothink)
708
+
-[Deep Think with Confidence: Confidence-guided reasoning and inference-time scaling](https://arxiv.org/abs/2508.15260) - [Implementation](optillm/deepconf)
706
709
-[Self-Discover: Large Language Models Self-Compose Reasoning Structures
0 commit comments