
Conversation

@LuMarans30
Contributor

I've added a new section to the installation guide that provides step-by-step commands for running the proxy with a local server, specifically llama.cpp.

- Added a section in `Installation` that lists the commands needed to run the proxy with a local server (see the sketch below).
- Added the `-c` parameter to the `llama-server` command to increase the context length to 4096 tokens, from the default of 2048.
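
For concreteness, here is a minimal sketch of the kind of command the new section documents; the model path and port are placeholders, not taken from the PR. The `-c 4096` flag raises llama.cpp's context window from its 2048-token default:

```bash
# Start llama.cpp's OpenAI-compatible server with a 4096-token context.
# ./models/model.gguf is a placeholder path; substitute your own model.
llama-server -m ./models/model.gguf -c 4096 --port 8080
```

The proxy can then be pointed at this server as its OpenAI-compatible backend.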
LuMarans30 mentioned this pull request Sep 21, 2024
@codelion
Member

This looks good, but are you sure llama-cpp-python supports multiple responses using the `n` param? I think it has the same limitation as llama-server, since it is just a wrapper around it.
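
For reference, the `n` parameter in question asks an OpenAI-compatible endpoint for several completions in one request. A minimal sketch, assuming a server listening on localhost:8080 (the URL and model name are placeholders):

```bash
# Request two completions for the same prompt via the OpenAI-compatible API.
# llama-server is reported to return only a single choice regardless of "n".
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "placeholder-model",
        "messages": [{"role": "user", "content": "Hello"}],
        "n": 2
      }'
```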

@codelion
Member

This PR fixes #8.

codelion added the documentation label Sep 21, 2024
@LuMarans30
Contributor Author

LuMarans30 commented Sep 21, 2024

> This looks good, but are you sure llama-cpp-python supports multiple responses using the `n` param? I think it has the same limitation as llama-server, since it is just a wrapper around it.

Sorry, my bad. I misread the message and thought it was a merged PR. I've removed llama-cpp-python from the list of alternative servers.

codelion merged commit 22c9fc4 into algorithmicsuperintelligence:main Sep 21, 2024
@codelion
Member

@LuMarans30 I recently added a CLA; could you please sign it here: https://cla-assistant.io/codelion/optillm

@LuMarans30
Contributor Author

LuMarans30 commented Dec 14, 2024

Done, signed the CLA.

@codelion
Member

Thank you!
