Skip to content

Conversation

@ggerganov
Copy link
Member

@ggerganov ggerganov commented Jul 12, 2024

fix #8367

Testing:

make -j && ./llama-server --hf-repo "Qwen/Qwen2-0.5B-Instruct-GGUF" --hf-file qwen2-0_5b-instruct-q2_k.gguf

curl \
  --request POST --url http://localhost:8080/v1/chat/completions \
  --header "Content-Type: application/json" \
  --data '{"temperature": 0.0, "messages": [ { "role": "system", "content": "You are a helpful assistant." }, { "role": "user", "content": [ { "type": "text", "text": "tell me a " }, { "type": "text", "text": "riddle" } ] } ] }'

Co-authored-by: Xuan Son Nguyen <[email protected]>
@ggerganov ggerganov merged commit 4e24cff into master Jul 12, 2024
@ggerganov ggerganov deleted the gg/server-chat-content branch July 12, 2024 11:48
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024
* server : handle content array in chat API

* Update examples/server/utils.hpp

Co-authored-by: Xuan Son Nguyen <[email protected]>

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024
* server : handle content array in chat API

* Update examples/server/utils.hpp

Co-authored-by: Xuan Son Nguyen <[email protected]>

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 15, 2024
* server : handle content array in chat API

* Update examples/server/utils.hpp

Co-authored-by: Xuan Son Nguyen <[email protected]>

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

server : support content array in OAI chat API

3 participants