Skip to content
This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Conversation

@hiento09
Copy link

@hiento09 hiento09 commented Aug 4, 2024

No description provided.

@hiento09 hiento09 force-pushed the feature/cicd-v0.11.0 branch from 9700c99 to f6c111c Compare August 4, 2024 15:11
@hiento09 hiento09 force-pushed the feature/cicd-v0.11.0 branch 2 times, most recently from 0e4e9b6 to b643423 Compare August 4, 2024 17:53
@hiento09 hiento09 force-pushed the feature/cicd-v0.11.0 branch from b643423 to e752dd4 Compare August 4, 2024 18:12
Co-authored-by: vansangpfiev <[email protected]>
@vansangpfiev vansangpfiev merged commit f0556a4 into rebase/v0.11.0 Aug 5, 2024
vansangpfiev added a commit that referenced this pull request Aug 6, 2024
* TensorRT-LLM v0.10 update

* TensorRT-LLM Release 0.10.0

---------

Co-authored-by: Loki <[email protected]>
Co-authored-by: meghagarwal <[email protected]>

* TensorRT-LLM v0.11 Update (NVIDIA#1969)

* fix: add formatter

* fix: use executor API

* fix: sync

* fix: remove requests thread

* fix: support unload endpoint for server example, handle release resources properly

* refactor: InferenceState

* fix: new line character for Mistral and Openhermes

* fix: add benchmark script

* Add Dockerfile for runner windows (#69)

* Add Dockerfile for runner windows

* Add Dockerfile for linux

* Change CI agent

* fix: build linux (#70)

Co-authored-by: vansangpfiev <[email protected]>

---------

Co-authored-by: Hien To <[email protected]>
Co-authored-by: vansangpfiev <[email protected]>
Co-authored-by: vansangpfiev <[email protected]>

* fix: default batch_size

* chore: only linux build

---------

Co-authored-by: Kaiyu Xie <[email protected]>
Co-authored-by: Loki <[email protected]>
Co-authored-by: meghagarwal <[email protected]>
Co-authored-by: sangjanai <[email protected]>
Co-authored-by: hiento09 <[email protected]>
Co-authored-by: Hien To <[email protected]>
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants