Commit 84bc3d8
committed
Squashed commit of the following:
commit 121dcdc
Author: Samuel Monson <[email protected]>
Date: Fri Oct 10 09:36:09 2025 -0400
Configurable max_tokens/max_completion_tokens key (#399)
## Summary
<!--
Include a short paragraph of the changes introduced in this PR.
If this PR requires additional context or rationale, explain why
the changes are necessary.
-->
Makes the `max_tokens` request key configurable through an environment
variable per endpoint type. Defaults to `max_tokens` for legacy
`completions` and `max_completion_tokens` for `chat/completions`
## Details
<!--
Provide a detailed list of all changes introduced in this pull request.
-->
- Add the `GUIDELLM__OPENAI__MAX_OUTPUT_KEY` config option which is a
dict mapping from route name -> output tokens key. Default is
`{"text_completions": "max_tokens", "chat_completions":
"max_completion_tokens"}`
## Test Plan
<!--
List the steps needed to test this PR.
-->
-
## Related Issues
<!--
Link any relevant issues that this PR addresses.
-->
- Closes #395
- Closes #269
- Related #210
---
- [x] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
---------
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
commit a24a22d
Author: Samuel Monson <[email protected]>
Date: Thu Oct 9 15:57:19 2025 -0400
Fix typo in CI (#401)
## Summary
<!--
Include a short paragraph of the changes introduced in this PR.
If this PR requires additional context or rationale, explain why
the changes are necessary.
-->
## Details
<!--
Provide a detailed list of all changes introduced in this pull request.
-->
- [ ]
## Test Plan
<!--
List the steps needed to test this PR.
-->
-
## Related Issues
<!--
Link any relevant issues that this PR addresses.
-->
- Resolves #
---
- [ ] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
Signed-off-by: Samuel Monson <[email protected]>
commit 81af01b
Author: Samuel Monson <[email protected]>
Date: Thu Oct 9 15:53:45 2025 -0400
Fix the failing CI again (#400)
## Summary
<!--
Include a short paragraph of the changes introduced in this PR.
If this PR requires additional context or rationale, explain why
the changes are necessary.
-->
## Details
<!--
Provide a detailed list of all changes introduced in this pull request.
-->
- [ ]
## Test Plan
<!--
List the steps needed to test this PR.
-->
-
## Related Issues
<!--
Link any relevant issues that this PR addresses.
-->
- Resolves #
---
- [ ] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
Signed-off-by: Samuel Monson <[email protected]>
commit 90a05ab
Author: Samuel Monson <[email protected]>
Date: Thu Oct 9 14:26:50 2025 -0400
Fix for container rc tag (Attempt 2) (#398)
## Summary
<!--
Include a short paragraph of the changes introduced in this PR.
If this PR requires additional context or rationale, explain why
the changes are necessary.
-->
This is the same fix as #389 but applied to the RC workflow rather than
the release workflow as was the original intent with #389. Both
workflows need this change so not reverting the other one.
---
- [x] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
Signed-off-by: Samuel Monson <[email protected]>
commit 000b39e
Author: Samuel Monson <[email protected]>
Date: Fri Oct 3 17:46:04 2025 -0400
Fix for container rc tag (#389)
## Summary
<!--
Include a short paragraph of the changes introduced in this PR.
If this PR requires additional context or rationale, explain why
the changes are necessary.
-->
Fix to parsing rc ref in CI
---
- [x] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
Signed-off-by: Samuel Monson <[email protected]>
commit 108a657
Author: Benjamin Blue <[email protected]>
Date: Fri Oct 3 10:35:32 2025 -0400
update tpot to itl in labels and code use (#386)
## Summary
We want to use ITL instead of TPOT. The data we had previously happened
to be ITL data, but all of the labels indicate that it is TPOT data. Now
the code and labels reflect that it is ITL data.
## Test Plan
- Everything works, tests pass, No use of TPOT in the UI
---------
Signed-off-by: dalthecow <[email protected]>
Co-authored-by: Samuel Monson <[email protected]>
commit b1b1b78
Author: Benjamin Blue <[email protected]>
Date: Wed Oct 1 13:14:47 2025 -0400
update default build values to use versioned builds (#310)
## Summary
With the default path referring to the versioned build now, users will
no longer experience their html reports breaking randomly when the build
files are updated.
Also fixed versioned build directory path issue that I missed previously
---------
Signed-off-by: dalthecow <[email protected]>
commit 5c9982a
Merge: ad25e06 2c0d993
Author: Mark Kurtz <[email protected]>
Date: Wed Oct 1 08:23:27 2025 -0400
first benchark testing example (#328)
## Summary
<!--
Include a short paragraph of the changes introduced in this PR.
If this PR requires additional context or rationale, explain why
the changes are necessary.
-->
<img width="1757" height="1212" alt="image"
src="https://github.com/user-attachments/assets/fbfddeac-ca56-40c0-b7ae-d2f17d50823a"
/>
## Details
<!--
Provide a detailed list of all changes introduced in this pull request.
-->
- [ ]
## Test Plan
<!--
List the steps needed to test this PR.
-->
-
## Related Issues
<!--
Link any relevant issues that this PR addresses.
-->
- Resolves #
---
- [ ] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
commit 2c0d993
Merge: d1297fe ad25e06
Author: Mark Kurtz <[email protected]>
Date: Wed Oct 1 08:20:10 2025 -0400
Merge branch 'main' into example_simulator
commit ad25e06
Merge: f8f6f9d c32896c
Author: Mark Kurtz <[email protected]>
Date: Wed Oct 1 08:19:59 2025 -0400
Add formatting to json file with metrics (#372)
## Summary
It's inconvenient to look at metrics.
## Details
-
## Test Plan
- code launch
## Related Issues
- Resolves ##371
---
- [x] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
commit d1297fe
Merge: 8159ca7 f8f6f9d
Author: Mark Kurtz <[email protected]>
Date: Wed Oct 1 08:17:36 2025 -0400
Merge branch 'main' into example_simulator
commit c32896c
Merge: 0701389 f8f6f9d
Author: Mark Kurtz <[email protected]>
Date: Wed Oct 1 08:14:35 2025 -0400
Merge branch 'main' into add_json_formatiing
commit f8f6f9d
Author: Samuel Monson <[email protected]>
Date: Tue Sep 30 10:21:54 2025 -0400
Container CI bugfix and disable dry-run on image cleaner (#379)
## Summary
<!--
Include a short paragraph of the changes introduced in this PR.
If this PR requires additional context or rationale, explain why
the changes are necessary.
-->
Final pieces needed for image CI work. Fully enables auto `latest`,
`stable` tags and old image pruning.
## Details
<!--
Provide a detailed list of all changes introduced in this pull request.
-->
- Add `pipefail` to list-tags command to catch failures
- Add missing `ghcr.io/` to skopeo commands
- Disable dry-run option for development image cleanup job
## Test Plan
Ran with `workflow_dispatch` [see
here](https://github.com/vllm-project/guidellm/actions/runs/18108553536)
<img width="2032" height="955" alt="2025-09-29T15-45-39"
src="https://github.com/user-attachments/assets/b981ab01-fe90-4e15-bf60-cb483508065e"
/>
<img width="1204" height="579" alt="2025-09-29T15-46-02"
src="https://github.com/user-attachments/assets/68118168-2e80-4d45-92cc-47badc1caf16"
/>
---
- [x] "I certify that all code in this PR is my own, except as noted
below."
## Use of AI
- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a
docstring that includes `## WRITTEN BY AI ##`)
---------
Signed-off-by: Samuel Monson <[email protected]>
commit 0701389
Author: psydok <[email protected]>
Date: Thu Sep 25 23:14:36 2025 +0500
Add formatting to json file with metrics
Signed-off-by: psydok <[email protected]>
commit 8159ca7
Author: guangli.bao <[email protected]>
Date: Mon Sep 15 12:07:08 2025 +0800
first draft
Signed-off-by: guangli.bao <[email protected]>1 parent fbd417f commit 84bc3d8
File tree
23 files changed
+223
-86
lines changed- .github/workflows
- docs
- assets
- examples
- src
- guidellm
- presentation
- ui/lib
- components
- MetricsSummary
- WorkloadMetrics
- store
- slices
- benchmarks
- metrics
- slo
- tests
- ui/unit/mocks
- unit
23 files changed
+223
-86
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
9 | 9 | | |
10 | 10 | | |
11 | 11 | | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
12 | 15 | | |
13 | 16 | | |
14 | 17 | | |
15 | 18 | | |
16 | 19 | | |
17 | 20 | | |
18 | 21 | | |
19 | | - | |
| 22 | + | |
20 | 23 | | |
21 | 24 | | |
22 | 25 | | |
23 | 26 | | |
24 | | - | |
| 27 | + | |
25 | 28 | | |
26 | 29 | | |
27 | 30 | | |
| |||
31 | 34 | | |
32 | 35 | | |
33 | 36 | | |
34 | | - | |
| 37 | + | |
35 | 38 | | |
36 | 39 | | |
37 | 40 | | |
38 | 41 | | |
39 | | - | |
| 42 | + | |
| 43 | + | |
40 | 44 | | |
41 | 45 | | |
42 | 46 | | |
43 | | - | |
| 47 | + | |
44 | 48 | | |
45 | | - | |
| 49 | + | |
46 | 50 | | |
47 | 51 | | |
48 | | - | |
49 | | - | |
| 52 | + | |
| 53 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
228 | 228 | | |
229 | 229 | | |
230 | 230 | | |
231 | | - | |
| 231 | + | |
232 | 232 | | |
233 | 233 | | |
234 | 234 | | |
| |||
298 | 298 | | |
299 | 299 | | |
300 | 300 | | |
301 | | - | |
| 301 | + | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
302 | 307 | | |
303 | 308 | | |
304 | 309 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
227 | 227 | | |
228 | 228 | | |
229 | 229 | | |
230 | | - | |
| 230 | + | |
231 | 231 | | |
232 | 232 | | |
233 | 233 | | |
| |||
297 | 297 | | |
298 | 298 | | |
299 | 299 | | |
300 | | - | |
| 300 | + | |
| 301 | + | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
301 | 306 | | |
302 | 307 | | |
303 | 308 | | |
| |||
Loading
Loading
Loading
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
118 | 118 | | |
119 | 119 | | |
120 | 120 | | |
121 | | - | |
| 121 | + | |
| 122 | + | |
122 | 123 | | |
123 | 124 | | |
124 | 125 | | |
125 | 126 | | |
126 | | - | |
| 127 | + | |
| 128 | + | |
127 | 129 | | |
128 | 130 | | |
129 | 131 | | |
| |||
210 | 212 | | |
211 | 213 | | |
212 | 214 | | |
213 | | - | |
| 215 | + | |
214 | 216 | | |
215 | 217 | | |
216 | 218 | | |
| |||
219 | 221 | | |
220 | 222 | | |
221 | 223 | | |
222 | | - | |
| 224 | + | |
223 | 225 | | |
224 | 226 | | |
225 | 227 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
32 | 32 | | |
33 | 33 | | |
34 | 34 | | |
35 | | - | |
36 | | - | |
| 35 | + | |
| 36 | + | |
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
| |||
89 | 89 | | |
90 | 90 | | |
91 | 91 | | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
92 | 96 | | |
93 | 97 | | |
94 | 98 | | |
| |||
Lines changed: 15 additions & 15 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
54 | 54 | | |
55 | 55 | | |
56 | 56 | | |
57 | | - | |
| 57 | + | |
58 | 58 | | |
59 | 59 | | |
60 | 60 | | |
61 | 61 | | |
62 | 62 | | |
63 | 63 | | |
64 | 64 | | |
65 | | - | |
| 65 | + | |
66 | 66 | | |
67 | 67 | | |
68 | 68 | | |
| |||
72 | 72 | | |
73 | 73 | | |
74 | 74 | | |
75 | | - | |
76 | | - | |
| 75 | + | |
| 76 | + | |
77 | 77 | | |
78 | 78 | | |
79 | 79 | | |
| |||
123 | 123 | | |
124 | 124 | | |
125 | 125 | | |
126 | | - | |
| 126 | + | |
127 | 127 | | |
128 | 128 | | |
129 | 129 | | |
| |||
133 | 133 | | |
134 | 134 | | |
135 | 135 | | |
136 | | - | |
137 | | - | |
138 | | - | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
139 | 139 | | |
140 | 140 | | |
141 | | - | |
| 141 | + | |
142 | 142 | | |
143 | 143 | | |
144 | 144 | | |
| |||
212 | 212 | | |
213 | 213 | | |
214 | 214 | | |
215 | | - | |
| 215 | + | |
216 | 216 | | |
217 | 217 | | |
218 | 218 | | |
| |||
222 | 222 | | |
223 | 223 | | |
224 | 224 | | |
225 | | - | |
226 | | - | |
| 225 | + | |
| 226 | + | |
227 | 227 | | |
228 | 228 | | |
229 | 229 | | |
230 | 230 | | |
231 | 231 | | |
232 | 232 | | |
233 | | - | |
234 | | - | |
235 | | - | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
236 | 236 | | |
237 | 237 | | |
238 | 238 | | |
| |||
0 commit comments