Skip to content

Commit c6e07ed

Browse files
change tests, add to CI
Signed-off-by: Govind Ramnarayan <[email protected]>
1 parent 95ba19c commit c6e07ed

File tree

3 files changed

+4
-2
lines changed

3 files changed

+4
-2
lines changed

tests/integration/defs/examples/test_ad_speculative_decoding.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@ def run_with_autodeploy(model, speculative_model_dir, batch_size):
6969

7070
# Configure KV cache
7171
kv_cache_config = KvCacheConfig(
72-
free_gpu_memory_fraction=0.2,
72+
free_gpu_memory_fraction=0.1,
7373
)
7474

7575
# Configure AutoDeploy LLM arguments

tests/integration/test_lists/test-db/l0_h100.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -107,6 +107,8 @@ l0_h100:
107107
- accuracy/test_llm_api_autodeploy.py::TestLlama3_1_8B::test_auto_dtype[True-1]
108108
- accuracy/test_llm_api_autodeploy.py::TestNemotronH::test_auto_dtype[False]
109109
- accuracy/test_llm_api_autodeploy.py::TestNemotronH::test_auto_dtype[True]
110+
- examples/test_ad_speculative_decoding.py::test_autodeploy_spec_dec[1]
111+
- examples/test_ad_speculative_decoding.py::test_autodeploy_spec_dec[4]
110112
- condition:
111113
ranges:
112114
system_gpu_count:

tests/unittest/_torch/auto_deploy/unit/singlegpu/test_ad_speculative_decoding.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,7 @@ def test_ad_speculative_decoding_smoke():
4141

4242
# Configure KV cache
4343
kv_cache_config = KvCacheConfig(
44-
free_gpu_memory_fraction=0.1,
44+
free_gpu_memory_fraction=0.0001,
4545
)
4646

4747
experiment_config["args"]["runtime"] = "trtllm"

0 commit comments

Comments
 (0)