Skip to content

Commit 55e817f

Browse files
committed
Adapted to pytest framework
1 parent 1077f2e commit 55e817f

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

test/common/llmperf/run_inference.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -95,6 +95,7 @@ def run_test_cases(test_cases, timestamp_dir, model, server_url, tokenizer_path)
9595
tokenizer_path=tokenizer_path,
9696
user_metadata={"case_idx": i, "phase": "prefill"},
9797
)
98+
reset_prefill_cache(env, server_url)
9899
# Then run normal mode
99100
print("[INFO] Prefill completed, switching to normal mode execution")
100101
summary = run_token_benchmark(

0 commit comments

Comments
 (0)