DeepSeek-V3, R1 671B on 8xH100 Throughput Benchmarks
-
Updated
Mar 13, 2025 - Python
DeepSeek-V3, R1 671B on 8xH100 Throughput Benchmarks
Describing How to Use Throughput Mode to Run Inference Effectively on Multiple NCS2 Devices with Intel (r) OpenVINO toolkit
This project is an implementation of a high performant, thread safe logs distributor system. The system accepts and distributes packet requests from a configurable number of agents and distributes to analyzers.
evaluate llm's generation speed via API
Add a description, image, and links to the throughput-performance topic page so that developers can more easily learn about it.
To associate your repository with the throughput-performance topic, visit your repo's landing page and select "manage topics."