Implemented SIGNUM function kernel #43
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
FPGA Log:
root@agilex7_dk_si_agf014ea:/tsi/ganesh/work# ./run_llama.sh
load_model: using TSavorite backend
Calculating mem_size 384 2 and creating ggml context
Creating input Tensor
Creating Backend Buffer
Loading Input Tensor Data to Backend Buffer
Bringing tensor data from Backend buffer and printing 32 tensor data:
[ 1.10 2.30 3.20 4.00 5.00 6.00 7.00 8.00 9.00 10.00 11.00 12.00 13.00 14.00 15.00 16.00 17.00 18.00 19.00 20.00 21.00 22.00 23.00 24.00 25.00 26.00 27.00 28.00 29.00 30.00 31.00 32.00 ]
main: compute buffer size: 0.2500 KB
Under Test case for compute API creating build_graph
Compute Done
operation type: 0, num of elements 32
compute is also done
TEST CASE PASSED
GGML Tsavorite Profiling Results:
Calls Total(ms) T/call Self(ms) Function
[Thread] tsi::runtime::TsavRT::awaitCommandListCompletion (cumulative over all threads)
[Thread] tsi::runtime::TsavRT::finalizeCommandList (cumulative over all threads)
[Thread] tsi::runtime::TsavRT::processResponses (cumulative over all threads)
[Thread] tsi::runtime::TsavRTFPGA::finalize (cumulative over all threads)
[Thread] tsi::runtime::TsavRT::allocate (cumulative over all threads)
[Thread] tsi::runtime::TsavRTFPGA::loadBlob (cumulative over all threads)
[Thread] tsi::runtime::TsavRT::addCommandToList (cumulative over all threads)
[Thread] tsi::runtime::TsavRTFPGA::unloadBlob (cumulative over all threads)
[Thread] tsi::runtime::TsavRT::deallocate (cumulative over all threads)
========================================================================================================================
- 2079.6420 0.0000 2079.6420 [100.00%] TOTAL
Counter Metrics:
Metric Min Max Avg
Queue_0_Occupancy 0.0000 1.0000 0.3333
root@agilex7_dk_si_agf014ea:/tsi/ganesh/work#