1
- # matrix multiply sample
2
- A sample containing multiple implementations of matrix multiplication. This sample code is implemented using C++ and SYCL language for CPU and GPU.
1
+ # Matrix Multiply Sample
2
+ A sample containing multiple implementations of matrix multiplication. This sample code is implemented using DPC++ language for CPU and GPU.
3
3
4
4
| Optimized for | Description
5
5
|:--- |:---
@@ -26,7 +26,7 @@ Edit the line in multiply.h to select the version of the multiply function:
26
26
#define MULTIPLY multiply1
27
27
28
28
29
- ### on Linux
29
+ ### On a Linux* System
30
30
To build DPC++ version:
31
31
cd <sample dir>
32
32
cmake .
@@ -35,7 +35,7 @@ Edit the line in multiply.h to select the version of the multiply function:
35
35
Clean the program
36
36
make clean
37
37
38
- ### on Windows - Visual Studio 2017 or newer
38
+ ### On a Windows* System Using Visual Studio 2017 or newer
39
39
* Open Visual Studio 2017
40
40
* Select Menu "File > Open > Project/Solution", find "matrix_multiply" folder and select "matrix_multiply.sln"
41
41
* Select Menu "Project > Build" to build the selected configuration
@@ -47,6 +47,23 @@ Edit the line in multiply.h to select the version of the multiply function:
47
47
Debug - MSBuild matrix_multiply.sln /t:Rebuild /p:Configuration="Debug"
48
48
49
49
50
+ ## Running the Sample
51
+
52
+ # Example of Output
53
+
54
+ ./matrix.dpcpp
55
+ Address of buf1 = 0x7f5e687eb010
56
+ Offset of buf1 = 0x7f5e687eb180
57
+ Address of buf2 = 0x7f5e67fea010
58
+ Offset of buf2 = 0x7f5e67fea1c0
59
+ Address of buf3 = 0x7f5e677e9010
60
+ Offset of buf3 = 0x7f5e677e9100
61
+ Address of buf4 = 0x7f5e66fe8010
62
+ Offset of buf4 = 0x7f5e66fe8140
63
+ Using multiply kernel: multiply1
64
+ Running on Intel(R) Gen9
65
+ Elapsed Time: 0.539631s
66
+
50
67
## Running an Intel Advisor analysis
51
68
------------------------------------------
52
69
0 commit comments