Commit c5eedc8

author
kevin.p.oleary
committed
New style
1 parent ce9ad28 commit c5eedc8

File tree

1 file changed

+30
-9
lines changed
  • Tools/VTuneProfiler/matrix_multiply_vtune

Tools/VTuneProfiler/matrix_multiply_vtune/README.md

Lines changed: 30 additions & 9 deletions
@@ -1,15 +1,21 @@
1-
# matrix multiply sample
2-
A sample containing multiple implementations of matrix multiplication. This sample code is implemented using C++ and SYCL language for CPU and GPU.
1+
# Matrix Multiply Sample
2+
A sample containing multiple implementations of matrix multiplication. This sample code is implemented using the DPC++ language for CPU and GPU.
33

44
| Optimized for | Description
55
|:--- |:---
66
| OS | Linux Ubuntu 18.04; Windows 10
77
| Hardware | Kaby Lake with GEN9 or newer
8-
| Software | Intel(R) oneAPI DPC++ Compiler beta; Intel(R) VTune(TM) Profiler
8+
| Software | Intel(R) oneAPI DPC++ Compiler (beta); VTune(TM) Profiler
99
| What you will learn | How to profile an application using Intel(R) VTune(TM) Profiler
1010
| Time to complete | 15 minutes
1111

12-
12+
## Purpose
13+
14+
The Matrix Multiplication sample performs basic matrix multiplication. Three versions are provided that use different features of DPC++.
15+
16+
## Key Implementation details
17+
18+
The basic DPC++ implementation explained in the code includes device selector, buffer, accessor, kernel, and command groups.
1319

1420
## License
1521
This code sample is licensed under the MIT license.
@@ -26,7 +32,7 @@ Edit the line in multiply.h to select the version of the multiply function:
2632
#define MULTIPLY multiply1
2733

2834

29-
### on Linux
35+
### On a Linux* System
3036
To build DPC++ version:
3137
cd <sample dir>
3238
cmake .
@@ -35,21 +41,36 @@ Edit the line in multiply.h to select the version of the multiply function:
3541
Clean the program
3642
make clean
3743

38-
### on Windows - Visual Studio 2017 or newer
44+
### On a Windows* System Using Visual Studio 2017 or newer
3945
* Open Visual Studio 2017
4046
* Select Menu "File > Open > Project/Solution", find "matrix_multiply" folder and select "matrix_multiply.sln"
4147
* Select Menu "Project > Build" to build the selected configuration
4248
* Select Menu "Debug > Start Without Debugging" to run the program
43-
49+
4450
### on Windows - command line - Build the program using MSBuild
4551
DPCPP Configurations:
4652
Release - MSBuild matrix_multiply.sln /t:Rebuild /p:Configuration="Release"
4753
Debug - MSBuild matrix_multiply.sln /t:Rebuild /p:Configuration="Debug"
4854

4955

56+
## Running the Sample
57+
58+
### Example of Output
59+
60+
./matrix.dpcpp
61+
Address of buf1 = 0x7f5e687eb010
62+
Offset of buf1 = 0x7f5e687eb180
63+
Address of buf2 = 0x7f5e67fea010
64+
Offset of buf2 = 0x7f5e67fea1c0
65+
Address of buf3 = 0x7f5e677e9010
66+
Offset of buf3 = 0x7f5e677e9100
67+
Address of buf4 = 0x7f5e66fe8010
68+
Offset of buf4 = 0x7f5e66fe8140
69+
Using multiply kernel: multiply1
70+
Running on Intel(R) Gen9
71+
Elapsed Time: 0.539631s
72+
5073
## Running an Intel VTune Profiler analysis
5174
------------------------------------------
5275

5376
vtune -collect gpu-hotspots -- ./matrix.dpcpp
54-
55-
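
The README in this diff selects a multiply version by editing `#define MULTIPLY multiply1` in multiply.h. As a rough illustration of that compile-time selection pattern, here is a minimal plain C++ sketch. It is not the sample's code: the names `Matrix`, `multiply_ijk`, `multiply_ikj`, and `run_multiply` are hypothetical, and the sample's actual `multiply1` kernel is written in DPC++ and its body is not shown in this diff.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper type: a square matrix stored row-major in a flat vector.
using Matrix = std::vector<double>;

// Hypothetical variant A: textbook i-j-k loop order.
void multiply_ijk(std::size_t n, const Matrix& a, const Matrix& b, Matrix& c) {
    assert(a.size() == n * n && b.size() == n * n && c.size() == n * n);
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            double sum = 0.0;
            for (std::size_t k = 0; k < n; ++k) {
                sum += a[i * n + k] * b[k * n + j];
            }
            c[i * n + j] = sum;
        }
    }
}

// Hypothetical variant B: i-k-j loop order, so the inner loop walks
// b and c contiguously in memory.
void multiply_ikj(std::size_t n, const Matrix& a, const Matrix& b, Matrix& c) {
    assert(a.size() == n * n && b.size() == n * n && c.size() == n * n);
    for (std::size_t idx = 0; idx < n * n; ++idx) {
        c[idx] = 0.0;
    }
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t k = 0; k < n; ++k) {
            const double aik = a[i * n + k];
            for (std::size_t j = 0; j < n; ++j) {
                c[i * n + j] += aik * b[k * n + j];
            }
        }
    }
}

// Compile-time selection in the spirit of the README's
// `#define MULTIPLY multiply1` line (assumed pattern, hypothetical names).
#ifndef MULTIPLY
#define MULTIPLY multiply_ijk
#endif

// Allocates the result matrix and calls whichever variant MULTIPLY names.
Matrix run_multiply(std::size_t n, const Matrix& a, const Matrix& b) {
    Matrix c(n * n, 0.0);
    MULTIPLY(n, a, b, c);
    return c;
}
```

Calling `run_multiply(2, a, b)` from a `main` function exercises the selected variant. Rebuilding with `-DMULTIPLY=multiply_ikj` switches variants without touching the call site, which is what makes it convenient to profile one version at a time.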
