Feature update for intrin_dot_sample.cpp #63

@zjin-lcf

Summary


The README says of intrinsics: "They provide access to instructions that cannot be generated using the standard constructs of the C and C++ languages, and allow code to leverage performance enhancing features unique to specific processors."
Could the sample also use SIMD compiler options to achieve vectorization?
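
To make the question concrete, here is a minimal sketch of a compiler-vectorized dot product. The function name `dot_auto` is hypothetical and is not taken from the sample. It assumes GCC or Clang with something like `-O3 -march=native -fopenmp-simd`; other compilers have their own options. The `reduction` clause matters because floating-point addition is not associative, so a compiler will generally not reorder the sum into vector lanes unless it is given permission.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper: plain scalar dot product that the compiler may vectorize.
// If OpenMP SIMD support is not enabled, the pragma is ignored (possibly with a
// warning) and the loop still computes the same sum.
float dot_auto(const float* a, const float* b, std::size_t n) {
    assert(a != nullptr && b != nullptr);
    float sum = 0.0f;
    #pragma omp simd reduction(+ : sum)
    for (std::size_t i = 0; i < n; ++i) {
        sum += a[i] * b[i];
    }
    return sum;
}
```

Because the loop bound is just `n`, this version has no restriction on the array size. Whether the loop is actually vectorized depends on the compiler and flags, so it is worth checking the compiler's vectorization report.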

Problem statement

The sample offers no comparison between the performance of the intrinsics version and that of a version vectorized with SIMD compiler options.
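
One possible way to make such a comparison is a small timing harness like the sketch below. `seconds_per_call` is a hypothetical name, and the approach (averaging over repeated calls with `std::chrono::steady_clock`) is an assumption, not something the sample currently does.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>

// Hypothetical harness: average wall-clock seconds per call of any callable
// with the shape dot(a, b, n) -> float.
template <typename DotFn>
double seconds_per_call(DotFn dot, const float* a, const float* b,
                        std::size_t n, int repeats) {
    assert(repeats > 0);
    volatile float sink = 0.0f;  // volatile store keeps the calls from being optimized away
    const auto start = std::chrono::steady_clock::now();
    for (int r = 0; r < repeats; ++r) {
        sink = dot(a, b, n);
    }
    const auto stop = std::chrono::steady_clock::now();
    (void)sink;
    return std::chrono::duration<double>(stop - start).count() / repeats;
}
```

Calling it once with an intrinsics-based function and once with a compiler-vectorized one, on the same inputs, would give two numbers to compare. For arrays as small as 24 elements a large `repeats` value is needed for the measurement to mean anything.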

Preferred solution

Add a solution for problem sizes that are not a multiple of 8. The size is currently 24.
If possible, add a variant that achieves vectorization through SIMD compiler options.
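
For the first point, a common pattern is to process full groups of 8 floats with AVX and finish the leftover elements with a scalar loop. The sketch below illustrates that pattern only; `dot_any_size` is a hypothetical name and the code is not the sample's implementation. The AVX path is compiled only when the compiler defines `__AVX__` (for example with `-mavx` on GCC or Clang); otherwise the scalar loop handles everything.

```cpp
#include <cassert>
#include <cstddef>
#if defined(__AVX__)
#include <immintrin.h>
#endif

// Hypothetical helper: dot product for any n, not only multiples of 8.
float dot_any_size(const float* a, const float* b, std::size_t n) {
    assert(a != nullptr && b != nullptr);
    float sum = 0.0f;
    std::size_t i = 0;
#if defined(__AVX__)
    // Vector body: 8 floats per iteration, unaligned loads.
    __m256 acc = _mm256_setzero_ps();
    for (; i + 8 <= n; i += 8) {
        const __m256 va = _mm256_loadu_ps(a + i);
        const __m256 vb = _mm256_loadu_ps(b + i);
        acc = _mm256_add_ps(acc, _mm256_mul_ps(va, vb));
    }
    // Horizontal sum of the 8 partial sums.
    float lanes[8];
    _mm256_storeu_ps(lanes, acc);
    for (int k = 0; k < 8; ++k) {
        sum += lanes[k];
    }
#endif
    // Scalar tail: the remaining n % 8 elements (or all of them without AVX).
    for (; i < n; ++i) {
        sum += a[i] * b[i];
    }
    return sum;
}
```

With this structure a size of 24 runs entirely in the vector loop, while a size such as 27 runs three vector iterations and then three scalar ones. A masked load for the tail is another option, but the scalar loop is simpler to verify.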

Thanks

Labels

enhancement (New feature or request)
