OMP: use pragmas instead of c-api to allocate and delete target data #708

alkino · 2021-12-10T14:57:16Z

This way is more convenient for us.
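For readers unfamiliar with the two styles, here is a minimal sketch of the pragma-based approach named in the title. It is an illustration only, not the code from this PR: `target_alloc`, `target_delete` and `scaled_sum` are hypothetical names, and the C API being replaced is assumed to be along the lines of `omp_target_alloc` / `omp_target_associate_ptr` / `omp_target_free`. With pragmas, the OpenMP runtime keeps the host-to-device association itself, so no device pointer has to be tracked by hand.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not from the PR): create an uninitialised device
// copy of host_ptr[0:len] and let the runtime record the association.
template <typename T>
void target_alloc(T* host_ptr, std::size_t len) {
    #pragma omp target enter data map(alloc: host_ptr[0:len])
}

// Hypothetical helper (not from the PR): drop the device copy of
// host_ptr[0:len] without copying anything back to the host.
template <typename T>
void target_delete(T* host_ptr, std::size_t len) {
    #pragma omp target exit data map(delete: host_ptr[0:len])
}

// Example use: sum factor * v[i] on the device. When compiled without
// OpenMP offload the pragmas are ignored and the loop runs on the host.
double scaled_sum(std::vector<double>& v, double factor) {
    double* data = v.data();
    const std::size_t n = v.size();
    target_alloc(data, n);
    // Copy the host values into the freshly allocated device buffer.
    #pragma omp target update to(data[0:n])
    double sum = 0.0;
    #pragma omp target teams distribute parallel for reduction(+:sum) map(tofrom: sum)
    for (std::size_t i = 0; i < n; ++i) {
        sum += factor * data[i];
    }
    target_delete(data, n);
    return sum;
}
```

Because `map(alloc: ...)` only reserves device memory, the data has to be transferred separately (here with `target update to`), which roughly mirrors an allocate-then-copy sequence written against the C API.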

CI_BRANCHES:NMODL_BRANCH=hackathon_main,NEURON_BRANCH=master,

bbpbuildbot · 2021-12-10T15:46:03Z

Logfiles from GitLab pipeline #28830 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-10T17:43:48Z

Logfiles from GitLab pipeline #28849 (:white_check_mark:) have been uploaded here!


olupton

Looks nice! I think we should do a little renaming to make things more consistent, but I think it should all be easy search/replace stuff once we decide what to do.

coreneuron/gpu/nrn_acc_manager.cpp

bbpbuildbot · 2021-12-13T13:08:49Z

Logfiles from GitLab pipeline #28955 (:white_check_mark:) have been uploaded here!


bbpbuildbot · 2021-12-13T15:05:08Z

Logfiles from GitLab pipeline #29026 (:white_check_mark:) have been uploaded here!


olupton

🚀

Summary of changes:
- Support OpenMP target offload when NMODL and GPU support are enabled. (#693, #704, #705, #707, #708, #716, #719)
- Use sensible defaults for the --nwarp parameter, improving the performance of the Hines solver with --cell-permute=2 on GPU. (#700, #710, #718)
- Use a Boost memory pool, if Boost is available, to reduce the number of independent CUDA unified memory allocations used for Random123 stream objects. This speeds up initialisation of models using Random123, and also makes it feasible to use NSight Compute on models using Random123 and for NSight Systems to profile initialisation. (#702, #703)
- Use -cuda when compiling with NVHPC and OpenACC or OpenMP, as recommended on the NVIDIA forums. (#721)
- Do not compile for compute capability 6.0 by default, as this is not supported by NVHPC with OpenMP target offload.
- Add new GitLab CI tests so we test CoreNEURON + NMODL with both OpenACC and OpenMP. (#698, #717)
- Add CUDA runtime header search path explicitly, so we don't rely on it being implicit in our NVHPC localrc.
- Cleanup unused code. (#711)

Co-authored-by: Pramod Kumbhar <[email protected]>
Co-authored-by: Ioannis Magkanaris <[email protected]>
Co-authored-by: Christos Kotsalos <[email protected]>
Co-authored-by: Nicolas Cornu <[email protected]>

Summary of changes:
- Support OpenMP target offload when NMODL and GPU support are enabled. (BlueBrain/CoreNeuron#693, BlueBrain/CoreNeuron#704, BlueBrain/CoreNeuron#705, BlueBrain/CoreNeuron#707, BlueBrain/CoreNeuron#708, BlueBrain/CoreNeuron#716, BlueBrain/CoreNeuron#719)
- Use sensible defaults for the --nwarp parameter, improving the performance of the Hines solver with --cell-permute=2 on GPU. (BlueBrain/CoreNeuron#700, BlueBrain/CoreNeuron#710, BlueBrain/CoreNeuron#718)
- Use a Boost memory pool, if Boost is available, to reduce the number of independent CUDA unified memory allocations used for Random123 stream objects. This speeds up initialisation of models using Random123, and also makes it feasible to use NSight Compute on models using Random123 and for NSight Systems to profile initialisation. (BlueBrain/CoreNeuron#702, BlueBrain/CoreNeuron#703)
- Use -cuda when compiling with NVHPC and OpenACC or OpenMP, as recommended on the NVIDIA forums. (BlueBrain/CoreNeuron#721)
- Do not compile for compute capability 6.0 by default, as this is not supported by NVHPC with OpenMP target offload.
- Add new GitLab CI tests so we test CoreNEURON + NMODL with both OpenACC and OpenMP. (BlueBrain/CoreNeuron#698, BlueBrain/CoreNeuron#717)
- Add CUDA runtime header search path explicitly, so we don't rely on it being implicit in our NVHPC localrc.
- Cleanup unused code. (BlueBrain/CoreNeuron#711)

Co-authored-by: Pramod Kumbhar <[email protected]>
Co-authored-by: Ioannis Magkanaris <[email protected]>
Co-authored-by: Christos Kotsalos <[email protected]>
Co-authored-by: Nicolas Cornu <[email protected]>

CoreNEURON Repo SHA: BlueBrain/CoreNeuron@423ae6c

alkino marked this pull request as draft December 10, 2021 14:57

alkino marked this pull request as ready for review December 10, 2021 17:46

alkino requested review from olupton and pramodk December 10, 2021 23:15

olupton reviewed Dec 13, 2021


olupton mentioned this pull request Dec 13, 2021

Implement omp_get_mapped_ptr until it is available #705

Merged

iomaganaris reviewed Dec 13, 2021


coreneuron/gpu/nrn_acc_manager.cpp (outdated, resolved)

coreneuron/gpu/nrn_acc_manager.cpp (outdated, resolved)

alkino force-pushed the no_more_c_api branch from 94c5e57 to d16ee73 December 13, 2021 11:18

Nicolas Cornu added 3 commits December 13, 2021 12:24

Make a try with pragmas

0c42f40

Simplifying pointers

84f0322

More template, more const

f34aabf

alkino force-pushed the no_more_c_api branch from d16ee73 to f34aabf December 13, 2021 11:25

Fix static_cast to const_cast

4b9af99

olupton reviewed Dec 13, 2021


coreneuron/gpu/nrn_acc_manager.cpp (outdated, resolved)

Fix VecPlayContinuous::discon_indices_ and OpenACC compilation.

b1ac853

olupton approved these changes Dec 13, 2021


olupton merged commit 0fe815e into hackathon_main Dec 13, 2021

olupton deleted the no_more_c_api branch December 13, 2021 15:08


4 participants
