GPU data management using OpenACC as well as OpenMP API #704

pramodk · 2021-12-06T21:00:07Z

Description

Wrap the GPU data transfer API routines so that they work with both the OpenACC and the OpenMP APIs.

* Cleanup
* Replace the other OpenACC API calls in use
* Rewrite all acc_update_device calls as pragmas
* Add an omp update device for each update device
* Rewrite all acc_update_self calls as pragmas
* Add an omp update host for each update self
* Fix the last acc_update_self and acc_update_device calls
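As an illustration of what such a wrapper could look like, here is a minimal sketch, not the code from this PR. The names `sketch_copy_to_device` and `sketch_delete_from_device` are hypothetical, and using `CORENEURON_PREFER_OPENMP_OFFLOAD` (a macro name that appears in this PR's commits) as the backend switch is an assumption about how the selection works. Only standard OpenACC and OpenMP runtime routines are called, and the code falls back to a host-only no-op when neither backend is enabled.

```cpp
#include <cassert>
#include <cstddef>

// Backend selection (assumption: the real project may choose differently).
#if defined(CORENEURON_PREFER_OPENMP_OFFLOAD) && defined(_OPENMP)
#include <omp.h>
#define SKETCH_USE_OPENMP_OFFLOAD 1
#elif defined(_OPENACC)
#include <openacc.h>
#define SKETCH_USE_OPENACC 1
#endif

// Hypothetical wrapper: create a device copy of h_ptr[0:n] and return the
// device address. Without a GPU backend the host pointer is returned as-is.
template <typename T>
T* sketch_copy_to_device(T* h_ptr, std::size_t n) {
#if defined(SKETCH_USE_OPENMP_OFFLOAD)
    const int device = omp_get_default_device();
    const std::size_t bytes = n * sizeof(T);
    void* d_ptr = omp_target_alloc(bytes, device);
    assert(d_ptr != nullptr);
    // Copy host -> device, then register the host/device association so
    // that later "target update" directives can find the device copy.
    int status = omp_target_memcpy(d_ptr, h_ptr, bytes, 0, 0, device,
                                   omp_get_initial_device());
    assert(status == 0);
    status = omp_target_associate_ptr(h_ptr, d_ptr, bytes, 0, device);
    assert(status == 0);
    (void) status;
    return static_cast<T*>(d_ptr);
#elif defined(SKETCH_USE_OPENACC)
    return static_cast<T*>(acc_copyin(h_ptr, n * sizeof(T)));
#else
    (void) n;
    return h_ptr;
#endif
}

// Hypothetical wrapper: release the device copy created above.
template <typename T>
void sketch_delete_from_device(T* h_ptr, T* d_ptr, std::size_t n) {
#if defined(SKETCH_USE_OPENMP_OFFLOAD)
    const int device = omp_get_default_device();
    int status = omp_target_disassociate_ptr(h_ptr, device);
    assert(status == 0);
    (void) status;
    (void) n;
    omp_target_free(d_ptr, device);
#elif defined(SKETCH_USE_OPENACC)
    (void) d_ptr;
    acc_delete(h_ptr, n * sizeof(T));
#else
    (void) h_ptr;
    (void) d_ptr;
    (void) n;
#endif
}
```

A caller would pair the two, for example `double* d = sketch_copy_to_device(v, n);` followed later by `sketch_delete_from_device(v, d, n);`. Checking the OpenMP return codes mirrors the "Add nrn_assert for OpenMP target API checks" commit in spirit, using plain `assert` here.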

**Use certain branches for the GitLab/SimulationStack CI**

CI_BRANCHES:NMODL_BRANCH=hackathon_main,NEURON_BRANCH=master,

bbpbuildbot · 2021-12-06T21:46:07Z

Logfiles from GitLab pipeline #28284 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-07T11:30:18Z

Logfiles from GitLab pipeline #28369 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-07T14:50:16Z

Logfiles from GitLab pipeline #28405 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-07T15:16:06Z

Logfiles from GitLab pipeline #28422 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-08T10:13:12Z

Logfiles from GitLab pipeline #28474 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-08T13:33:19Z

Logfiles from GitLab pipeline #28511 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-08T14:58:17Z

Logfiles from GitLab pipeline #28535 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-08T17:34:55Z

Logfiles from GitLab pipeline #28569 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-08T22:45:47Z

Logfiles from GitLab pipeline #28578 (:no_entry:) have been uploaded here!


bbpbuildbot · 2021-12-09T09:34:40Z

Logfiles from GitLab pipeline #28601 (:white_check_mark:) have been uploaded here!


bbpbuildbot · 2021-12-09T12:28:35Z

Logfiles from GitLab pipeline #28639 (:white_check_mark:) have been uploaded here!


Summary of changes:

- Support OpenMP target offload when NMODL and GPU support are enabled. (#693, #704, #705, #707, #708, #716, #719)
- Use sensible defaults for the --nwarp parameter, improving the performance of the Hines solver with --cell-permute=2 on GPU. (#700, #710, #718)
- Use a Boost memory pool, if Boost is available, to reduce the number of independent CUDA unified memory allocations used for Random123 stream objects. This speeds up initialisation of models using Random123, and also makes it feasible to use NSight Compute on models using Random123 and for NSight Systems to profile initialisation. (#702, #703)
- Use -cuda when compiling with NVHPC and OpenACC or OpenMP, as recommended on the NVIDIA forums. (#721)
- Do not compile for compute capability 6.0 by default, as this is not supported by NVHPC with OpenMP target offload.
- Add new GitLab CI tests so we test CoreNEURON + NMODL with both OpenACC and OpenMP. (#698, #717)
- Add CUDA runtime header search path explicitly, so we don't rely on it being implicit in our NVHPC localrc.
- Cleanup unused code. (#711)

Co-authored-by: Pramod Kumbhar
Co-authored-by: Ioannis Magkanaris
Co-authored-by: Christos Kotsalos
Co-authored-by: Nicolas Cornu

Summary of changes:

- Support OpenMP target offload when NMODL and GPU support are enabled. (BlueBrain/CoreNeuron#693, BlueBrain/CoreNeuron#704, BlueBrain/CoreNeuron#705, BlueBrain/CoreNeuron#707, BlueBrain/CoreNeuron#708, BlueBrain/CoreNeuron#716, BlueBrain/CoreNeuron#719)
- Use sensible defaults for the --nwarp parameter, improving the performance of the Hines solver with --cell-permute=2 on GPU. (BlueBrain/CoreNeuron#700, BlueBrain/CoreNeuron#710, BlueBrain/CoreNeuron#718)
- Use a Boost memory pool, if Boost is available, to reduce the number of independent CUDA unified memory allocations used for Random123 stream objects. This speeds up initialisation of models using Random123, and also makes it feasible to use NSight Compute on models using Random123 and for NSight Systems to profile initialisation. (BlueBrain/CoreNeuron#702, BlueBrain/CoreNeuron#703)
- Use -cuda when compiling with NVHPC and OpenACC or OpenMP, as recommended on the NVIDIA forums. (BlueBrain/CoreNeuron#721)
- Do not compile for compute capability 6.0 by default, as this is not supported by NVHPC with OpenMP target offload.
- Add new GitLab CI tests so we test CoreNEURON + NMODL with both OpenACC and OpenMP. (BlueBrain/CoreNeuron#698, BlueBrain/CoreNeuron#717)
- Add CUDA runtime header search path explicitly, so we don't rely on it being implicit in our NVHPC localrc.
- Cleanup unused code. (BlueBrain/CoreNeuron#711)

Co-authored-by: Pramod Kumbhar
Co-authored-by: Ioannis Magkanaris
Co-authored-by: Christos Kotsalos
Co-authored-by: Nicolas Cornu

CoreNEURON Repo SHA: BlueBrain/CoreNeuron@423ae6c

pramodk marked this pull request as draft December 6, 2021 21:00

Base automatically changed from olupton/basic-openmp to hackathon_main December 7, 2021 12:13

Add functions for using openmp or oacc

97466ea

alkino force-pushed the pramodk/basic-openmp branch from a6dcbf3 to 97466ea Compare December 7, 2021 15:06

fix typo and merge: p -> h_ptr and CORENRN_PREFER_OPENMP_OFFLOAD -> CORENEURON_PREFER_OPENMP_OFFLOAD

2a538ee

Fix update to => update from

da86888

More acc_* deletion

6147642

Fix missing pragmas

b5665fa

Add -mp=gpu in order to link gpu runtime with tests as well

f4d3be5
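The update-related commits above replace runtime calls with pragmas and pair each OpenACC update with an OpenMP one. A minimal sketch of that pairing follows, relying on the standard correspondence that OpenACC `update device` matches OpenMP `target update to` and OpenACC `update self` matches `target update from` (mixing up the two directions is presumably what the "Fix update to => update from" commit corrected). The helper names are hypothetical, and using `CORENEURON_PREFER_OPENMP_OFFLOAD` as the backend switch is an assumption.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper: refresh the device copy of data[0:n] from the host.
inline void sketch_update_device(double* data, std::size_t n) {
    assert(data != nullptr);
#if defined(CORENEURON_PREFER_OPENMP_OFFLOAD) && defined(_OPENMP)
    #pragma omp target update to(data[0:n])
#elif defined(_OPENACC)
    #pragma acc update device(data[0:n])
#else
    (void) n;  // host-only build: nothing to transfer
#endif
}

// Hypothetical helper: refresh the host copy of data[0:n] from the device.
inline void sketch_update_self(double* data, std::size_t n) {
    assert(data != nullptr);
#if defined(CORENEURON_PREFER_OPENMP_OFFLOAD) && defined(_OPENMP)
    #pragma omp target update from(data[0:n])
#elif defined(_OPENACC)
    #pragma acc update self(data[0:n])
#else
    (void) n;  // host-only build: nothing to transfer
#endif
}
```

Both helpers assume that `data[0:n]` is already present on the device (for example after an earlier copy-in); in a host-only build they do nothing.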

pramodk added 2 commits December 9, 2021 09:25

temporary workaround for testing: IvocVect copying & association fails with OpenMP

350fa55

Add nrn_assert for OpenMP target API checks

35d4cdf

Fix the issue with VecPlayContinuous copy to GPU

29b358a

* IvocVect members t_ and y_ were copied twice
* Only discon_indices_ is a pointer, and hence that needs to be copied
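The distinction in this commit message can be sketched as follows. By-value members travel with the enclosing object in a single copy, whereas a pointer member needs its pointee copied separately and the device-side pointer patched. All types and function names below are hypothetical stand-ins, not the real `VecPlayContinuous` or `IvocVect` definitions, and the vector type is deliberately flattened to a fixed-size array to keep the example short.

```cpp
#include <cassert>
#include <cstddef>

#if defined(CORENEURON_PREFER_OPENMP_OFFLOAD) && defined(_OPENMP)
#include <omp.h>
#define SKETCH_USE_OPENMP_OFFLOAD 1
#elif defined(_OPENACC)
#include <openacc.h>
#define SKETCH_USE_OPENACC 1
#endif

// Hypothetical, simplified stand-ins for the real classes.
struct SketchVec {
    std::size_t size;
    double values[4];
};
struct SketchPlay {
    SketchVec y;        // by-value member: stored inside SketchPlay itself
    SketchVec* discon;  // pointer member: the pointee is stored elsewhere
};

// Copy `bytes` bytes at `h` to the device; host-only builds return `h`.
inline void* sketch_copyin(void* h, std::size_t bytes) {
#if defined(SKETCH_USE_OPENMP_OFFLOAD)
    const int device = omp_get_default_device();
    void* d = omp_target_alloc(bytes, device);
    assert(d != nullptr);
    int status = omp_target_memcpy(d, h, bytes, 0, 0, device, omp_get_initial_device());
    assert(status == 0);
    (void) status;
    return d;
#elif defined(SKETCH_USE_OPENACC)
    return acc_copyin(h, bytes);
#else
    (void) bytes;
    return h;
#endif
}

// Store the pointer `value` into the device memory location `d_slot`.
inline void sketch_set_device_pointer(void* d_slot, void* value) {
#if defined(SKETCH_USE_OPENMP_OFFLOAD)
    int status = omp_target_memcpy(d_slot, &value, sizeof(void*), 0, 0,
                                   omp_get_default_device(), omp_get_initial_device());
    assert(status == 0);
    (void) status;
#elif defined(SKETCH_USE_OPENACC)
    acc_memcpy_to_device(d_slot, &value, sizeof(void*));
#else
    (void) d_slot;  // host-only build: the pointer is already valid
    (void) value;
#endif
}

inline SketchPlay* sketch_copy_play_to_device(SketchPlay* h_play) {
    // A single copy moves the whole object, including the by-value member y,
    // so copying y again would duplicate it.
    auto* d_play = static_cast<SketchPlay*>(sketch_copyin(h_play, sizeof(SketchPlay)));
    if (h_play->discon != nullptr) {
        // The pointee needs its own copy, and the device-side pointer must
        // be redirected from the host address to the device address.
        void* d_discon = sketch_copyin(h_play->discon, sizeof(SketchVec));
        sketch_set_device_pointer(&d_play->discon, d_discon);
    }
    return d_play;
}
```

The host object is never modified: only the device-side copy of the `discon` pointer is patched, so host code can keep using `h_play->discon` unchanged.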

pramodk marked this pull request as ready for review December 9, 2021 12:31

pramodk merged commit 02abf78 into hackathon_main Dec 9, 2021

pramodk deleted the pramodk/basic-openmp branch December 9, 2021 13:13
