You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Mar 20, 2023. It is now read-only.
Integrate changes from NERSC GPU hackathon. (#713)
Summary of changes:
- Support OpenMP target offload when NMODL and GPU support are enabled.
(#693, #704, #705, #707, #708, #716, #719)
- Use sensible defaults for the --nwarp parameter, improving the performance
of the Hines solver with --cell-permute=2 on GPU. (#700, #710, #718)
- Use a Boost memory pool, if Boost is available, to reduce the number of
independent CUDA unified memory allocations used for Random123 stream
objects. This speeds up initialisation of models using Random123, and also
makes it feasible to use NSight Compute on models using Random123 and for
NSight Systems to profile initialisation. (#702, #703)
- Use -cuda when compiling with NVHPC and OpenACC or OpenMP, as recommended
on the NVIDIA forums. (#721)
- Do not compile for compute capability 6.0 by default, as this is not
supported by NVHPC with OpenMP target offload.
- Add new GitLab CI tests so we test CoreNEURON + NMODL with both OpenACC and
OpenMP. (#698, #717)
- Add CUDA runtime header search path explicitly, so we don't rely on it being
implicit in our NVHPC localrc.
- Cleanup unused code. (#711)
Co-authored-by: Pramod Kumbhar <pramod.kumbhar@epfl.ch>
Co-authored-by: Ioannis Magkanaris <iomagkanaris@gmail.com>
Co-authored-by: Christos Kotsalos <christos.kotsalos@epfl.ch>
Co-authored-by: Nicolas Cornu <nicolas.cornu@epfl.ch>
0 commit comments