hipCUB: a thin wrapper around rocPRIM (on AMD GPUs) or CUB (on NVIDIA GPUs) that provides GPU parallel primitives
https://github.com/ROCm/rocm-libraries/tree/develop/projects/hipcub