Access pointer to MultiFAB data for ALL boxes/fabs at once #3828
Unanswered
tony3601000 asked this question in Q&A
-
Maybe you can use the batched version of cublasDgemv. Or it may not be too bad to copy everything into a contiguous chunk of memory and run cuBLAS there. Or you can allocate the memory yourself and give MultiFab the pointers. In that case, MultiFab holds only non-owning pointers that you provide, so you need to manage the memory allocation and deallocation. Or we could add an option to allocate everything in a contiguous chunk of memory. It's not hard to do.
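To make the "allocate the memory and hand out non-owning pointers" idea concrete, here is a minimal host-side sketch in plain C++. It does not use AMReX or cuBLAS: `FabView`, `ContiguousFabs`, and `scale_all` are hypothetical names made up for illustration, and the loop in `scale_all` only stands in for whatever single library call would run over all cells at once. On a GPU the storage would presumably be device memory instead of a `std::vector`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a fab: a non-owning view of `size` values.
struct FabView {
    double* data;      // points into the shared buffer; not owned
    std::size_t size;  // number of values belonging to this fab
};

// Owns one contiguous allocation and hands out per-fab views into it.
struct ContiguousFabs {
    std::vector<double> storage;  // single contiguous chunk for all fabs
    std::vector<FabView> fabs;    // one non-owning view per fab

    explicit ContiguousFabs(const std::vector<std::size_t>& sizes) {
        std::size_t total = 0;
        for (std::size_t n : sizes) total += n;
        storage.assign(total, 0.0);

        // Carve the chunk into back-to-back pieces, one per fab.
        std::size_t offset = 0;
        fabs.reserve(sizes.size());
        for (std::size_t n : sizes) {
            fabs.push_back(FabView{storage.data() + offset, n});
            offset += n;
        }
        assert(offset == total);
    }

    // The views point into `storage`, so copying would leave them dangling.
    ContiguousFabs(const ContiguousFabs&) = delete;
    ContiguousFabs& operator=(const ContiguousFabs&) = delete;

    double* data() { return storage.data(); }
    std::size_t size() const { return storage.size(); }
};

// Stand-in for one library call that operates on all cells at once.
void scale_all(ContiguousFabs& mf, double a) {
    double* p = mf.data();
    for (std::size_t i = 0; i < mf.size(); ++i) p[i] *= a;
}
```

With this layout, per-fab code keeps using `fabs[i].data`, while a whole-domain operation takes the single base pointer from `data()` and the total length from `size()`. The owner of the chunk is responsible for keeping it alive for as long as any view is in use.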
-
@tony3601000 Could you give this a try? #3857
-
Hi there,
I need to perform operations for all cells in parallel using cuBLAS. Since the cells are grouped into boxes/fabs, I currently have the following code (simplified):
Ideally, I'd like to perform something like this instead:
I'm aware of the following strategies:
But these do not seem to be stored in a contiguous format. Otherwise, what is the best approach to implementing parallel GPU computing for a cluster of small fabs, each with a few cells?
Thank you for the help!
Best,
Anthony