Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Is there a typical parallelism scenario for hybrid MPI / OpenMP codes?

April 26, 2017hybrid MPI OpenMP parallelism scenario

0

Posted

Is there a typical parallelism scenario for hybrid MPI / OpenMP codes?

1 Answer

0

Posted

A typical scenario is to use MPI for domain decomposition with four or eight MPI processes per node and then use the remaining cores for OpenMP threads of parallelism within each domain. Frequently, this additional parallelism is at the loop level but the more computation per thread, the better. The threads belonging to each MPI process carry out their computation until some synchronization point or until they’ve completed. It is important to remember not to use more than six OpenMP threads per NUMA node (24 per node).