Scheduling Communications
Is it better to use MPI_Waitall or to schedule/order the requests?
- Does the implementation complete the requests passed to MPI_Waitall in any order, or does it favor the order in which they appear in the array of requests?
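In principle, it should always be best to hand all the requests to MPI_Waitall and let MPI schedule the operations. In practice, it may be better to order either the short or the long messages first, depending on how the implementation transfers data (for example, whether short messages are sent eagerly while long ones first wait for a handshake with the receiver).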
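As a rough sketch of the two options, the C fragment below either posts the sends in their given order and leaves completion to MPI_Waitall, or sorts them by length first. The msg_t descriptor and the order_messages and send_all helpers are hypothetical names introduced for illustration only; the MPI part is compiled only when USE_MPI is defined (also an assumption of this sketch), so the ordering logic can be tried without an MPI installation. Which ordering, if any, helps has to be measured on the target system.

```c
#include <assert.h>
#include <stdlib.h>
#ifdef USE_MPI
#include <mpi.h>
#endif

/* Hypothetical descriptor for one outgoing message. */
typedef struct {
    int dest;      /* destination rank */
    int count;     /* number of doubles to send */
    double *buf;   /* payload */
} msg_t;

/* qsort comparator: ascending message length. */
static int by_count_asc(const void *a, const void *b) {
    int ca = ((const msg_t *)a)->count;
    int cb = ((const msg_t *)b)->count;
    return (ca > cb) - (ca < cb);
}

/* qsort comparator: descending message length. */
static int by_count_desc(const void *a, const void *b) {
    return by_count_asc(b, a);
}

/* Reorder msgs in place: longest first if longest_first is nonzero,
 * otherwise shortest first. */
void order_messages(msg_t *msgs, int n, int longest_first) {
    assert(msgs != NULL || n == 0);
    if (n > 1)
        qsort(msgs, (size_t)n, sizeof(msg_t),
              longest_first ? by_count_desc : by_count_asc);
}

#ifdef USE_MPI
/* Post every send in array order, then wait for all of them.
 * The order in which the requests actually complete is left to
 * the MPI implementation. */
int send_all(msg_t *msgs, int n, MPI_Comm comm) {
    if (n <= 0) return MPI_SUCCESS;
    MPI_Request *reqs = malloc((size_t)n * sizeof *reqs);
    if (reqs == NULL) return MPI_ERR_NO_MEM;
    for (int i = 0; i < n; i++)
        MPI_Isend(msgs[i].buf, msgs[i].count, MPI_DOUBLE,
                  msgs[i].dest, 0, comm, &reqs[i]);
    int rc = MPI_Waitall(n, reqs, MPI_STATUSES_IGNORE);
    free(reqs);
    return rc;
}
#endif
```

To compare the strategies, time send_all on the messages as given, then again after order_messages(msgs, n, 0) and after order_messages(msgs, n, 1). If the timings do not differ, the simpler unordered MPI_Waitall is the one to keep.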