【发布时间】:2011-09-25 14:37:38
【问题描述】:
我刚刚试验了 MPI,复制并运行了这段代码,取自 [LLNL MPI 教程][1] 的第二个代码示例。
#include <mpi.h>
#include <stdlib.h>
#include <stdio.h>
int main(int argc, char ** argv) {
int num_tasks, rank, next, prev, buf[2], tag1 = 1, tag2 = 2;
MPI_Request reqs[4];
MPI_Status status[2];
MPI_Init(&argc, &argv);
MPI_Comm_size(MPI_COMM_WORLD, &num_tasks);
MPI_Comm_rank(MPI_COMM_WORLD, &rank);
prev = rank - 1;
next = rank + 1;
if (rank == 0) prev = num_tasks - 1;
if (rank == (num_tasks - 1)) next = 0;
MPI_Irecv(&buf[0], 1, MPI_INT, prev, tag1, MPI_COMM_WORLD,
&reqs[0]);
MPI_Irecv(&buf[1], 1, MPI_INT, next, tag2, MPI_COMM_WORLD,
&reqs[1]);
MPI_Isend(&rank, 1, MPI_INT, prev, tag2, MPI_COMM_WORLD, &reqs[2]);
MPI_Isend(&rank, 1, MPI_INT, next, tag1, MPI_COMM_WORLD, &reqs[3]);
MPI_Waitall(4, reqs, status);
printf("Task %d received %d from %d and %d from %d\n",
rank, buf[0], prev, buf[1], next);
MPI_Finalize();
return EXIT_SUCCESS;
}
我本来期望这样的输出(例如,4 个任务):
$ mpiexec -n 4 ./m3
Task 0 received 3 from 3 and 1 from 1
Task 1 received 0 from 0 and 2 from 2
Task 2 received 1 from 1 and 3 from 3
Task 3 received 2 from 2 and 0 from 0
但是,相反,我得到了这个:
$ mpiexec -n 4 ./m3
Task 0 received 0 from 3 and 1 from 1
Task 1 received 0 from 0 and 2 from 2
Task 3 received 0 from 2 and 0 from 0
Task 2 received 0 from 1 and 3 from 3
也就是说,进入缓冲区 buf[0] 的消息(带有标记 == 1)总是得到值 0。此外,如果我更改代码以便将缓冲区声明为 buf[3] 而不是 buf[2 ],并用 buf[2] 替换 buf[0] 的每个实例,然后我就得到了我所期望的输出(即上面给出的第一个输出集)。出于某种原因,这看起来好像有些东西正在用 0 覆盖 buf[0] 中的值。但我看不出那可能是什么。顺便说一句,据我所知,我的代码(没有修改)与教程中的代码完全匹配,除了我的 printf。
谢谢!
【问题讨论】:
标签: c buffer mpi parallel-processing