Problem with more than 1 node.

Solntsev Pasha


I am using IntelMPI version (G). I wrote a script
to start Firefly via PBS, set the appropriate variables, and submitted the job.
Firefly works fine on one node with 8 CPUs. Then I decided to
run it on 2 nodes (a small 2-minute test), but unfortunately it would not
start. In the file given by "#PBS -o file" I found this:

mpdboot_cl1n110 (handle_mpd_output 850): from mpd on cl1n124, invalid
port info:
cl1n124: Connection refused

I started Firefly via mpirun:

mpirun -np 8 firefly -r -f -p -stdext -ex /home/xe2/solntsev/bin \
-i /home/xe2/solntsev/work/test/ester_c2_mp2.inp \
-o /home/xe2/solntsev/work/test/ester_c2_mp2.out \
-t /scratch1/solntsev/687135.h2moabtorque
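
To confirm which hosts PBS actually assigned to the job, a check like the one below could be added to the job script before the mpirun line. It is only a sketch: it assumes the scheduler sets $PBS_NODEFILE inside the job and that the file lists one host name per allocated slot. The function names summarise_nodes and count_nodes are hypothetical helpers, not part of PBS or IntelMPI.

```shell
#!/bin/sh
# Sketch: summarise the hosts PBS assigned to this job.
# Assumption: $PBS_NODEFILE is set inside the job and has one host name per slot.

# Print each distinct host followed by the number of slots it contributes.
summarise_nodes() {
    sort "$1" | uniq -c | awk '{ print $2, $1 }'
}

# Print the number of distinct hosts in the node file.
count_nodes() {
    sort -u "$1" | wc -l | tr -d ' '
}

# Only report when running inside a PBS job with a readable node file.
if [ -n "${PBS_NODEFILE:-}" ] && [ -r "$PBS_NODEFILE" ]; then
    echo "distinct hosts: $(count_nodes "$PBS_NODEFILE")"
    summarise_nodes "$PBS_NODEFILE"
fi
```

If this reports only one distinct host, the job was not given two nodes in the first place; if it reports two, the failure is more likely in starting the MPI daemons on the second host.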

I also tried another cluster, with the same problem:
I can run Firefly on 8 CPUs, but only on one node.
There the output file (#PBS -o file) is empty, but the error file (#PBS -e file) contained this error:

+ mpirun -np 8 /home/it2/solntsev/bin/firefly -r -f -p -stdext \
-ex /home/it2/solntsev/bin -i /home/it2/solntsev/work/job47.inp \
-o /home/it2/solntsev/work/job47.out \
-t /scratch1/solntsev/34181.node1081.localdomain
Traceback (most recent call last):
 File "", line 918, in
 File "", line 669, in mpdboot
 File "", line 758, in launch_one_mpd
 File "/usr/lib64/python2.6/", line 595, in __init__
   errread, errwrite)
 File "/usr/lib64/python2.6/", line 1106, in
   raise child_exception
OSError: [Errno 2] No such file or directory
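
"No such file or directory" raised while launching a child process usually means the program to be started could not be found. A quick check of what is on PATH inside the job could look like the sketch below. Which command mpdboot tries to launch here is an assumption on my part (rsh and ssh are only candidates, since the file names are missing from the traceback), and have_cmd is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch: report whether candidate commands can be found on PATH.
# Assumption: mpdboot needs a remote shell (rsh or ssh) and python to start mpd.

# Succeed if the named command can be found on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

for c in rsh ssh python; do
    if have_cmd "$c"; then
        echo "$c: $(command -v "$c")"
    else
        echo "$c: not found on PATH"
    fi
done
```

Running this from within the PBS job (not from the login shell) matters, because the job environment may have a different PATH.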

Can you give me any advice on how to solve this problem? If you need any extra information, just let me know.

Many thanks, Pavel.

