[molpro-user] Killing Molpro Jobs using SGE
Javier Díaz Montes
javier.diaz at uclm.es
Tue Dec 4 12:14:38 GMT 2007
Hi,
I have 2 processes tree in the MPICH master:
2679 ? S 4:45 /opt/gridengine/bin/lx26-amd64/sge_execd
27260 ? S 0:00 \_ sge_shepherd-230612 -bg
27315 ? Ss 0:00 \_ -sh
/opt/gridengine/default/spool/compute-0-19/job_scripts/230612
27423 ? S 0:00 \_ /bin/bash -xv .gw_mluisa_1/.wrapper
27442 ? S 0:00 \_ /bin/bash
/home/programs/molpro/bin/gwmolprop MgO3is2dbl1.com
27472 ? S 0:00 \_
/home/programs/molpro/molpro-Linux-x86_64-i8-2006.1/molpro -n6
--mpirun-machinefile /home/dartakan/.230612/machines
27473 ? S 0:00 \_ mpirun -np 6
-machinefile /home/dartakan/.230612/machines
/home/programs/molpro/molpro-Linux-x86_64-i8-2006.1/molprop_2006_1_i8_p4_mpi.exe
27480 ? Ss 0:00 orted --bootproxy 1 --name 0.0.6 --num_procs 7
--vpid_start 0 --nodename compute-0-19 --universe
dartakan at compute-0-19.local:default-universe --nsreplica
"0.0.0;tcp://192.168.1.235:33312" --gprreplica
"0.0.0;tcp://192.168.1.235:33312" --mpi-call-yield 0
27481 ? R 654:36 \_
/home/programs/molpro/molpro-Linux-x86_64-i8-2006.1/molprop_2006_1_i8_p4_mpi.exe
27483 ? S 0:04 \_
/home/programs/molpro/molpro-Linux-x86_64-i8-2006.1/molprop_2006_1_i8_p4_mpi.exe
And this other one on each MPICH slave:
4471 ? Ss 0:00 orted --bootproxy 1 --name 0.0.5 --num_procs 7
--vpid_start 0 --nodename compute-0-5 --universe dar
4472 ? R 624:03 \_
/home/programs/molpro/molpro-Linux-x86_64-i8-2006.1/molprop_2006_1_i8_p4_mpi.exe
4474 ? S 0:00 \_
/home/programs/molpro/molpro-Linux-x86_64-i8-2006.1/molprop_2006_1_i8_p4_mpi.exe
Then, if I kill a job from SGE, The second molprop_2006_1_i8_p4_mpi.exe
process will start to run
Regards,
Javi
En Tue, 04 Dec 2007 12:31:22 +0100, Reuti <reuti at staff.uni-marburg.de>
escribió:
> Hi,
>
>> I have a problem when I run Molpro in several nodes of a cluster. The
>> problem is when I try to kill a job.
>> If I kill the job, using the qdel command of sge, some processes of
>> Molpro remain running. I have seen that molpro create 2 processes, in
>> each node, of molprop_2006_1_i4_p4_mpi.exe, one running and another one
>> sleep. Then, when I kill the job, the processes running are killed and
>> the process which was sleeping starts to run.
>
> this is unusual. For MPICH it's normal to have two processes for each
> started task.
>
> What does the process tree look like on one of these nodes with a
> running job:
>
> ps -e f -o pid,ppid,pgrp,command
>
> (f w/o -) -- Reuti
>
>> Molpro create 2 processes of molprop_2006_1_i4_p4_mpi.exe in each node,
>> is this normal?
>> How could I kill a Molpro job?, because Now I have to kill these
>> processes using pkill command in each node.
>>
>> Regards,
>> Javi
>>
>>
>> --+---------------------------------------------------------------+
>> Javier Diaz Montes
>> PhD Candidate
>> Grupo de Quimica Computacional y Computacion de Alto Rendimiento.
>> Departamento de Tecnologias y Sistemas de Informacion.
>> Escuela Superior de Informatica.
>> Universidad de Castilla-La Mancha.
>> Paseo de la Universidad, 4; 13071 Ciudad Real; SPAIN
>> Tel.: 34-926295300; Ext: 3724
>> e-mail: javier.diaz at uclm.es
>> +---------------------------------------------------------------+
>
>
>
More information about the Molpro-user
mailing list