[molpro-user] possible bug: Molpro not releasing deleted scratch files
Andy May
MayAJ1 at cardiff.ac.uk
Thu Feb 21 10:26:24 GMT 2013
Jörg,
When Molpro opens a scratch file it immediately calls unlink() on that
file. This removes the file's directory entry, but the file system does
not free the disk space until every process that has the file open either
closes it (i.e. when Molpro has finished with it) or terminates. So the
reason these files are not actually deleted until you kill the process is
most likely that they are still in use. My guess is that your
calculation really does need this amount of disk space.
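To illustrate the behaviour described above, here is a minimal Python sketch of the general POSIX pattern (not Molpro's actual code). The function name, file name and temporary directory are made up for the example, and it assumes a Linux-like system:

```python
import os
import tempfile


def unlink_while_open(payload: bytes = b"x" * 4096):
    """Create a scratch file, unlink it at once, and keep using it."""
    scratch_dir = tempfile.mkdtemp()  # hypothetical scratch directory
    path = os.path.join(scratch_dir, "scratch.tmp")
    fd = os.open(path, os.O_RDWR | os.O_CREAT | os.O_EXCL, 0o600)
    try:
        os.unlink(path)                 # the name leaves the directory now
        visible = os.path.exists(path)  # so the path can no longer be found
        os.write(fd, payload)           # but the open descriptor still works
        st = os.fstat(fd)               # link count and size of the open file
        os.lseek(fd, 0, os.SEEK_SET)
        data = os.read(fd, len(payload))
        return visible, st.st_nlink, st.st_size, data == payload
    finally:
        os.close(fd)                    # only after this can the space be freed
        os.rmdir(scratch_dir)
```

Between the unlink() and the close(), the file takes up space that df counts, while du, which walks directory entries, can no longer find it.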
The reason we use unlink() instead of putting in a later call to delete
the file is to cover the case of the program crashing, or the user killing
the job. In both of these cases, without the early call to unlink() the
files would remain on disk permanently, since the file-delete code
would never have been reached. Of course that's no problem on clusters
where the batch scheduler has been set up to clean scratch, but in most
other cases the scratch files would have to be removed manually.
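The difference between the two strategies can be sketched in Python as follows. This is only an illustration under assumed names (CHILD, file_left_behind, scratch.tmp), not Molpro's code: a child process opens a scratch file, optionally unlinks it straight away, and then "crashes" before it reaches its delete call.

```python
import os
import subprocess
import sys
import tempfile

# Child program: opens a scratch file and exits abruptly before cleanup.
CHILD = r"""
import os, sys
path = sys.argv[1]
early_unlink = sys.argv[2] == "early"
fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
if early_unlink:
    os.unlink(path)      # unlink immediately after opening
os.write(fd, b"scratch data")
os._exit(1)              # simulated crash
os.unlink(path)          # late delete: never reached
"""


def file_left_behind(mode: str) -> bool:
    """Run the child with mode 'early' or 'late'; report any leftover file."""
    scratch_dir = tempfile.mkdtemp()  # hypothetical scratch directory
    path = os.path.join(scratch_dir, "scratch.tmp")
    subprocess.run([sys.executable, "-c", CHILD, path, mode])
    left = os.path.exists(path)
    if left:
        os.remove(path)               # the manual cleanup a user would need
    os.rmdir(scratch_dir)
    return left
```

With "early" nothing is left on disk after the crash; with "late" the scratch file survives and has to be removed by hand.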
Best wishes,
Andy
On 20/02/13 22:18, Jörg Saßmannshausen wrote:
> Dear all,
>
> I have come across something highly annoying in Molpro: when I run a
> rather large LCCSD(T)/cc-pVTZ calculation, the program does not release the
> deleted files and thus fills up the scratch space:
>
> df /scr/
> Filesystem Size Used Avail Use% Mounted on
> /dev/md0 932G 932G 88K 100% /scr
>
> but:
> du -sh /scr/molpro/
> 35G /scr/molpro/
>
> The program crashes because the scratch is full, but one process is still
> running. When I then run lsof (output attached), I see a number of files
> like this:
>
> molpro.ex 2452 sassy 19u REG 9,0 73 134219798 /scr/molpro/init_job5000002452 (deleted)
>
> Once I kill the program in the queue, it releases the deleted files and I
> get my space back.
>
> $ df /scr/
> Filesystem Size Used Avail Use% Mounted on
> /dev/md0 932G 39G 893G 5% /scr
>
> This is not what I expect. I would have thought that once a file is deleted
> it is released so the file system can remove it. As that is not happening,
> the 1 TB scratch fills up and my calculation crashes :-(
>
> I am using this version of Molpro:
> SHA1 : 2c68d29c09da70e1723824271fadde4bcd5f07a0
> ARCHNAME : Linux/x86_64
> FC : /opt/intel/compilerpro-12.0.2.137/bin/intel64/ifort
> FCVERSION : 12.0.2
>
> Is that a bug, or is there something I am doing wrong here? I am using xfs
> for scratch.
>
> All the best from London!
>
> Jörg
>
>
>
> _______________________________________________
> Molpro-user mailing list
> Molpro-user at molpro.net
> http://www.molpro.net/mailman/listinfo/molpro-user
>