[Carpet] memory leak while running with carpet

Yosef Zlochower yosef at astro.rit.edu
Tue Jul 10 13:06:20 CEST 2007


Thanks Erik,

  I'll test the code by removing those thorns and checking the memory load.


Yosef

Erik Schnetter wrote:
> Hi Yosef,
>
> we found that the HDF5 library allocates some memory for internal 
> purposes, and continues to hold on to it, probably buffering 
> something.  This is much worse for 1.6.x than for 1.8.x.  You could 
> give 1.8.x a try.
>
> Christian Ott found a misbehaviour of the GNU libc which is triggered 
> by CarpetIOASCII (and probably also CarpetIOScalar and 
> CarpetIOBasic.)  Can you try running without them (or without 
> outputting anything with them) to see whether this helps?
>
> Apart from this a memory leak is difficult to track down.  It could be 
> anywhere -- Cactus, Carpet, your code, or it could be some interaction 
> between them.  There are no "relevant sections" of a parameter file; 
> there can only be such section if you know already where the memory 
> leak occurs.
>
> Can you try to remove thorns from your thorn list, trying to make the 
> memory leak go away?  After that it is necessary to watch the 
> simulation, probably in a debugger, and catch it increasing its memory 
> usage.  If you can point me to a routine we can go on from there.
>
> I'm afraid that your section of a parameter file doesn't ring any 
> bell.  Your parameter settings look pretty normal, and I don't see 
> such a drastic problem.
>
> -erik
>
> On Jul 10, 2007, at 03:54:45, Yosef Zlochower wrote:
>
>> Hi,
>>
>>   I forgot to mention that the memory leak is quite significant. The
>> memory load tends
>>  to double after a few hours of runtime.
>>
>> Yosef
>>
>> Yosef Zlochower wrote:
>>> Hi
>>>
>>>  I am trying to locate the source of a memory leak in my code. I am 
>>> using
>>> stable (version 3) version of carpet and I noticed that the memory 
>>> usage
>>> continuously increases with time. If  I  checkpoint and restart, then
>>> the memory usage goes back to its original size, but then increases
>>> with time.
>>>
>>> I attached the relevant sections of the par file. Any suggestion on
>>> how I can
>>> track down the source of this leak. I am running the code on a cluster
>>> consisting
>>> of dual core AMD X86_64 procs with HTX Infiniband interconnect and the
>>> pathscale compiler.
>>>
>>> Thanks
>>>
>>> Yosef
>>> ------------------------------------------------------------------------ 
>>>
>>>



More information about the developers mailing list