High RAM usage for May 15th release

I’ve exhausted the options I can try remotely and am now trying to reproduce this in a VM. If I succeed, I should be able to create a fix.

Is it reproducible with a single farm and a large piece cache, so all pieces are stored locally? I used a 1.8T farm with a 5% piece cache for testing and can’t reproduce it so far on Windows 11.

Just confirmed that the May 6th release also has the memory issue; I didn’t find this out before because I only used that version for a very short time. 96GB of memory ran out after two days, and the node exited after failing to allocate memory. I have other applications running on this machine, but they are not particularly memory-hungry, and they were running alongside older versions of the node and farmer without this “failed to allocate memory” error ever appearing. Even months ago, when the farmer used to allocate lots of memory, it would gradually eat all of it but wouldn’t crash.

You probably need at least four farms. I will try to reproduce it with four farms later today. In the short test I ran, I didn’t see the May 6th release having the memory issue.

Well, if you can try to reduce that to one farm and confirm or deny it, that would help; I ideally need the smallest reproduction. I tried plotting with one and two farms for hours (not days, though) and didn’t notice anything special.

I’m looking for patterns. If we can find one, I can modify the software to accelerate reproduction (for example, I tried doing “plotting” without sector encoding), so we can see the behavior quickly and verify the fix.

That is odd and likely a different issue. What was the last version that didn’t have this problem? It is really important to be sure whether the issue is present or not, or else we’ll spend a lot of time checking the wrong things.

With the May 15th release I was able to recreate the issue with 6 plots. I will check the May 6th release next.

I do not see any memory leak with the May 6th release and 6 concurrent plots. I have not tried it with more than 6 plots.

Can you try fewer, maybe? Also, how long do you typically run it before you see the issue, and how severe is it by then?

I tested the May 15th release with both four and six plots. The four-plot configuration didn’t encounter any issues. Based on my experience, it takes approximately three hours to conclude whether there is a problem. In my setup the farmer normally uses around 40GB of RAM; in the runs that had issues, I saw 60–70GB of RAM usage after three hours.

Thanks, very helpful. All of the farms are still plotting, right?

Yes, they are still plotting using May 6th release.

I think I managed to reproduce something like this; I’ll try to narrow it down and hopefully fix it next.


I have theories about what the root cause might be, but it is a pain to deal with Windows, especially when trying to debug something. As such, I decided to simply constrain piece cache reads to a single read at a time. It was almost like this before, except it was also blocking the executor.

Let me know if it helps with the issue at all. If it does, I’m fine keeping it as is on Windows for now and letting other platforms benefit from higher concurrency in the meantime: Snapshot build · subspace/subspace@27b6eec · GitHub
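For anyone curious what “constrain piece cache reads to a single read at a time” means in practice: the usual way to do this without blocking the executor is a counting semaphore that callers must acquire before touching the cache. Below is a minimal, hypothetical sketch of the idea (not the actual farmer code, which is Rust); the names `MAX_CONCURRENT_READS` and `read_piece` are illustrative only. Setting the limit to 1 corresponds to the 27b6eec build, and raising it to 32 corresponds to the later experiment.

```python
# Hypothetical sketch: bounding concurrent "piece cache reads" with a
# counting semaphore. Not the real farmer implementation.
import threading
import time

MAX_CONCURRENT_READS = 1  # 1 in the 27b6eec build; 32 in the e88371b build

read_semaphore = threading.Semaphore(MAX_CONCURRENT_READS)
state_lock = threading.Lock()
current = 0  # reads in flight right now
peak = 0     # highest observed concurrency

def read_piece(piece_index):
    """Simulated cache read that must hold a permit for its duration."""
    global current, peak
    with read_semaphore:  # wait for a permit instead of blocking the executor
        with state_lock:
            current += 1
            peak = max(peak, current)
        time.sleep(0.01)  # stand-in for the actual disk read
        with state_lock:
            current -= 1

# Eight callers race for permits; only MAX_CONCURRENT_READS run at once.
threads = [threading.Thread(target=read_piece, args=(i,)) for i in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print("peak concurrent reads:", peak)  # -> 1
```

The trade-off being tested in the thread is exactly this permit count: 1 serializes all reads (safe but slow), while 32 allows much more parallelism at the cost of more buffers alive at once.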

Thanks. I will try this out.

And assuming it works, try Snapshot build · subspace/subspace@e88371b · GitHub (if it doesn’t, then don’t bother). It does the same thing, but restricts the concurrency of piece reading to 32 instead of 1.

UPD: 1 seems to work from my testing, while 32 seems not to, at least not particularly well memory usage-wise.

The 27b6eec build you shared didn’t work. I was able to reproduce the issue with 5 farms after 12 hours of plotting; RAM usage at that point was 80GB.

Thanks. I have no other ideas than bringing back more or less the old code. Here is a build with that change in progress: Snapshot build · subspace/subspace@9c32cd4 · GitHub

Hopefully that works finally.

Thank you. I’ll give this a try and provide an update. Apologies, Windows has been quite frustrating.

Yeah, I wish it was the first time, but it is not :disappointed: