More rewards on smaller plots

SeryogaLeshii · September 9, 2023, 5:16pm

I added randomisation of global_challenge in each round. Now it takes ~1.79 seconds to audit all 936 sectors. IOWAIT is quite high. The parallel audit still shows ~46ms.

MADV_RANDOM, as I remember, just prevents unneeded read-ahead. MADV_DONTNEED implies that the data is no longer needed and the cache is dropped if there is no memory pressure, but it didn’t help, as expected.

nazar-pc · September 9, 2023, 5:38pm

Okay, so we do need to parallelize it. That is helpful, I’ll see how to do it. I’d like the threads to still be named according to plots for easier debugging, but simply using rayon is easier of course.
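To illustrate the trade-off mentioned above, here is a minimal std-only sketch of one named thread per plot. It is not the farmer's actual code: `audit_plot`, `audit_plots_on_named_threads` and the `audit-plot-N` naming scheme are hypothetical, and the "audit" just echoes a sector count.

```rust
use std::thread;

/// Hypothetical stand-in for auditing one plot; returns the number of sectors audited.
fn audit_plot(sector_count: usize) -> usize {
    // A real audit would read sectors from disk and check them against the challenge.
    sector_count
}

/// Spawns one named thread per plot and returns (thread name, sectors audited) pairs
/// in the same order as the input slice.
fn audit_plots_on_named_threads(plots: &[usize]) -> Vec<(String, usize)> {
    let handles: Vec<_> = plots
        .iter()
        .enumerate()
        .map(|(index, &sector_count)| {
            thread::Builder::new()
                // The name shows up in debuggers and panic messages.
                .name(format!("audit-plot-{}", index))
                .spawn(move || {
                    let name = thread::current().name().unwrap_or("unnamed").to_string();
                    (name, audit_plot(sector_count))
                })
                .expect("failed to spawn audit thread")
        })
        .collect();

    handles
        .into_iter()
        .map(|handle| handle.join().expect("audit thread panicked"))
        .collect()
}
```

Calling `audit_plots_on_named_threads(&[10, 20, 30])` runs three threads named `audit-plot-0` to `audit-plot-2`. A shared thread pool such as Rayon's is less code, but its workers are not tied to a particular plot.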

nazar-pc · September 9, 2023, 6:14pm

Here is a snapshot build (still in progress) with parallel auditing, as implemented in pull request #1944, “Parallelize audit across multiple cores” (subspace/subspace on GitHub): snapshot build subspace/subspace@81ed094.

Once the CI run finishes, executables will be attached to the workflow artifacts and container images will be published as well. Give it a try and let me know how it works.

Once the team reviews it and there is positive feedback, we’ll make another Gemini 3f release with this change included.

Thanks for testing and actionable feedback!

SeryogaLeshii · September 9, 2023, 7:10pm

I’ve started the farmer with these changes. So far I notice a slight decrease in the average IOWAIT, but it takes time to get results.

SeryogaLeshii · September 9, 2023, 7:14pm

One more thing, the farmer earned 2 rewards almost right away. That looks promising.

SeryogaLeshii · September 9, 2023, 7:25pm

+9 rewards. That’s an all-time record. The farmer used to earn 3-4 rewards per hour, but here it’s as many as 9 in about 15 minutes.

SeryogaLeshii · September 9, 2023, 7:39pm

Does it make sense and is it possible to implement parallelism for proving?

nazar-pc · September 9, 2023, 9:11pm

Proving is already highly parallel!

It was a major bottleneck initially and was very heavily optimized. In fact it sacrifices efficiency (by burning more CPU than would be necessary otherwise) and memory usage to achieve lower latency.

We started with something like 1.5 seconds of proving on my machine (Core i9-13900K with 8C/16T performance cores at 5.5GHz and 16 efficiency cores at 4.3GHz); now the same proving takes 150ms (in memory). We know we can still do better, it is just very hard to actually get there. Disk reads are also parallelized there.

SeryogaLeshii · September 9, 2023, 10:19pm

Yes, I noticed that initially, but decided to give it a try nonetheless.

I naively tried to parallelise it using Rayon and, on my machine, got about a 2x reduction in the time the proving benchmark takes on 48 sectors.
11.7s/43.6s → 6.6s

Initially the proving benchmark on 48 sectors took around 11.7 seconds, but it now takes around 43.6 seconds. I can’t tell the exact reason for the difference at the moment.

nazar-pc · September 9, 2023, 10:53pm

You are unlikely to have 48 sectors to prove on a real network with a lot of space, so there is no point in optimizing throughput. As I mentioned above, proving is designed to achieve lower latency for a single sector instead; that was the design goal.

SeryogaLeshii · September 9, 2023, 11:00pm

Got it, thank you for the clarification.

SeryogaLeshii · September 9, 2023, 11:08pm

Increasing the number of Rayon threads to well above the number of threads in the system (via the RAYON_NUM_THREADS environment variable) significantly reduces IOWAIT and thus increases CPU utilisation. In auditing, the most costly part is reading, and SSDs only benefit from parallel operations (if supported by the filesystem), so it helps. It helps a lot.

SeryogaLeshii · September 9, 2023, 11:17pm

By setting the value to 3 times the number of threads in the system, I achieved an average IOWAIT of around 0.1%, with occasional peaks of 0.3-0.4%.
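The idea of oversubscribing workers for I/O-bound auditing can be sketched with the standard library alone. This is an illustration, not the farmer's code and not Rayon: `oversubscribed_threads`, `audit_sector` and `audit_in_parallel` are hypothetical names, and the per-sector "audit" is a placeholder rule rather than a real disk read.

```rust
use std::thread;

/// Worker count for an I/O-bound pool: `multiplier` times the logical CPU count.
fn oversubscribed_threads(logical_cpus: usize, multiplier: usize) -> usize {
    logical_cpus.max(1) * multiplier.max(1)
}

/// Hypothetical stand-in for reading and auditing one sector.
fn audit_sector(sector_index: usize) -> bool {
    // Placeholder rule: pretend every 100th sector holds a solution.
    sector_index % 100 == 0
}

/// Audits sectors `0..sector_count` on `workers` threads and returns the number of hits.
fn audit_in_parallel(sector_count: usize, workers: usize) -> usize {
    let workers = workers.max(1);
    // Ceiling division so that every sector lands in exactly one chunk.
    let chunk = (sector_count + workers - 1) / workers;
    thread::scope(|scope| {
        let handles: Vec<_> = (0..workers)
            .map(|worker| {
                scope.spawn(move || {
                    let start = (worker * chunk).min(sector_count);
                    let end = ((worker + 1) * chunk).min(sector_count);
                    (start..end).filter(|&index| audit_sector(index)).count()
                })
            })
            .collect();
        handles
            .into_iter()
            .map(|handle| handle.join().expect("audit worker panicked"))
            .sum::<usize>()
    })
}
```

On a machine with 16 hardware threads, `oversubscribed_threads(16, 3)` gives 48 workers, which corresponds to running the farmer with `RAYON_NUM_THREADS=48`. The logical CPU count can be queried with `std::thread::available_parallelism()`. When workers spend most of their time blocked on reads, having more of them than cores keeps more I/O requests in flight.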

nazar-pc · September 9, 2023, 11:29pm

The goal is not to reduce IOWAIT to zero, though. We’ll probably have to look into changing the way the farmer works. Right now it is designed to use memory-mapped I/O, but maybe that was not such a good decision after all.

SeryogaLeshii · September 10, 2023, 10:00pm

@nazar-pc Parallel auditing has almost completely removed the impact of plotting on farming. Plotting used to cut the number of rewards a farmer earns by half (and sometimes even more), but now the rewards are either the same or the reduction is minimal.

nazar-pc · September 10, 2023, 10:44pm

Great. I have also created issue #1946, “Make plot reads, at least for auditing, async” (subspace/subspace on GitHub). It’ll be a bit tricky and maybe awkward to do, but it should remove the need to create many threads in the first place and will improve efficiency.

Thanks for testing and all the feedback!

nazar-pc · September 11, 2023, 2:04pm

Here is the official release: gemini-3f-2023-sep-11 (subspace/subspace on GitHub)
