Assuming the issue is the number of streams, you can shut down all Subspace software and try iperf3 with the -P NUM_STREAMS option. I believe the default is 1, but you can try setting it to 10 or 100 and see whether the network goes offline after some reasonable number.
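As a concrete sketch (assuming iperf3 is installed on both ends and you control a machine to run the server side; `SERVER` below is a placeholder for its address):

```shell
# On the remote machine: start an iperf3 server (listens on 5201 by default).
iperf3 -s

# On your home machine: ramp up the number of parallel streams with -P.
# The default is a single stream; Subspace peers can open many more.
iperf3 -c SERVER -P 10 -t 30    # 10 parallel streams for 30 seconds
iperf3 -c SERVER -P 100 -t 30   # 100 streams, near Subspace's per-instance cap
```

If the connection dies somewhere between those runs, that points at a stream/state limit rather than raw bandwidth.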
For your reference, each Subspace instance (both node and farmer) is currently configured for up to 100 outgoing streams by default, but I don’t think it actually reaches that very often.
Just did a test: with the maximum (128) streams I can push almost 1 Gbps, which is the limit of what that server can do.
Current state is that I have Space Acres + 1 node/farmer, and the network is still doing well. The person from the ISP said they had done something, so who knows. Today I will slowly turn on more farmers. I also switched my setup up a bit so that I have one node and each farmer connects to that single node.
Well, it still isn’t fixed, unfortunately. I’ll call them again today and see if I can talk to a tech or someone who can give me more information. It’s a busy week, but I’ll see if I can carve out some time.
The VPN solution still works, and I’m setting up a VLAN anyway for just my crypto servers and plan to put all my crypto on a VPN. My fiancé is tolerating my network crashes, but in the future I’d like to make sure only the crypto stuff is impacted.
The VPN is the holy grail. Syncing is at 50+ blocks/s and the piece cache syncs in just a few minutes. Zero impact on my home network. The VPN keeps up with at least 3 node/farmers.
The ISP would not tell me what the issue was that they apparently noticed the other day. They also would not admit to any sort of rate limiting or blocking of traffic. So who knows. The only solution for me was to set up a VPN.
I now have pfSense with an OpenVPN client set up, connected to an EC2 instance running an OpenVPN server. I have this set as a gateway in pfSense, and I set up an alias that routes specific IPs through the VPN. I wanted to do it by VLAN, which I will eventually, but some of the servers are on a non-VLAN-capable switch that is shared with regular devices I don’t want to route through the VPN. Working on a solution to that.
EDIT: I spoke too soon. I’m getting high CPU usage and packet loss. I think I will need to upgrade my pfSense box now.
EDIT 2: After the piece cache sync, CPU usage went down quite a bit, so I’m slowly starting up each server. So far I have 5 PCs farming and the network is holding up well.
Alright, one more update. By staggering my start-ups, I now have 8 farmers running on the VPN perfectly, with no impact on my home network. I still plan to upgrade my pfSense box, as CPU usage is hovering around 50–60%.
Which software and version are you using right now? I have some ideas on what to experiment with and can create a custom build for that version with a single change to see if it helps.
@repost I’m wondering if Snapshot build · subspace/subspace@483db85 · GitHub helps you in any way. If my current theory is correct, it should. But it would be great to first confirm that you still have issues with mar-18 before trying that one, so that we have a proper comparison.
I tested this build today, and something does seem to be working differently with respect to the network saturation issue. This morning I was trying to run Space Acres 0.1.11-1, and it was impacting the network really badly: dropped connections and timeouts. I switched over to the Ubuntu build; the node synced, then started farming/plotting. There is still some issue, I would say, but something is very different. Happy to continue to test and provide logs, etc. I am running tests on both Windows and Ubuntu. My sense is that the network issue is a bigger problem on Linux/Ubuntu than on Windows, but I have no firm evidence at this point. Running Datadog on both systems to monitor.
Everything after the red line is the new build.
There is no difference network-wise between Windows and Ubuntu from an architecture point of view.
It would be good if you could quantify “very different”, but either way this topic is not about Ubuntu vs Windows.
What I am curious about right now is whether, if you have issues with the CLI of the current latest release, switching to the above experimental build changes anything for you. Ideally we’d see that it stops the networking from breaking.
I have been running the experimental version for over 16 hours, with good results.
What is different from the Feb 19th version is that my network is not “stopping”. The Feb 19th version would crash my network completely, and I would have to stop running the CLI or Space Acres; I had given up on Subspace due to this issue. With this experimental version (Ubuntu) it is better, but I sense there is still something impacting the network that is not bandwidth. Not sure if you can share the change, so I can narrow my focus on what to monitor. Ping times are good and dropped packets have been zero in the past few hours.
The images below are this morning’s ping test and the Ubuntu host monitor. Happy to revert to the Mar 22nd version if that helps test your theory.
My suspicion is that it was never bandwidth, but rather churn of connections. And because you run Space Acres, it still impacts things the same way as before. Since we were using UDP, which doesn’t have “connections”, it creates firewall states that then expire over time, and apparently some routers’ capabilities and/or ISP limits are low enough to be overwhelmed by the number of states being generated.
That test build removes UDP/QUIC support from the software completely, switching to TCP instead. While the same number of connections is made, TCP has actual connections with explicit closing, so firewalls/routers can drop the corresponding states immediately (if they wish to; they don’t always).
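The distinction can be sketched with plain sockets (a local illustration, not Subspace code): a TCP `close()` signals teardown on the wire with a FIN, giving a stateful middlebox an explicit event to expire the flow on, while closing a UDP socket is purely local, so the middlebox has nothing to key off and must hold the state until an idle timeout.

```python
import socket
import threading

def tcp_echo_server(srv):
    # Accept one connection, echo the first message back, then close.
    conn, _ = srv.accept()
    conn.sendall(conn.recv(16))
    conn.close()

# TCP: three-way handshake on connect(), FIN exchange on close().
srv = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
srv.bind(("127.0.0.1", 0))
srv.listen(1)
t = threading.Thread(target=tcp_echo_server, args=(srv,))
t.start()

cli = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
cli.connect(srv.getsockname())
cli.sendall(b"ping")
echoed = cli.recv(16)
cli.close()          # sends FIN: a stateful firewall can drop the flow state now
t.join()
srv.close()

# UDP: no connection at all. A single datagram creates flow state in a
# stateful middlebox that only goes away when its idle timer expires.
usrv = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
usrv.bind(("127.0.0.1", 0))
ucli = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
ucli.sendto(b"ping", usrv.getsockname())
data, addr = usrv.recvfrom(16)
ucli.close()         # closes the local descriptor; nothing is signalled on the wire
usrv.close()
```

Multiply that UDP pattern by hundreds of short-lived peer exchanges and a small router’s state table fills up, which matches the “not bandwidth” symptoms described above.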
Would be great to get more confirmations before removing QUIC for everyone.
Thanks for the explanation. I was planning to set up a “better” router to confirm a few other things I am working on, so I’m happy to pull stats on that; should be within the next week.
I privately messaged you in January about the infinite retry loop bug in UDP connections in libp2p. It has finally been removed in the current version, along with QUIC. Perhaps it’s time to find out the reason behind the QUIC issue.
Since there was never really a need to use UDP specifically for data transmission, considering the QoS policies of various countries and other factors, I had wanted you to disable UDP back then.