After renaming paritydb folder to back it up, (renamed due to to OOM Killer) node synched okay, but now that operator re-registered, operator fails to start and crashes with:
Dec 05 17:23:00 mmvt1 subspace-node[2191]: 2023-12-05T22:23:00.402805Z [Consensus] ⚙️ Syncing 0.0 bps, target=#525569 (40 peers), best: #409761 (0xf8c1…d11f), finalized #376724 (0x17d4…cfe4), ⬇ 106.7kiB/s ⬆ 13.1kiB/s
Dec 05 17:23:02 mmvt1 subspace-node[2191]: 2023-12-05T22:23:02.144594Z [Domain] Failed to process consensus block error=UnknownBlock("Header was not found in the database: 0x1e27b40e3142420329ab58b9e42fcf484f3ede9ee13ce4a5b7f853154f21a78a")
Dec 05 17:23:02 mmvt1 subspace-node[2191]: 2023-12-05T22:23:02.144731Z [Domain] Essential task `domain-operator-worker` failed. Shutting down service.
Dec 05 17:23:02 mmvt1 subspace-node[2191]: 2023-12-05T22:23:02.144873Z [Domain] Domain starter exited with an error Other("Essential task failed.")
Dec 05 17:23:02 mmvt1 subspace-node[2191]: 2023-12-05T22:23:02.144881Z [Domain] Essential task `domain` failed. Shutting down service.
Dec 05 17:23:02 mmvt1 subspace-node[2191]: Error: SubstrateService(Other("Essential task failed."))
Dec 05 17:23:02 mmvt1 systemd[815]: subspace-node.service: Main process exited, code=exited, status=1/FAILURE
Dec 05 17:23:02 mmvt1 systemd[815]: subspace-node.service: Failed with result 'exit-code'.
I’m not sure, but I have tested locally that if I rename the whole --base-path folder the operator can restart successfully.
A few questions/things I need @jrwashburn to help with to locate the problem:
Have you renamed the whole --base-path or just the chains/subspace_gemini_3g/paritydb
Is the operator failed immediately after starting the node or after the node syncing for some time
Plz run this command and let me know the result: subspace-node check-block 0x1e27b40e3142420329ab58b9e42fcf484f3ede9ee13ce4a5b7f853154f21a78a --chain gemini-3g --base-path <PATH>
I renamed them both again and re-synced overnight, and the node is running okay this time. Would I need to take down the node, restore the old paritydb folders and then run the check-block? And if I do that, will I be able to just rename back to the good paritydb folders and not have to sync all over again?
Check the same block number (i.e. #160648) has the same hash (i.e. 0x3928…a566) as in the the RPC endpoint node
restore the old paritydb folders and then run the check-block?
If your old paritydb folders still exist (i.e. have the exact same data as it first shut down due to OOM), you can run the command directly in the old folder