SPX Snapshot & SplitStoreV2 - Testing & Call Outs #9179
Replies: 7 comments 32 replies
-
|
Beta Was this translation helpful? Give feedback.
-
Step 4:
Step5:
Please complete the following after observing usual operations:
Sizes of the folders on a weekly basis for a period of 4 weeks:
|
Beta Was this translation helpful? Give feedback.
-
Sizes of the folders on a weekly basis for a period of 4 weeks:
Round2 on v1.17.2-rc2
|
Beta Was this translation helpful? Give feedback.
-
Meest NAM
After performing "Split store", wdpost failed, but let's see if there are any other causes
|
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Step 5: Reply cardPlease provide the information below after completion of the above steps and immediately following the successful start-up of your SplitStore enabled daemon:
We will be asking for updated sizes of your
|
Beta Was this translation helpful? Give feedback.
-
Hello SPX Fellows
Thank you to everyone who participated in the initial SpiltStoreV2 testing round!!
@ZenGround0 has been working tirelessly on the upcoming SplitStoreV2 rollout and we have a shiny new release for you to break 😉.
Thanks to @travisperson we also have the brand new Snapshot Service which was soft-launched on 4th August 2022! You can read more about the new service on this public Notion page.
It goes without saying that both of these new releases represent a huge improvement over the current implementations!
We would sincerely appreciate your help in testing both of these new enhancements and seeing as enabling splitstore ideally also involves a new snapshot import, we're going to test both at the same time. This assignment is open to all SPX fellows, please see the details below to get started,
This assignment is applicable to both mainnet and calibration instances but we are particularly interested in hearing about your mainnet experiences. In the event of any critical failures on your node, recovery is a very familiar procedure as detailed below.
Please reach out to @jennijuju or @TippyFlits on Slack if you have any questions! 🙏
Step 1:⚠️ Backup ALL lotus wallets before doing anything! (⚠️
lotus wallet list
)Step 2: Download a new lightweight snapshot & Update to testing release
Mainnet - https://snapshots.mainnet.filops.net/minimal/latest
Calibration - https://snapshots.calibrationnet.filops.net/minimal/latest
Step 3: Enable SplitStore and configure ColdStore mode
SplitStore can be run in
universal
,discard
and nowAutoPrune
modes and all three modes have been enhanced since the previous testing round. We are looking for a selection of SPX fellows to test a combination of three. Select 🎉 if you are testinguniversal
mode and 🚀 if you are opting fordiscard
mode and 👀 if your are testingAutoPrune
. Please review the emoji count at the bottom of this post before selecting a mode, an even balance of the 3 options would be much appreciated. The selected mode can be set in your/.lotus/config.toml
as follows:EnableSplitstore = true
[Chainstore.Splitstore]
ColdStoreType = "universal"
EnableColdStoreAutoPrune = false
EnableSplitstore = true
[Chainstore.Splitstore]
ColdStoreType = "discard"
EnableColdStoreAutoPrune = false
EnableSplitstore = true
[Chainstore.Splitstore]
ColdStoreType = "universal"
EnableColdStoreAutoPrune = true
Step 4: Prepare your
/.lotus/datastore
and import snapshot/.lotus/datastore/chain
folder./.lotus/datastore/chain
folder and clear your/.lotus/datastore/splitstore
folder by running./lotus-shed splitstore clear --repo=/path/to/your/lotus/repo
/.lotus/datastore/chain
and/.lotus/datastore/splitstore
folder sizes immediately following snapshot import and successful chain sync completion and record your results in the reply card below.lotus log set-level --system splitstore debug
Step 5: Reply card
Please provide the information below after completion of the above steps and immediately following the successful start-up of your SplitStore enabled daemon:
Please complete the following after observing usual operations:
lotus daemon
stays in sync/.lotus/datastore/splitstore/hot.badger
🔥 🦡/.lotus/datastore/chain
folder is being successfully garbage collectedWe will be asking for updated sizes of your
/.lotus/datastore/chain
and/.lotus/datastore/splitstore
folders on a weekly basis for a period of 4 weeks. We will publish weekly reminders in#spx-current-class
but this is obviously dependent on your testing start date. Sizes should be recorded at the end of each week and inGiB
format. Please consider setting weekly reminders so metrics remain consistent and accurate. 🙏Reminders and Risk Mitigations
In the event of a critical failure or data corruption, you will need to re-import a lightweight snapshot with the following steps:
/.lotus /datastore
folder/.lotus/config.toml
-EnableSplitstore = false
📓 In the event that you are unable to start your daemon after trying the above steps, please delete just the folders in your
/.lotus
directory and retry the lightweight snapshot import. ❗ ONLY DELETE THE FOLDERS - theapi
,token
andconfig.toml
files should not be deleted!Questions are welcomed in this discussion, the team will be monitoring and will check in regularly.
Issues can be reported at https://github.com/filecoin-project/lotus/issues. Issues should be submitted with:
lotus pprof goroutines
andlotus-miner pprof goroutines
📓 And thank you to Reiers Productions for the very cool banner!!
Beta Was this translation helpful? Give feedback.
All reactions