Skip to content

Commit f85bf59

Browse files
Raphael/bulk details (#203)
* Update bulk-loader.mdx details about p directory * Update bulk-loader.mdx add increment to trigger snapshot * Update bulk-loader.mdx * Update dgraph/admin/bulk-loader.mdx --------- Co-authored-by: Ryan Fox-Tyler <[email protected]>
1 parent 3fe91d7 commit f85bf59

File tree

1 file changed

+14
-8
lines changed

1 file changed

+14
-8
lines changed

dgraph/admin/bulk-loader.mdx

Lines changed: 14 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -267,20 +267,26 @@ In case your dataset is small (a few gigabytes) it would be convenient to start
267267
by initializing just one Alpha node and then let the snapshot be streamed among
268268
the other Alpha replicas. You can follow these steps:
269269

270-
1. Run Bulk Loader only on one server
270+
1. Run Bulk Loader only on one Alpha server
271271

272-
2. Once the `p` directory has been created by the Bulk Loader, then start
273-
**only** the first Alpha replica
272+
2. Once the generated `out\0\p` directory has been created by the Bulk Loader,
273+
copy the `p` directory (default path is `out/0/p`) to the Alpha volume.
274274

275-
3. Wait for 1 minute to ensure that a snapshot has been taken by the first Alpha
275+
3. Start **only** the first Alpha replica
276+
277+
4. Generate some mutations. Without mutation the Alpha will not create a
278+
snapshot. You can run `dgraph increment -n 10000` to generate some mutations
279+
on an internal counter not affecting your data.
280+
281+
5. Wait for 1 minute to ensure that a snapshot has been taken by the first Alpha
276282
node replica. You can confirm that a snapshot has been taken by looking for
277283
the following message":
278284

279285
```txt
280286
I1227 13:12:24.202196 14691 draft.go:571] Creating snapshot at index: 30. ReadTs: 4.
281287
```
282288

283-
4. After confirming that the snapshot has been taken, you can start the other
289+
6. After confirming that the snapshot has been taken, you can start the other
284290
Alpha node replicas (number of Alpha nodes must be equal to the `--replicas`
285291
flag value set in the Zero nodes). Now the Alpha node (the one started in
286292
step 2) logs similar messages:
@@ -305,9 +311,9 @@ When your dataset is pretty big (larger than 10 GB) it is faster that you just
305311
copy the generated `p` directory (by the Bulk Loader) among all the Alphas
306312
nodes. You can follow these steps:
307313

308-
1. Run Bulk Loader only on one server
309-
2. Copy (or use `rsync`) the `p` directory to the other servers (the servers you
310-
are using to start the other Alpha nodes)
314+
1. Run Bulk Loader only on one Alpha server
315+
2. Copy (or use `rsync`) the generated `out\0\p` directory to all Alpha nodes
316+
(the servers you are using to start the Alpha nodes)
311317
3. Now, start all Alpha nodes at the same time
312318

313319
If the process went well **all** Alpha nodes take a snapshot after 1 minute. You

0 commit comments

Comments
 (0)