
feat: update blockbuilder to use scheduler for fetching jobs #15224

Merged
merged 13 commits into from
Dec 4, 2024

Conversation

@ashwanthgoli (Contributor) commented Dec 3, 2024

What this PR does / why we need it:

  • Updates block builder to use scheduler APIs for getting jobs and updating their status
  • Adds a sync loop to periodically call syncJob to update the status of inflight jobs
  • Tries to rename all instances of slimgester to blockbuilder
  • Moves chunk appender code to a separate file appender.go
  • Removes controller as it is no longer required

Special notes for your reviewer:
I renamed slimgester.go to blockbuilder.go in the last commit, so GitHub no longer shows the diff and treats it as a new file. Please check the first two commits to view the new changes made in builder.go

Checklist

  • Reviewed the CONTRIBUTING.md guide (required)
  • Documentation added
  • Tests updated
  • Title matches the required conventional commits format, see here
    • Note that Promtail is considered to be feature complete, and future development for logs collection will be in Grafana Alloy. As such, feat PRs are unlikely to be accepted unless a case can be made for the feature actually being a bug fix to existing behavior.
  • Changes that require user attention or interaction to upgrade are documented in docs/sources/setup/upgrade/_index.md
  • If the change is deprecating or removing a configuration option, update the deprecated-config.yaml and deleted-config.yaml files respectively in the tools/deprecated-config-checker directory. Example PR

@ashwanthgoli ashwanthgoli changed the base branch from main to refactor-kafka-reader December 3, 2024 06:35
@pull-request-size pull-request-size bot added size/L and removed size/XL labels Dec 3, 2024
@github-actions github-actions bot added the type/docs Issues related to technical documentation; the Docs Squad uses this label across many repositories label Dec 3, 2024
@ashwanthgoli ashwanthgoli marked this pull request as ready for review December 3, 2024 10:23
@ashwanthgoli ashwanthgoli requested a review from a team as a code owner December 3, 2024 10:23
Base automatically changed from refactor-kafka-reader to main December 4, 2024 04:40
@owen-d (Member) left a comment:

Left a few things to fix, but giving approval to unblock you.

@@ -29,6 +29,7 @@ message GetJobResponse {
message CompleteJobRequest {
string builder_id = 1;
Job job = 2;
int64 LastConsumedOffset = 3;
@owen-d (Member) commented:
Why do we need this? Jobs are completed all-or-nothing because TSDBs are built at the end, and the job itself contains the offset range.


lastConsumedOffset, err := i.processJob(ctx, job, logger)
// TODO: pass lastConsumedOffset as a separate field
job.Offsets.Max = lastConsumedOffset
@owen-d (Member) commented:
I think it's much simpler if the jobs are predetermined at the scheduler. This was the initial design and although it does introduce a bit of lag (we only process offsets known at the time of job creation), I think the simplicity & separation of concerns are more beneficial (at least for now).


exists, job, err := i.jobController.LoadJob(ctx)
func (i *BlockBuilder) runOne(ctx context.Context, workerID string) error {
// assuming GetJob blocks/polls until a job is available
@owen-d (Member) commented:
I suspect we'll need to retry when there are no jobs here, but as you said, it's also possible the transport handles this.

if err != nil {
return nil, err
readerFactory := func(partitionID int32) (partition.Reader, error) {
return partition.NewKafkaReader(
@owen-d (Member) commented:
This will panic b/c it's creating new clients each time, each of which uses the same metrics namespacing internally. Instead, we could create a single client which creates cheap copies via a WithPartition(x) -> Self or similar.
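The suggested pattern might look like this sketch. `kafkaClient`, `Reader`, and `WithPartition` here are illustrative stand-ins, not the actual kgo or Loki types:

```go
package main

import "fmt"

// kafkaClient stands in for the shared kgo.Client; in the suggestion it
// is created (and its metrics registered) exactly once.
type kafkaClient struct{ name string }

// Reader is a cheap per-partition view that shares one underlying client.
type Reader struct {
	client    *kafkaClient
	partition int32
}

// WithPartition returns a shallow copy bound to partition p, avoiding a
// second client (and the duplicate metrics registration that panics).
func (r Reader) WithPartition(p int32) Reader {
	r.partition = p
	return r
}

func main() {
	base := Reader{client: &kafkaClient{name: "shared"}}
	a, b := base.WithPartition(1), base.WithPartition(2)
	fmt.Println(a.client == b.client, a.partition, b.partition) // true 1 2
}
```

The reply below notes why this may not be safe for a real `kgo.Client`, which is mutated when setting consume offsets.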

@ashwanthgoli (Contributor, Author) replied:
Not sure if it's safe to make copies of kgo.Client; we'd need separate instances since we mutate it while setting the offset for consumption.

Working around this by registering metrics only once: 786186a.
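The workaround can be sketched with a `sync.Once` guard; the counter below is a placeholder for the real Prometheus collectors, whose duplicate registration would otherwise panic:

```go
package main

import (
	"fmt"
	"sync"
)

// registerOnce guards metric registration: constructing several readers
// must not register the same collectors twice.
var (
	registerOnce  sync.Once
	registrations int
)

func newReader() {
	registerOnce.Do(func() {
		// prometheus.MustRegister(collectors...) would go here; calling
		// it twice with the same collector panics, hence the Once.
		registrations++
	})
}

func main() {
	for i := 0; i < 3; i++ {
		newReader()
	}
	fmt.Println(registrations) // 1
}
```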

@ashwanthgoli ashwanthgoli merged commit 0d67831 into main Dec 4, 2024
59 checks passed
@ashwanthgoli ashwanthgoli deleted the blockbuilder-use-scheduler branch December 4, 2024 09:44