
Conversation

@Naman-B-Parlecha
Contributor

Resolve: #573

Currently, the peak sample count is incorrectly reported as 1 because currStepSamples is reset to 0 at every step iteration, preventing proper accumulation of peak samples across series.

To fix this, we can maintain a vector that tracks the sample count per step across all series, adding up the samples at each step. Once the samples at every step of all series are processed, we can update the telemetry for all series.
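The per-step accumulation described above could look roughly like this (a minimal, self-contained sketch; peakSamples and its inputs are illustrative names, not the engine's actual types):

```go
package main

import "fmt"

// peakSamples accumulates the sample count at each step across all series,
// then returns the largest per-step total. This is a sketch of the idea,
// not the engine's real telemetry code.
func peakSamples(samplesPerSeries [][]int, numSteps int) int {
	stepTotals := make([]int, numSteps)
	for _, series := range samplesPerSeries {
		for step, n := range series {
			stepTotals[step] += n // add up samples at each step across series
		}
	}
	peak := 0
	for _, total := range stepTotals {
		if total > peak {
			peak = total
		}
	}
	return peak
}

func main() {
	// Two series with three steps each: step totals are 5, 3, 3.
	fmt.Println(peakSamples([][]int{{1, 2, 3}, {4, 1, 0}}, 3))
}
```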

Signed-off-by: Naman-B-Parlecha <[email protected]>
@Naman-B-Parlecha
Contributor Author

@yeya24 @MichaHoffmann PTAL!

@yeya24
Contributor

yeya24 commented Jul 7, 2025

I think it is better to do #573 (comment).
We can decouple updating peak samples from IncrementSamplesAtTimestamp.

@Naman-B-Parlecha
Contributor Author

Sure, will refactor!

@harry671003 harry671003 self-requested a review July 9, 2025 22:32
@harry671003
Contributor

Thanks for the work on this!

Before we finalize the implementation, I'd like to clarify what we mean by peak samples in the context of the Thanos engine.

In the Prometheus engine, peak samples is defined as the highest number of samples held in memory at any moment during query execution, across all operators. This is used to enforce the --query.max-samples limit and prevent memory exhaustion.

A few open questions for alignment:

  • Are we using the same definition of peak samples in the Thanos engine? That is, tracking the max number of samples in memory at any point during evaluation (across all operators)?
  • Or are we maintaining per-operator peak samples separately? If so, how are we aggregating or enforcing limits?
  • At what points in the engine are we calling UpdatePeak? Are we sure those points capture all relevant memory usage?

Having a shared definition with Prometheus helps make this limit consistent and easier to reason about. If we’re diverging, it would be good to document how and why.

Curious to hear thoughts.

@Naman-B-Parlecha
Contributor Author

Hey @harry671003, thanks for the clarification.
I'm not entirely sure at what point we should be updating the peak, but I believe we want to do something similar to what happens in Prometheus?

@yeya24 is more qualified to answer this, I believe, as he has worked closely with the engine.

@harry671003
Contributor

I thought a little bit more about this:

Let's define peak samples first. The peak samples for an operator are the total samples the operator processed in a single Next() call. The maximum peak samples would be the highest peak samples across all operators.

One way to implement this is to use the telemetry operator
https://github.com/thanos-io/promql-engine/blob/main/execution/telemetry/telemetry.go#L204


func (t *Operator) Next(ctx context.Context) ([]model.StepVector, error) {
	start := time.Now()
	defer func() { t.OperatorTelemetry.AddNextExecutionTime(time.Since(start)) }()

	totalSamplesBefore := t.OperatorTelemetry.Samples().TotalSamples() // samples before calling Next()
	out, err := t.inner.Next(ctx)
	if err != nil {
		return nil, err
	}
	totalSamplesAfter := t.OperatorTelemetry.Samples().TotalSamples() // samples after calling Next()
	t.OperatorTelemetry.UpdatePeak(totalSamplesAfter - totalSamplesBefore) // the diff is this call's peak samples

	return out, nil
}

Wdyt @yeya24 @Naman-B-Parlecha ?

@Naman-B-Parlecha
Contributor Author


Great, I was looking into how Prometheus updates its peak.
But I'm curious why the peak value is the diff of after and before.
Shouldn't it be the highest across the series? (This is how I'm updating it currently in this PR; not sure if I'm forgetting something.)

@harry671003
Contributor

@Naman-B-Parlecha

Yes, what you're doing is fundamentally the same. This is just a cleaner way of doing it.

  • We decouple IncrementSamples() and UpdatePeak()
  • We only do it in telemetry.Operator
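A minimal sketch of that decoupling might look like this (the samples type and its methods are illustrative, simplified from the real OperatorTelemetry interface):

```go
package main

import "fmt"

// samples is a hypothetical, simplified telemetry store: operators only
// call IncrementSamples(); the telemetry wrapper alone computes the
// per-Next() diff and calls UpdatePeak().
type samples struct {
	total int
	peak  int
}

func (s *samples) IncrementSamples(n int) { s.total += n }
func (s *samples) TotalSamples() int      { return s.total }

// UpdatePeak keeps the maximum samples processed in any single Next() call.
func (s *samples) UpdatePeak(n int) {
	if n > s.peak {
		s.peak = n
	}
}

func main() {
	s := &samples{}
	// Simulate three Next() calls processing 30, 80, and 50 samples.
	for _, n := range []int{30, 80, 50} {
		before := s.TotalSamples()
		s.IncrementSamples(n)                   // operator code only increments
		s.UpdatePeak(s.TotalSamples() - before) // wrapper updates the peak
	}
	fmt.Println(s.peak) // highest single-call count
}
```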

@Naman-B-Parlecha
Contributor Author


Thanks for the clarification 🙌
Should I refactor the code in this PR, or would you like me to make a fresh PR and add the changes there?

@harry671003
Contributor

Use the same PR.
Also include unit tests.

@Naman-B-Parlecha Naman-B-Parlecha force-pushed the NamanParlecha/FixPeakSample branch from f81684b to 0e0ae60 Compare July 19, 2025 15:54
Signed-off-by: Naman-B-Parlecha <[email protected]>
Signed-off-by: Naman-B-Parlecha <[email protected]>
@Naman-B-Parlecha Naman-B-Parlecha force-pushed the NamanParlecha/FixPeakSample branch from 0e0ae60 to 7d06931 Compare July 19, 2025 15:56
Contributor

@harry671003 harry671003 left a comment


Great work! Just some comments.

| | | | |---[concurrent(buff=2)]: max_series: 1 total_samples: 0 peak_samples: 0
| | | | | |---[matrixSelector] rate({[__name__="http_requests_total"]}[10m0s] 0 mod 2): max_series: 1 total_samples: 1010 peak_samples: 200
| | | | |---[concurrent(buff=2)]: max_series: 1 total_samples: 0 peak_samples: 0
| | | | | |---[matrixSelector] rate({[__name__="http_requests_total"]}[10m0s] 1 mod 2): max_series: 1 total_samples: 1010 peak_samples: 200
Contributor

@harry671003 harry671003 Jul 21, 2025


Is the peak samples correct here?
For the selector {[__name__="http_requests_total"]}[10m0s]

Conditions:

  • Steps batch is 10.
  • Selectors shards = 2

Calculation:

  • Each Next() call has 10 steps.

  • Each step has 1 series (due to sharding)

  • For each step:

    • Each series has a sample every 30 seconds.
    • In [10m] range, there will be 10*60s/30s = 20 samples
  • Total samples in each Next() call = 10 steps * 1 series * 20 samples = 200

Looks correct.
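The arithmetic above can be checked with a quick calculation (plain Go, not engine code):

```go
package main

import "fmt"

func main() {
	stepsPerNext := 10  // steps batch size
	seriesPerStep := 1  // one series per shard, with 2 shards
	rangeSeconds := 600 // [10m] matrix selector range
	scrapeInterval := 30 // one sample every 30 seconds per series

	samplesPerSeriesPerStep := rangeSeconds / scrapeInterval // 20
	peak := stepsPerNext * seriesPerStep * samplesPerSeriesPerStep
	fmt.Println(peak) // matches the reported peak_samples of 200
}
```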

@harry671003 harry671003 requested a review from yeya24 July 21, 2025 17:53
Signed-off-by: Naman-B-Parlecha <[email protected]>
@Naman-B-Parlecha Naman-B-Parlecha force-pushed the NamanParlecha/FixPeakSample branch from 625dbe1 to eb0809d Compare July 21, 2025 22:13
@Naman-B-Parlecha Naman-B-Parlecha force-pushed the NamanParlecha/FixPeakSample branch from 3096f45 to ef60fd8 Compare July 21, 2025 23:42
Contributor

@harry671003 harry671003 left a comment


LGTM! Great work. Thank you.

@Naman-B-Parlecha
Contributor Author

Naman-B-Parlecha commented Jul 21, 2025

Thanks for helping me out @harry671003 @yeya24 🙌

Contributor

@yeya24 yeya24 left a comment


Thanks @Naman-B-Parlecha and @harry671003! I like how this is implemented now and it seems clean.

Let's fix lint. Unit tests also failed, so we may need to take another look.

Signed-off-by: Naman-B-Parlecha <[email protected]>
@Naman-B-Parlecha
Contributor Author

@yeya24 PTAL

Signed-off-by: Naman-B-Parlecha <[email protected]>
@yeya24 yeya24 merged commit 91e6e32 into thanos-io:main Jul 26, 2025
11 of 12 checks passed

Development

Successfully merging this pull request may close these issues.

Incorrect peak sample stats
