MQE: Implement experimental info function #13443

zenador · 2025-11-09T21:15:50Z

What this PR does

Support the experimental info function in MQE.

Which issue(s) this PR fixes or relates to

Fixes https://github.com/grafana/mimir-squad/issues/3084

Checklist

Tests updated.
Documentation added.
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]. If changelog entry is not needed, please add the changelog-not-needed label to the PR.
about-versioning.md updated with experimental features.

Note

Implements info() to enrich series with labels from info metrics, adds planning/selector wiring, and comprehensive tests.

Functions/Operators:
- info() implemented via new InfoFunction operator to enrich base series with labels from matching info metrics (instance,job).
  - Resolves overlapping info by latest timestamp; ignores enriching info-series themselves; preserves base histograms; errors if info metrics have histograms.
Planner:
- Normalizes info() args: defaults second arg to target_info; enforces vector selector; auto-adds __name__="target_info" if missing; marks FUNCTION_INFO as not needing dedup.
- Registers FUNCTION_INFO with InfoFunctionOperatorFactory.
Selector:
- Adds ReturnSampleTimestampsPreserveHistograms to InstantVectorSelector to expose timestamps as floats while preserving histograms (used by info()).
Tests:
- Adds upstream and custom info.test covering label matching, lookback/@/offset, churn, conflicts, and error cases.

^{Written by Cursor Bugbot for commit 7f1aa4a. This will update automatically on new commits. Configure here.}

…e inner series

…a so we can get from the pool just once with the right number

cursor · 2025-11-09T21:18:11Z

pkg/streamingpromql/operators/functions/info.go

+
+		newLabelSets = append(newLabelSets, lb.Labels())
+		labelSetsOrder = append(labelSetsOrder, makeLabelSetsHash(labelSets))
+	}


Bug: Labels Builder Accumulates, Corrupting Label Combinations

The labels.Builder in combineLabels is missing a reset between iterations of the outer loop over labelSetsMap. After calling lb.Reset(innerSeries.Labels) once at line 379, the builder accumulates labels from each iteration without resetting. This causes labels from previous labelSets iterations to leak into subsequent iterations, creating incorrect combined label sets. The builder should be reset to innerSeries.Labels at the start of each iteration through labelSetsMap.

charleskorn · 2025-11-12T09:34:32Z

pkg/streamingpromql/operators/functions/info.go

+
+	timeRange                types.QueryTimeRange
+	expressionPosition       posrange.PositionRange
+	enableDelayedNameRemoval bool


Is this used anywhere?

charleskorn · 2025-11-12T09:35:54Z

pkg/streamingpromql/planning.go

+				dataLabelMatchersExpr, ok := expr.Args[1].(*parser.VectorSelector)
+				if !ok {
+					return nil, fmt.Errorf("expected second argument to 'info' function to be a VectorSelector, got %T", expr.Args[1])
+				}


We'll need to add a special case to CSE to ensure it doesn't replace the second argument to info with a deduplicated expression.

charleskorn · 2025-11-12T09:37:01Z

pkg/streamingpromql/planning.go

+			if len(expr.Args) == 1 {
+				infoExpr, err := parser.ParseExpr("target_info")
+				if err != nil {
+					return nil, err
+				}
+				expr.Args = append(expr.Args, infoExpr)


It might be good to do this as a very early AST optimisation pass. This will allow all other optimisation passes to ignore the fact that info can have one or two arguments, and instead they can just handle the two argument case.

charleskorn · 2025-11-12T09:39:24Z

pkg/streamingpromql/operators/functions/info.go

+	ivs, ok := f.Info.(*selectors.InstantVectorSelector)
+	if !ok {
+		return nil, fmt.Errorf("info function 2nd argument is not an instant vector selector")
+	}
+	// Override float values to reflect original timestamps.
+	ivs.ReturnSampleTimestampsPreserveHistograms = true


If we require Info to be an InstantVectorSelector, then we should enforce this by changing the type of the parameter to NewInfoFunction.

charleskorn · 2025-11-12T09:41:49Z

pkg/streamingpromql/operators/functions/info.go

+	ivs, ok := f.Info.(*selectors.InstantVectorSelector)
+	if !ok {
+		return nil, fmt.Errorf("info function 2nd argument is not an instant vector selector")
+	}
+	// Override float values to reflect original timestamps.
+	ivs.ReturnSampleTimestampsPreserveHistograms = true


Rather than setting this here, we should set it on the vector selector node in nodeFromExpr. This will allow optimisation passes to respect that this selector needs to behave differently (eg. that it's not equivalent to another similar selector without this set).

(You'll then need to update the vector selector node's equivalence and description methods to match.)

charleskorn · 2025-11-12T09:44:17Z

pkg/streamingpromql/operators/functions/info.go

+	innerMetadata, err := f.Inner.SeriesMetadata(ctx, matchers)
+	if err != nil {
+		return nil, err
+	}
+	defer types.SeriesMetadataSlicePool.Put(&innerMetadata, f.MemoryConsumptionTracker)
+
+	infoMetadata, err := f.Info.SeriesMetadata(ctx, matchers)


We can't pass matchers as-is here as they might apply to the labels from Inner or from Info, and so they could filter out all the series from the other selector.

(eg. imagine matchers contains a environment="prod" matcher, and the environment label only appears on Inner - passing this matcher to Info will cause it to incorrectly return no results)

charleskorn · 2025-11-12T09:45:08Z

pkg/streamingpromql/operators/functions/info.go

+	innerMetadata, err := f.Inner.SeriesMetadata(ctx, matchers)
+	if err != nil {
+		return nil, err
+	}
+	defer types.SeriesMetadataSlicePool.Put(&innerMetadata, f.MemoryConsumptionTracker)
+
+	infoMetadata, err := f.Info.SeriesMetadata(ctx, matchers)


One thing we do need to do here: we should generate some matchers for job and instance based on the series returned by Inner. Otherwise we could unnecessarily select all known target_info series during the query time range.

charleskorn · 2025-11-12T09:56:23Z

pkg/streamingpromql/operators/functions/info.go

I think we should rejig this to avoid loading all of the info series upfront, as this could be a lot of data.

It's OK for SeriesMetadata to return series that will later turn out to have no samples, so we can do something similar to what we do for binary operations.

I'm imagining something like this:

in SeriesMetadata:

load all series metadata from Inner

compute the possible set of instance and job labels

load all series metadata from Info, passing the matchers for instance and job

compute the cross product of all possible inner and info series, in the same order that the inner series are produced

in NextSeries:

if we don't have an active inner series:

read the next inner series

read all corresponding info series for this inner series, buffering any we read that will be needed for other inner series

compute all of the output series based on the inner series and the info series (in the same order these were returned by SeriesMetadata)

return the first output series for this inner series

if we have an active inner series, return the next output series we computed when we first read it

As a later improvement (not in this PR), if we can ask ingesters and store-gateways to sort returned series by arbitrary labels, then we can ask them to send series sorted by instance and job (and then by all other labels), and there should be almost no buffering required. This would also be beneficial for binary operations and aggregations - we could ask for series to be sorted by binary operation matching labels or aggregation grouping labels to reduce the amount of buffering these operators need to do.

zenador added 16 commits November 3, 2025 13:44

Add test cases for info

42e87be

Enable upstream tests

afe7cf7

Allow info func to be called with placeholder results

0822e2c

Implement info func without considering inner samples

5f145c9

Handle removal of series in metadata properly

2671c80

Go sample by sample and allow for multiple result series from a singl…

db344a5

…e inner series

Attempt memory fix

ae08865

More memory fixes

617cf31

Try to fix failing upstream test case

4c7df44

Fix the test cases that started failing

f6c93d9

Simplify the new ReturnSampleTimestampsPreserveHistograms

2b882d2

Start to handle SeriesMetadata memory tracking more properly

c0f20d6

Do a first pass to get the exact length of the enriched SeriesMetadat…

668e405

…a so we can get from the pool just once with the right number

Simplify by removing passInner

e4d8285

Simplify and add explanatory comments

4d181ac

Merge branch 'main' into zenador/mqe-info-func

be02b9d

zenador requested a review from a team as a code owner November 9, 2025 21:15

cursor bot reviewed Nov 9, 2025

View reviewed changes

make format-promql-tests

7f1aa4a

charleskorn reviewed Nov 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MQE: Implement experimental info function #13443

MQE: Implement experimental info function #13443

zenador commented Nov 9, 2025 •

edited by cursor bot

Loading

Uh oh!

cursor bot Nov 9, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

charleskorn Nov 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MQE: Implement experimental info function #13443

Are you sure you want to change the base?

MQE: Implement experimental info function #13443

Conversation

zenador commented Nov 9, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does

Which issue(s) this PR fixes or relates to

Checklist

Uh oh!

cursor bot Nov 9, 2025

Choose a reason for hiding this comment

Bug: Labels Builder Accumulates, Corrupting Label Combinations

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

zenador commented Nov 9, 2025 •

edited by cursor bot

Loading