Add multi-label support to Find box for number fields by rvisser7 · Pull Request #6964 · LMFDB/lmfdb

rvisser7 · 2026-04-10T23:27:38Z

I've had a go at implementing issue #6882 in this PR, just for number fields for now. It essentially follows the SneakyBox approach suggested by @roed314.

As always, any comments/feedback are very welcome! 🙂 At present, this is only implemented for number fields, but I'll hopefully extend this to other sections soon.

How it works:

A function multi_entry_jump_search has been added to search_wrapper.py. This is meant to be a generic handler which parses a comma-separated list of entries (e.g. labels, names, polynomials or equations) given as an input string in the jump box. Each entry is processed using a custom section-specific parser function parse_entry (e.g. for number fields, this is just nf_string_to_label).

If any entries cannot be parsed, it flashes an info message giving the number of invalid entries. If all entries are invalid, it flashes an error message and returns the usual home search page.
Also in search_wrapper.py, a function parse_labels has been added to convert a "?labels=..." URL query into a database query of the form {"$in": [...]} on the label column.
I've tried to keep the section-specific code to a minimum. In particular, the only updates to number_field.py are some code at the beginning of number_field_jump, which first runs the generic multi_entry_jump_search parser, and the setting of the label_knowl argument (used to define the labels SneakyBox).
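As a rough illustration of the flow described above (not the code in this PR), here is a minimal sketch of splitting the jump box input, parsing each entry and counting the invalid ones. The names parse_jump_entries and toy_parse_entry are hypothetical, the nickname table is a toy stand-in for nf_string_to_label, and I'm assuming the parser signals failure by raising ValueError.

```python
def parse_jump_entries(raw, parse_entry, sep=","):
    """Split the jump box string and parse each entry.

    Returns (labels, n_invalid). Hypothetical helper, for illustration only.
    """
    labels, n_invalid = [], 0
    for entry in raw.split(sep):
        entry = entry.strip()
        if not entry:
            continue  # ignore empty pieces such as a trailing comma
        try:
            labels.append(parse_entry(entry))
        except ValueError:
            # Assumption: the section-specific parser raises on bad input
            n_invalid += 1
    return labels, n_invalid


def toy_parse_entry(entry):
    """Toy stand-in for a section-specific parser such as nf_string_to_label."""
    nicknames = {"Qi": "2.0.4.1", "Qsqrt7": "2.2.28.1"}  # illustrative table only
    if entry in nicknames:
        return nicknames[entry]
    parts = entry.split(".")
    if len(parts) == 4 and all(part.isdigit() for part in parts):
        return entry  # already looks like a number field label
    raise ValueError(f"cannot parse {entry!r}")


labels, n_invalid = parse_jump_entries("3.3.49.1, banana, Qi", toy_parse_entry)
```

In this sketch the caller would flash an info message when n_invalid > 0 and fall back to the index page when labels is empty.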

Examples for testing:

If there's no comma in the jump box, then it treats the input as usual. E.g.

http://localhost:37777/NumberField/?jump=2.2.5.1
http://localhost:37777/NumberField/?jump=Qsqrt7
http://localhost:37777/NumberField/?jump=x%5E2+-+3
E.g. a given list of labels:

http://localhost:37777/NumberField/?jump=2.2.5.1%2C+2.2.8.1%2C+2.2.12.1
E.g. a mix of labels, nicknames, polynomials:

http://localhost:37777/NumberField/?jump=2.2.5.1%2C+Qsqrt11%2C+x%5E3+-+2%2C+Qzeta10
E.g. a mix of labels and invalid entries:

http://localhost:37777/NumberField/?jump=3.3.49.1%2C+banana%2C+x%5E2+-+7
E.g. some random nonsense:

http://localhost:37777/NumberField/?jump=1234%2C+banana%2C+%21%21%21%2C+asdfghjk

For now, I've just put this as a draft, just to get any preliminary feedback on whether this looks ok, or whether this should maybe be implemented in a different way. If the editors are happy with the above implementation, I can then implement this for all other sections of the LMFDB where we'd like to support a multi-label search in the "Find" jump box. :)

roed314 · 2026-04-11T18:16:07Z

+    if not labels_input or not hasattr(table, "_label_col"):
+        return
+
+    labels, seen = [], set()


I think you could just do labels = list(set(label.strip() for label in labels_input.split(","))).

thanks, done!

Perhaps this doesn't matter too much, but would it maybe make more sense for parse_labels to instead be in the utils/search_parsing.py file?
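For reference, a minimal sketch of the kind of conversion discussed in this thread, turning a "?labels=..." string into an {"$in": [...]} condition. The function name labels_to_query and the label_col default are hypothetical and not taken from the PR. It uses dict.fromkeys rather than set so that duplicates are dropped while input order is kept; for an $in query the order should not matter, so list(set(...)) would serve equally well there.

```python
def labels_to_query(labels_input, query, label_col="label"):
    """Add an {"$in": [...]} condition on label_col to query (modified in place).

    Hypothetical helper for illustration; not the parse_labels in the PR.
    """
    if not labels_input:
        return
    # Strip whitespace, drop empty pieces, and de-duplicate keeping first occurrences
    labels = list(dict.fromkeys(
        label.strip() for label in labels_input.split(",") if label.strip()
    ))
    if labels:
        query[label_col] = {"$in": labels}


query = {}
labels_to_query("2.2.5.1, 2.2.8.1,2.2.5.1 ,", query)
```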

roed314 · 2026-04-11T18:31:14Z

+    not_parsed, not_found = 0, 0
+    for entry in entries:
+        try:
+            label = parse_entry(entry)


@jwj61 expressed concern about polredabs getting called many times. I wonder if you could also add a progressive timeout here, where you stop if the total amount of parsing time surpasses an amount determined by a keyword.

Ah, of course, thanks - I forgot about this!

I've added a timer to this for loop. If the elapsed time reaches the value set by time_limit (I've put a default of 30 seconds), then it flashes an error and returns the index page. At the moment, the timer is only checked between entries, so I am assuming that parsing any single entry won't take too long.
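A minimal sketch of a timer of this kind, checked once before each entry is parsed. The names parse_with_time_limit and ParseTimeout are hypothetical, and raising an exception here stands in for flashing an error and returning the index page. The clock argument is only there so that the behaviour can be exercised without actually waiting.

```python
import time


class ParseTimeout(Exception):
    """Raised when the total parsing time exceeds the limit (hypothetical)."""


def parse_with_time_limit(entries, parse_entry, time_limit=30.0, clock=time.monotonic):
    """Parse entries in order, giving up once time_limit seconds have elapsed.

    The clock is only consulted between entries, so a single slow call to
    parse_entry can still overrun the limit.
    """
    start = clock()
    labels = []
    for entry in entries:
        if clock() - start > time_limit:
            raise ParseTimeout(f"parsing exceeded {time_limit} seconds")
        labels.append(parse_entry(entry))
    return labels
```

A stricter limit would need to interrupt parse_entry itself (e.g. by running it in a separate process), which is considerably more involved than the between-entries check shown here.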

roed314 · 2026-04-11T18:37:21Z

+    - ``index_endpoint`` -- the URL for the index homepage for this section
+    - ``input_key`` -- the dictionary key for the jump search box (default: "jump")
+    - ``labels_key`` -- the dictionary key for the labels search query (default: "labels")
+    - ``sep`` -- A string used as the separator for parsing the jump box input (default: ",")


Maybe you can provide sep as a function, defaulting to lambda x: re.split(",", x). Then you can allow Q(sqrt2,sqrt3) for a field name even though it has a comma in it by making a more complicated splitting function.

Thanks, this is a great suggestion!

I've had a go at writing a function split_top_level_commas, currently placed just above multi_entry_jump_search. This takes an input string and returns a list of substrings, splitting only on commas that are not inside any parentheses/brackets/braces. Since this would probably give the intended behaviour for most sections (not just number fields), I've made this the default separator function.
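To make the idea concrete, here is one way such a splitter could look. This is a sketch based on a simple nesting-depth counter and is not necessarily how the function in the PR is written. It does not check that bracket types match, and an unmatched closing bracket at depth zero is simply ignored.

```python
def split_top_level_commas(text):
    """Split text on commas that are not nested inside (), [] or {}.

    Sketch only: tracks nesting depth and ignores bracket-type mismatches.
    """
    parts, current, depth = [], [], 0
    for ch in text:
        if ch in "([{":
            depth += 1
        elif ch in ")]}" and depth > 0:
            depth -= 1
        if ch == "," and depth == 0:
            # Top-level comma: close off the current entry
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    parts.append("".join(current).strip())
    return [part for part in parts if part]  # drop empty entries


entries = split_top_level_commas("Q(sqrt2,sqrt3), 2.2.5.1, x^2 - 7")
# entries == ["Q(sqrt2,sqrt3)", "2.2.5.1", "x^2 - 7"]
```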

roed314 · 2026-04-11T18:37:59Z

+    - ``info`` -- the info dictionary passed in from front end
+    - ``parse_entry`` -- a custom function which converts a string (e.g. polynomial, equation, nickname etc) to be parsed into label
+    - ``label_exists`` -- a custom function which determines whether a given label exists in the database
+    - ``index_endpoint`` -- the URL for the index homepage for this section


technically this is the input to url_for, not a url itself.

thanks, fixed!


rvisser7 · 2026-04-12T02:52:24Z

Just for fun, maybe I can also mention another nice consequence of this PR: it also provides a convenient way to get a search page for essentially any parametrised family of fields directly via the Find box (or the Labels box). So in particular, I think this also gives at least some partial progress towards issue #6948 🙂

E.g. to obtain a search results page for the first few cyclotomic fields (ordered by degree), we can just copy-paste the Python output of ",".join("Qzeta"+str(n) for n in range(100)) into the Find box. Or similarly, to get a search for pure cubic fields, we just paste the output of ",".join("x^3-"+str(n) for n in range(1000)) .
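Instead of copy-pasting, the same kind of list can be turned into a jump link directly. The sketch below uses the standard library's urlencode; the helper name jump_url is hypothetical, and the base URL is just the local test server address used in the examples earlier in this PR.

```python
from urllib.parse import urlencode


def jump_url(entries, base="http://localhost:37777/NumberField/"):
    """Build a Find-box ("jump") URL for a list of entries (illustrative helper)."""
    return base + "?" + urlencode({"jump": ", ".join(entries)})


# The three-label example from the PR description
url = jump_url(["2.2.5.1", "2.2.8.1", "2.2.12.1"])

# A small family of pure cubic polynomials x^3 - n
cubic_url = jump_url(["x^3-" + str(n) for n in range(2, 5)])
```

urlencode takes care of escaping commas, spaces and the ^ character, so the first URL should come out the same as the hand-written example with %2C and + above.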

Just for convenience, I've given some links to search pages for some of the families mentioned in #6948 below. To estimate the polredabs cost, I've also given some rough estimates of the time taken for each page to load using the "jump" link (measured using the Legendre server):

Cyclotomic fields $\mathbb{Q}(\zeta_n)$, for $n \leq 100$ (jump, labels). Takes < 1 sec to load.
Maximal totally real subfields $\mathbb{Q}(\zeta_n)^+$ of cyclotomics, for $n \leq 100$ (jump, labels). Takes < 1 sec to load.
Pure cubic fields $\mathbb{Q}(\sqrt[3]{a})$ for $0 \leq a < 1000$ (jump, labels). Takes ~11 sec to load.
Pure quartic fields $\mathbb{Q}(\sqrt[4]{a})$ for $0 \leq a < 1000$ (jump, labels). Takes ~12 sec to load.
Simplest cubic fields $\mathbb{Q}[x]/(x^3 - ax^2 - (a+3)x - 1)$ for $0 \leq a < 1000$ (jump, labels). Takes ~11 sec to load.
Ennola's cubic fields $\mathbb{Q}[x]/(x^3 + (a-1)x^2 - ax - 1)$ for $0 \leq a < 1000$ (jump, labels). Takes ~10 sec to load.

rvisser7 added 7 commits March 11, 2026 15:00

Add SneakyBox to search boxes

d3d727a

Add multi-label search to search_wrapper

0bc76de

Implements multi-label search functionality in number_field and search_wrapper

49439a5

Add 'handle_multi_jump_search' function and update search array handling

53a422f

Add tests for multi-label search

7306887

Change "labels_knowl" to "label_knowl"

9571a66

Merge branch 'main' into new_jump_box

f8c122c

rvisser7 marked this pull request as draft April 10, 2026 23:32

rvisser7 added 4 commits April 10, 2026 20:37

Correct number field multi-label jump test

7579f46

Some light edits to the multi-entry jump box search parsing

af1edea

Small typo fix in number field import

2332fac

Small bug fixes

98e9e09

roed314 reviewed Apr 11, 2026

rvisser7 added 8 commits April 11, 2026 18:04

Added time_limit (with timer) to multi_entry_jump_search

3879b6f

Some minor extra comments added

6874f3b

Fix some typos in parse_labels function

4dfa224

Add split_top_level_commas function for parsing jump box input

f1bbfdc

Moved split_top_level_commas function for jump box input parsing and update multi_entry_jump_search to use it as default separator

8dbe81f

Small change to parse_labels function

255215f

Small whitespace edit in number_field.py

bcf1d9e

Update multi_entry_jump_search docstring to clarify default values for sep and time_limit

3074823

rvisser7 marked this pull request as ready for review April 12, 2026 16:08

rvisser7 wants to merge 19 commits into LMFDB:main from rvisser7:new_jump_box