fix: add typecheck #3550

Samoed · 2025-11-11T22:08:17Z

If you add a model or a dataset, please add the corresponding checklist:

KennethEnevoldsen · 2025-11-12T16:08:10Z

mteb/results/benchmark_results.py

        return type(self).model_construct(model_results=new_model_results)

-    def join_revisions(self) -> Self:
+    def join_revisions(self) -> "BenchmarkResults":


I don't think it needs the "

I can change to this, but would require __future__

Would prefer __future__

KennethEnevoldsen · 2025-11-12T16:08:43Z

mteb/results/benchmark_results.py


-    def __iter__(self) -> Iterator[ModelResult]:
+    @override
+    def __iter__(self) -> Iterator[ModelResult]:  # type: ignore[override]


I am not sure I get the type error here?

BaseModel have it's own iterator

Ahh, but it still gives you a type error even with the decorator (seems to me like we are specifying it twice)

Yes, I'll remove it

mteb/results/benchmark_results.py

KennethEnevoldsen · 2025-11-12T16:10:24Z

mteb/models/instruct_wrapper.py

        """
        sentences = [text for batch in inputs for text in batch["text"]]
-        instruction = self.get_task_instruction(task_metadata, prompt_type)
+        instruction: str | None = self.get_task_instruction(task_metadata, prompt_type)


shouldn't this be annotated by the function?

Get task instruction will always return string, but later we have instructions = None and this cause conflict. Probably later can be changed to empty string

Then I would probably do like this:

Suggested change

instruction: str | None = self.get_task_instruction(task_metadata, prompt_type)

instruction: str | None # further up

instruction = self.get_task_instruction(task_metadata, prompt_type)

otherwise we are suggesting that the function could also return None even though it can't

KennethEnevoldsen · 2025-11-12T16:12:59Z

mteb/filter_tasks.py

-    exclude_aggregate: bool = False,
-    exclude_private: bool = False,
-) -> list[type[AbsTask]]: ...
+T = TypeVar("T", AbsTask, type[AbsTask])


Does this guarantee that if I provide AbsTask, then I won't get an type[AbsTask] out?

Also how does this influence the docs?

Does this guarantee that if I provide AbsTask, then I won't get an type[AbsTask] out?

Yes

Also how does this influence the docs?

I'll check

KennethEnevoldsen · 2025-11-12T16:14:17Z

mteb/evaluate.py

 def evaluate(
    model: ModelMeta | MTEBModels | SentenceTransformer | CrossEncoder,
-    tasks: AbsTask | Iterable[AbsTask],
+    tasks: AbsTask | Benchmark | Iterable[AbsTask | Benchmark],


Can it be an iterable of benchmarks? Benchmark should be an iterable of AbsTask

Hm, it seems that we don't support this, because I mistakenly looked to deprecated evaluator

mteb/tests/test_deprecated/test_MTEB.py

Line 16 in fe83e27

def test_run_using_benchmark(model: mteb.EncoderProtocol, tmp_path: Path):

Probably we need to add a test for new evaluator too

We can add support for it, but I am leaning toward not doing it. Keeps it cleaner and it is not like a for loop across benchmarks is hard to do.

KennethEnevoldsen · 2025-11-12T16:15:54Z

mteb/evaluate.py

 ) -> tuple[MTEBModels | ModelMeta, ModelMeta, ModelName, Revision]:
    from sentence_transformers import CrossEncoder, SentenceTransformer

+    wrapped: MTEBModels | ModelMeta


Suggested change

wrapped: MTEBModels | ModelMeta

wrapped_model: MTEBModels | ModelMeta

KennethEnevoldsen · 2025-11-12T16:17:18Z

mteb/deprecated_evaluator.py

-        self.tasks = list(tasks)
-        if len(self.tasks) > 0 and isinstance(self.tasks[0], Benchmark):
+        if isinstance(tasks, list) and all(
+            isinstance(task, Benchmark) for task in tasks
+        ):
            self.benchmarks = tasks
-            self.tasks = list(chain.from_iterable(self.tasks))


unsure why this was changed?

KennethEnevoldsen · 2025-11-12T16:17:43Z

mteb/cli/generate_model_card.py

 def generate_model_card(
    model_name: str,
-    tasks: list[AbsTask] | None = None,
+    tasks: Sequence[AbsTask] | None = None,


iterable or sequence?

KennethEnevoldsen · 2025-11-12T16:19:18Z

mteb/abstasks/abstask.py

+class AbsMetrics(TypedDict):
+    """The abstract class for the metrics returned by the tasks"""
+
+    ...
+


Unsure why this is added?

I tried to standartize, because dict is not compatible with mappting, but I will remove this I think and change evaluate subset to Mapping, which can handle both

Co-authored-by: Kenneth Enevoldsen <[email protected]>

github-actions · 2025-11-29T02:12:15Z

This pull request has been automatically marked as stale due to inactivity.

Samoed added 4 commits November 11, 2025 18:58

add pytyped

a09734f

start typing

98eab29

finish evaluators

e028aea

add more types

86e7efd

KennethEnevoldsen changed the title ~~feat: add typecheck~~ fix: add typecheck Nov 12, 2025

KennethEnevoldsen reviewed Nov 12, 2025

View reviewed changes

Update mteb/results/benchmark_results.py

84ab864

Co-authored-by: Kenneth Enevoldsen <[email protected]>

github-actions bot added the stale label Nov 29, 2025

	instruction: str \| None = self.get_task_instruction(task_metadata, prompt_type)
	instruction: str \| None # further up
	instruction = self.get_task_instruction(task_metadata, prompt_type)

	wrapped: MTEBModels \| ModelMeta
	wrapped_model: MTEBModels \| ModelMeta

fix: add typecheck #3550

Are you sure you want to change the base?

fix: add typecheck #3550

Uh oh!

Conversation

Samoed commented Nov 11, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Nov 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants