Unfail repository tests by nikie · Pull Request #251 · awslabs/python-deequ

nikie · 2025-06-06T19:35:06Z

Issue #, if available:
There are no user-facing changes.

Description of changes:

Unfail repository tests by removing pytest.mark.xfail marks and asserting the failing statements with self.assertRaises.
Apply black formatting to repository tests.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

github-actions

Generated by AI (model: us.anthropic.claude-opus-4-6-v1, prompt: 8c93b14f) — may not be fully accurate. Reply if this doesn't help.

github-actions · 2026-04-27T22:39:02Z

@@ -1,13 +1,13 @@
 # -*- coding: utf-8 -*-


nit: the pytest import was removed, which is correct since pytest.mark.xfail is no longer used. The from py4j.protocol import Py4JError import is still needed because the test uses Py4JError, so this is fine.

github-actions · 2026-04-27T22:39:02Z

-            .saveOrAppendResult(resultKey)
-            .run()
+        with self.assertRaises(Py4JError) as err:
+            _ = (


The test_fail_no_load test asserts the error message string exactly. repository.before(...) calls self._check_RepositoryLoader(), which evaluates if not self.RepositoryLoader. Because RepositoryLoader is never initialized in __init__ of FileSystemMetricsRepository (or its parent MetricsRepository), that attribute access raises AttributeError: 'FileSystemMetricsRepository' object has no attribute 'RepositoryLoader'. This is an implementation detail that could change, so the assertion is correct for the current code but brittle. Consider using self.assertIn with a shorter substring, or just self.assertRaises(AttributeError) without checking the exact message.
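A minimal sketch of the two looser assertions suggested above. FakeRepository is a hypothetical stand-in, not the real FileSystemMetricsRepository; it only mimics the described behaviour of reading an attribute that was never set, so the example runs without Spark or Deequ.

```python
import unittest


class FakeRepository:
    """Hypothetical stand-in: never sets RepositoryLoader in __init__."""

    def before(self, timestamp):
        # Reading the unset attribute raises AttributeError before the
        # truthiness check is even evaluated.
        if not self.RepositoryLoader:
            raise ValueError("load() must be called first")
        return self


class TestFailNoLoad(unittest.TestCase):
    def test_type_only(self):
        # Option 1: assert only on the exception type.
        with self.assertRaises(AttributeError):
            FakeRepository().before(0)

    def test_loose_substring(self):
        # Option 2: assert on a short, stable substring instead of the
        # full message, which embeds the class name.
        with self.assertRaises(AttributeError) as err:
            FakeRepository().before(0)
        self.assertIn("RepositoryLoader", str(err.exception))
```

Either form keeps passing if the class is renamed or the interpreter changes the wording of the message; only the attribute name is pinned in the second variant.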

github-actions · 2026-04-27T22:39:02Z

-        """This test should fail because it doesn't call useRepository() before saveOrAppendResult()"""
+        """This run fails because it doesn't call useRepository() before saveOrAppendResult()."""
        metrics_file = FileSystemMetricsRepository.helper_metrics_file(self.spark, "metrics.json")
        print(f"metrics filepath: {metrics_file}")


test_fail_no_useRepository asserts a specific Py4J error message substring "Method saveOrAppendResult([class com.amazon.deequ.repository.ResultKey]) does not exist". This is fragile because the exact error message depends on the Py4J and Deequ JAR versions. If the Deequ version changes, this message could differ. Consider just asserting self.assertRaises(Py4JError) without checking the message content, or use a less specific substring like "saveOrAppendResult".
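The less specific match suggested above could look roughly like this. FakePy4JError and run_without_use_repository are hypothetical stand-ins (the real test would use py4j.protocol.Py4JError and the actual verification run); they are used here only so the sketch runs without a JVM.

```python
import unittest


class FakePy4JError(Exception):
    """Hypothetical stand-in for py4j.protocol.Py4JError."""


def run_without_use_repository():
    # Simulates the failure quoted in the review comment; the real message
    # depends on the Py4J and Deequ JAR versions.
    raise FakePy4JError(
        "Method saveOrAppendResult([class com.amazon.deequ.repository.ResultKey]) "
        "does not exist"
    )


class TestFailNoUseRepository(unittest.TestCase):
    def test_method_name_only(self):
        # Match only the method name, not the full signature or wording.
        with self.assertRaisesRegex(FakePy4JError, "saveOrAppendResult"):
            run_without_use_repository()
```

assertRaisesRegex searches the string form of the exception, so the test still passes if the package path or the rest of the message changes between versions.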

nikie added 2 commits June 6, 2025 22:18

Unfail repository tests.

99f6f3f

Apply black to repository tests.

e234409

github-actions Bot requested changes Apr 27, 2026

nikie wants to merge 2 commits into awslabs:master from nikie:unfail-repository-tests
