
Feat/readability metric #8

Open
Noel-Bastubbe wants to merge 15 commits into main from feat/readability-metric

Conversation

@Noel-Bastubbe
Collaborator

No description provided.

@Noel-Bastubbe Noel-Bastubbe marked this pull request as ready for review February 27, 2026 12:02
@Noel-Bastubbe Noel-Bastubbe requested a review from lisehr February 27, 2026 12:03
@Tomic-Riedel
Collaborator

Hey, Lisa tasked me with checking the PRs for framework conventions and I found a few things to address before we can merge this:

metis/metric/readability/__init__.py and README.md
Neither of these should exist inside a dimension folder. None of the other dimensions (completeness/, consistency/, etc.) have them, and it's important to keep that consistent. Please remove both files and move any relevant documentation to docs/ or the main README.

Class naming
I see that ReadabilityLLM and ReadabilityWordNet are the "real" classes, with readability_llm and readability_wordnet as thin alias subclasses at the bottom just for registry registration. The convention (which all four existing metrics follow) is that the metric class itself carries the {dimension}_{technique} name and there shouldn't be an intermediate PascalCase class at all. Could you flatten these into a single class each?
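For illustration, the suggested flattening could look roughly like the sketch below. Note that `Metric`, the `register` decorator, and the `assess` body are placeholders standing in for metis's actual base class and registry mechanism, which are not shown in this PR excerpt.

```python
# Toy stand-ins for the framework's base class and registry (assumptions,
# not metis's real API).
REGISTRY: dict = {}

def register(cls):
    """Register a metric class under its class name."""
    REGISTRY[cls.__name__] = cls
    return cls

class Metric:
    def assess(self, data):
        raise NotImplementedError

# After flattening: a single class that both implements the metric and
# carries the {dimension}_{technique} registry name directly, with no
# intermediate PascalCase class.
@register
class readability_wordnet(Metric):
    def assess(self, data):
        # placeholder scoring logic
        return 0.0
```

The point is only the shape: the registry sees exactly one class per metric, named `{dimension}_{technique}`.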

DQmetric values
DQmetric="llm" and DQmetric="readability_wordnet" don't follow the PascalCase short-name convention we use elsewhere (e.g. "NullRatio", "CountFDViolations", "DuplicateCount"). Please update to something like "LLM" and "WordNet".

Helper files in the dimension folder
llm_backend.py, scorers.py, and tokenization.py are utility modules sitting inside metis/metric/readability/. Was there a particular reason for keeping them there? If not, the convention is that helpers live in metis/utils/. In that case I'd suggest moving them there, possibly under a metis/utils/readability/ subfolder if they're readability-specific.

DQgranularity="schema"
This introduces a new granularity level that doesn't exist anywhere else in the framework right now. Before we add a new one I think we should loop in @lisehr to check whether a "schema" granularity level makes sense to add to the framework.

Docstrings
Both assess() methods are missing docstrings. The base class has a detailed template for exactly this, so please add them.
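As a rough illustration of the expected shape, an `assess()` docstring might look like the following. This is a hypothetical sketch; the base class's actual template in metis may prescribe different sections.

```python
class readability_wordnet:
    def assess(self, data):
        """Assess the readability of ``data`` using WordNet-based scoring.

        Args:
            data: The dataset (or column) to evaluate. (Hypothetical
                signature; the real parameter shape is an assumption.)

        Returns:
            A float readability score, where higher means more readable.
        """
        return 0.0
```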

Thanks!

@@ -0,0 +1,51 @@
from __future__ import annotations

import re

Please add a short comment explaining what this does and why the built-in Python tokenization facilities couldn't be used.
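For context, a regex-based word tokenizer along these lines typically looks like the sketch below. This is illustrative only; the actual pattern used in this PR is not visible in the excerpt and is an assumption.

```python
import re

def tokenize_words(text: str) -> list[str]:
    """Split text into word tokens, keeping internal apostrophes.

    Sketch only; the regex used in the PR is an assumption.
    """
    return re.findall(r"[A-Za-z']+", text)
```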


@Tomic-Riedel thanks for all the comments. We agreed that a "schema" granularity makes sense for readability, since we might add more schema-level metrics in the future. For now, we focused on the data level by default.

# Utilities
# ----------------------------

def load_abbreviations(abbr_csv: Optional[str]) -> Dict[str, str]:

Also here, please add a short comment explaining what this function does.
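A documented version of `load_abbreviations` might look like the following. This is a sketch only: the real CSV layout and error handling in this PR are assumptions, not taken from the actual diff.

```python
import csv
from typing import Dict, Optional

def load_abbreviations(abbr_csv: Optional[str]) -> Dict[str, str]:
    """Load an abbreviation -> expansion mapping from a two-column CSV.

    Sketch only: the file layout (abbreviation in column 1, expansion in
    column 2) is an assumption. Returns an empty mapping when no path
    is given.
    """
    if not abbr_csv:
        return {}
    mapping: Dict[str, str] = {}
    with open(abbr_csv, newline="", encoding="utf-8") as fh:
        for row in csv.reader(fh):
            if len(row) >= 2:
                mapping[row[0].strip()] = row[1].strip()
    return mapping
```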


@lisehr lisehr left a comment


LGTM and free to merge together with the other files; the small comments can be fixed later on.

@lisehr lisehr self-assigned this Mar 22, 2026


4 participants