Fix dynamic top-k index scores by vdplasthijs · Pull Request #98 · WUR-AI/aether

vdplasthijs · 2026-04-23T16:34:51Z

What does this PR do?

Calculate theta_k for train only, and reuse for val and test.
Catch edge cases max = min leading to zero div
Printing optional, off by default
Elbow method removes nans and skips leading zeros for tighter distribution.
Update default concept caption v

Before submitting

Did you make sure title is self-explanatory and the description concisely explains the PR?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you list all the breaking changes introduced by this pull request?
Did you test your PR locally with pytest command?

- Calculate theta_k for train only, and reuse for val and test. - Catch edge cases max = min leading to zero div - Printing optional, off by default - Elbow method removes nans and skips leading zeros for tighter distribution. - Update default concept caption v

vdplasthijs · 2026-04-23T16:36:06Z

All index scores should now be bound between -1 and 1.

gabrieletijunaityte

Looks great! Thank you for the changes.

gabrieletijunaityte · 2026-04-24T11:09:38Z

Do you think we could cache these elbow values as the data split is consistent? Or is the computation very quick?

vdplasthijs · 2026-04-28T06:48:39Z

@gabrieletijunaityte thanks! What do you mean by cached here? The values are only computed for the train split; then cached (in the dictionary) and retrieved for the other splits. See ‎src/models/text_alignment_model.py‎ line 134.

vdplasthijs · 2026-04-28T12:47:45Z

Should be OK now @gabrieletijunaityte !

- Calculate thresholds on train set. - Option (default True) to use stored thresholds if available. - Option (default True) to save newly calculated values. - Option (default False) to also compute new thresholds if old thresholds are available for some concepts.

v2 (old) has hand picked theta values. v3 uses the values from v2 but adds baseline values. v4 has newly calculated thresholds (using elbow).

vdplasthijs · 2026-05-21T15:32:50Z

@gabrieletijunaityte could you please review again? I made the following changes:

Three things need to be stored: theta_k and is_max (same for all splits), and n_baseline (different per split). These can now be stored by the caption builder by passing the concept_config object.
I thought about moving all the calculations to a separate pre-processing file .. I see the upside of cleaner code, but downside is that you would still need to load the data via a datamodule (to get the splits, to get the separate n_baseline), which then needs config files etc. I think it's easier (for new UCs) to just do it automatically, but only if needed. This is similar to how we handle creating data splits (also in datamodule). Hope that makes sense!
Now; it will only calculate if values are not present for all concepts, or if requested specifically even if they are present (default False). Thresholds are calculated using the training data only. :)
By default, if new values are calculated, they are stored in a new file (1 version up).
It checks whether the is_max parameter matches the data (so is_max=True when the minority of points is greater than theta_k). If it does not match a pre-set is_max, it drops the caption. (If no is_max is set, it calculates and adds it).

I think that's all! Needed more features than I initially thought but all works well now. The only thing that I haven't implemented is to look for uni- vs bimodal distributions and add both max and min accordingly. Not sure if it's worth it, but it's possible ..

- Save mode - Save average index

vdplasthijs · 2026-05-27T10:36:55Z

@gabrieletijunaityte could you please review again? I have now:

Moved all these functionalities to data module.
Storing baseline accuracy instead of n baseline.
Fixed metric logging.

gabrieletijunaityte

Looks good, thanks for all the iterations!

vdplasthijs added 2 commits April 23, 2026 16:41

Merge branch 'develop' into aligment_experiments

5954997

vdplasthijs requested a review from gabrieletijunaityte April 23, 2026 16:34

Merge branch 'develop' into aligment_experiments

8795be7

gabrieletijunaityte approved these changes Apr 24, 2026

View reviewed changes

gabrieletijunaityte reviewed Apr 28, 2026

View reviewed changes

Comment thread src/models/text_alignment_model.py Outdated

Fix caching

b9cf1ec

gabrieletijunaityte reviewed Apr 29, 2026

View reviewed changes

Comment thread src/models/text_alignment_model.py Outdated

vdplasthijs and others added 8 commits May 20, 2026 11:42

Merge branch 'develop' into aligment_experiments

15051a3

Merge branch 'develop' into aligment_experiments

e3a92ed

Caption builder function write a new concept caption file

0089c5a

New concept captions

a5030f0

v2 (old) has hand picked theta values. v3 uses the values from v2 but adds baseline values. v4 has newly calculated thresholds (using elbow).

Checks if min/max corresponds to less than 50% of data

9f6b8e4

set is_max if needed

51c33ae

Allow for calcuting/saving is_max only

ada3d4c

vdplasthijs requested a review from gabrieletijunaityte May 22, 2026 09:25

vdplasthijs added 4 commits May 27, 2026 11:15

store baseline accuracy

3cc5156

Move concept caption threshold creation to datamodule

878c6f9

Fix logging alignment model

b2035b5

- Save mode - Save average index

Change concept caption files

b5a9f02

gabrieletijunaityte approved these changes May 27, 2026

View reviewed changes

gabrieletijunaityte merged commit 4bd13de into develop May 27, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix dynamic top-k index scores#98

Fix dynamic top-k index scores#98
gabrieletijunaityte merged 16 commits into
developfrom
aligment_experiments

vdplasthijs commented Apr 23, 2026

Uh oh!

vdplasthijs commented Apr 23, 2026

Uh oh!

gabrieletijunaityte left a comment

Uh oh!

gabrieletijunaityte commented Apr 24, 2026

Uh oh!

vdplasthijs commented Apr 28, 2026

Uh oh!

Uh oh!

vdplasthijs commented Apr 28, 2026

Uh oh!

Uh oh!

vdplasthijs commented May 21, 2026

Uh oh!

vdplasthijs commented May 27, 2026

Uh oh!

gabrieletijunaityte left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vdplasthijs commented Apr 23, 2026

What does this PR do?

Before submitting

Uh oh!

vdplasthijs commented Apr 23, 2026

Uh oh!

gabrieletijunaityte left a comment

Choose a reason for hiding this comment

Uh oh!

gabrieletijunaityte commented Apr 24, 2026

Uh oh!

vdplasthijs commented Apr 28, 2026

Uh oh!

Uh oh!

vdplasthijs commented Apr 28, 2026

Uh oh!

Uh oh!

vdplasthijs commented May 21, 2026

Uh oh!

vdplasthijs commented May 27, 2026

Uh oh!

gabrieletijunaityte left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants