@JohT
Owner

@JohT JohT commented Aug 9, 2025

🚀 Feature: Association rule metrics for Git History

Previously, the only metric for pairs of files that changed together was the number of commits they shared. Although intuitive, this count is biased towards files that change very often (README, pom.xml, ...). Several additional metrics from the field of association rule learning, such as Confidence, Lift, and Jaccard Similarity, are now available and integrated into existing and newly added reports. Questions like "Which files change together surprisingly often?" can now be answered with CSV reports and visually in a treemap.
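
To illustrate why the additional metrics help, here is a minimal Python sketch that uses the textbook definitions of Confidence, Lift, and Jaccard Similarity. It is not the implementation from this pull request: the function name `co_change_metrics`, the result keys, and the sample commits are hypothetical and chosen only for illustration. `min_confidence` here is assumed to mean the smaller of the two directional confidences.

```python
from collections import Counter
from itertools import combinations


def co_change_metrics(commits):
    """Compute pairwise co-change metrics (hypothetical helper).

    `commits` is a list where each entry holds the file paths changed
    in one commit. Returns a dict keyed by a sorted (file_a, file_b) tuple.
    """
    total_commits = len(commits)
    file_counts = Counter()  # commits per file
    pair_counts = Counter()  # commits per file pair
    for files in commits:
        unique_files = sorted(set(files))
        file_counts.update(unique_files)
        pair_counts.update(combinations(unique_files, 2))

    metrics = {}
    for (a, b), both in pair_counts.items():
        count_a, count_b = file_counts[a], file_counts[b]
        metrics[(a, b)] = {
            # Raw co-change count: biased towards frequently changed files.
            "commit_count": both,
            # Confidence(A -> B) = P(B | A); keep the smaller direction.
            "min_confidence": min(both / count_a, both / count_b),
            # Lift = P(A and B) / (P(A) * P(B)); > 1 means "more often than chance".
            "lift": both * total_commits / (count_a * count_b),
            # Jaccard = |A and B| / |A or B|.
            "jaccard": both / (count_a + count_b - both),
        }
    return metrics


# Made-up history: README.md changes in most commits,
# Parser.java and ParserTest.java always change together.
commits = [
    ["README.md", "pom.xml"],
    ["README.md", "A.java"],
    ["README.md", "pom.xml", "B.java"],
    ["README.md"],
    ["README.md", "pom.xml"],
    ["Parser.java", "ParserTest.java"],
    ["Parser.java", "ParserTest.java"],
    ["README.md", "pom.xml"],
]
metrics = co_change_metrics(commits)
for pair in [("README.md", "pom.xml"), ("Parser.java", "ParserTest.java")]:
    print(pair, metrics[pair])
```

In this made-up history, the pair README.md and pom.xml has the higher commit count (4 versus 2), while Parser.java and ParserTest.java score higher on Lift and Jaccard Similarity because they never change without each other.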

⚙️ Optimization

@JohT JohT self-assigned this Aug 9, 2025
@JohT JohT force-pushed the feature/git-normalized-co-change-count branch 6 times, most recently from afb4ef1 to 5a29a51 Compare August 12, 2025 07:38
@JohT JohT force-pushed the feature/git-normalized-co-change-count branch 8 times, most recently from d39cd1d to 3e95955 Compare August 23, 2025 09:58
@JohT JohT changed the title Calculate min confidence of two files changing together (normalized co-change count) Association rule metrics for files that changed together based on the git history Aug 23, 2025
@JohT JohT marked this pull request as ready for review August 23, 2025 10:08
@JohT JohT force-pushed the feature/git-normalized-co-change-count branch from 3e95955 to 2976072 Compare August 23, 2025 10:11
@JohT JohT merged commit 260d977 into main Aug 23, 2025
8 checks passed
@JohT JohT deleted the feature/git-normalized-co-change-count branch August 23, 2025 10:36

