Fix champs-scalar-coupling to include test molecule structures #70

jvpoulos · 2025-09-04T11:35:33Z

The prepare script explicitly filters structures to only include train molecules. This is incorrect for this competition --- in the Kaggle competition, structures.csv contains all molecules (both train and test). The test molecules need their structures to make predictions, but they're being filtered out.

Changes:

Modified prepare.py to include both train and test molecules in structures.csv
Updated checksums.yaml to reflect the new structures.csv checksum
Updated assertions to validate both train and test molecules

After fixing the data preparation issue, I used a LightGBM model trained on 25% of the data to make predictions:

  {
    "competition_id": "champs-scalar-coupling",
    "score": 0.5823,
    "gold_threshold": -2.87509,
    "silver_threshold": -2.03119,
    "bronze_threshold": -1.90122,
    "median_threshold": -0.9529,
    "any_medal": false,
    "gold_medal": false,
    "silver_medal": false,
    "bronze_medal": false,
    "above_median": false,
    "submission_exists": true,
    "valid_submission": true,
    "is_lower_better": true,
    "created_at": "2025-09-03T22:56:19.592098",
    "submission_path": "mlebench/competitions/champs-scalar-coupling/submission.csv"
  }

…uctures.csv

thesofakillers · 2025-09-08T10:20:51Z

Thank you for catching this. You are correct and there is a mistake in the prepare.py, and your fix seems right.

As explained in the readme in #66 we won't be merging this fix in yet, and will release it as a batch of fixes in a upcoming v2 to be released on openai/preparedness. I've added as tracked in #71. I will try to put you as co-author for when we release the fix.

For submissions to the v1 leaderboard, please proceed as if this issue was not present.

Fix champs-scalar-coupling to include test molecule structures in str…

043a279

…uctures.csv

thesofakillers added a commit that referenced this pull request Sep 8, 2025

catalogue issue described in #70

52e6ad4

thesofakillers mentioned this pull request Sep 8, 2025

Catalogue issue described in #70 re champs-scalar-coupling #71

Merged

thesofakillers closed this Sep 8, 2025

thesofakillers added a commit that referenced this pull request Sep 8, 2025

catalogue issue described in #70 (#71)

4c4a6ff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix champs-scalar-coupling to include test molecule structures #70

Fix champs-scalar-coupling to include test molecule structures #70

Uh oh!

jvpoulos commented Sep 4, 2025 •

edited

Loading

Uh oh!

thesofakillers commented Sep 8, 2025

Uh oh!

Uh oh!

Fix champs-scalar-coupling to include test molecule structures #70

Fix champs-scalar-coupling to include test molecule structures #70

Uh oh!

Conversation

jvpoulos commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes:

Uh oh!

thesofakillers commented Sep 8, 2025

Uh oh!

Uh oh!

jvpoulos commented Sep 4, 2025 •

edited

Loading