General Fixes and Improvements #37

Merged
mjohnson541 merged 5 commits into main from general_improvements
May 21, 2025


@mjohnson541
Collaborator

  1. Fixed a small bug in atom set handling during extension generation.

  2. Fixed a bug in complementary group handling. Complementary groups generated from bond-creation extensions were always treated as complementary during node generation, even though some were not complementary for the training set. Node generation now checks whether the group is complementary with respect to the training data at that node and adds it as a complementary node only if it is.

  3. Removed a duplicate line of code.

  4. Made a small improvement to dictionary generation.

  5. Added occurrence-based weighting (alongside uncertainty) to node selection in the multi-evaluation regressor and made it the default. The application I originally developed the algorithm for involved training data that was distributed differently from the prediction cases. In most applications, however, one should assume the training distribution is the same as the prediction distribution. This change seems to matter less for larger training sets and does not always improve model performance, but it makes a very significant difference to the consistency of model performance in the <1000-datapoint regime.
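The check described in item 2 can be illustrated with a minimal sketch. This is not the code from this PR: `is_complementary_on` and the matcher callables are hypothetical names, and the sketch assumes that "complementary with respect to the training data" means every training item at the node matches exactly one of the two groups.

```python
from typing import Callable, Iterable, TypeVar

T = TypeVar("T")


def is_complementary_on(
    items: Iterable[T],
    matches_ext: Callable[[T], bool],
    matches_comp: Callable[[T], bool],
) -> bool:
    """Return True if every item matches exactly one of the two groups.

    Assumption (not taken from the PR): a candidate complement is only
    "complementary for the training data" when no item at the node matches
    both groups and no item matches neither.
    """
    return all(matches_ext(x) != matches_comp(x) for x in items)


# Toy stand-ins for training items and group matchers (hypothetical).
node_items = [1, 2, 3, 4]
is_even = lambda x: x % 2 == 0
is_odd = lambda x: x % 2 == 1
greater_than_two = lambda x: x > 2

# Even/odd split the items cleanly, so the pair is complementary.
clean_split = is_complementary_on(node_items, is_even, is_odd)
# 4 matches both groups and 1 matches neither, so this pair is rejected.
overlapping = is_complementary_on(node_items, is_even, greater_than_two)
```

Under this assumption, only a candidate that passes the check for the items at the current node would be added as a complementary node.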
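To make the idea in item 5 concrete, here is a rough sketch of occurrence-weighted node selection. The PR does not state the actual scoring formula, so the product of uncertainty and occurrence count used below is an assumption, and `NodeStats`, `select_node`, and `weight_by_occurrence` are hypothetical names rather than the library's API.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class NodeStats:
    name: str
    uncertainty: float  # spread of the node's value estimate (hypothetical field)
    occurrences: int    # number of training datapoints matching the node


def select_node(nodes: Sequence[NodeStats], weight_by_occurrence: bool = True) -> NodeStats:
    """Pick the node with the highest selection score.

    Assumed scoring (not confirmed by the PR): uncertainty multiplied by the
    occurrence count when weighting is enabled, uncertainty alone otherwise.
    """
    def score(node: NodeStats) -> float:
        if weight_by_occurrence:
            return node.uncertainty * node.occurrences
        return node.uncertainty

    return max(nodes, key=score)


candidates = [
    NodeStats("rare", uncertainty=2.0, occurrences=3),
    NodeStats("common", uncertainty=1.0, occurrences=50),
]

weighted_choice = select_node(candidates)  # favors the frequently matched node
unweighted_choice = select_node(candidates, weight_by_occurrence=False)  # favors raw uncertainty
```

With weighting enabled, a node that many training points match can outrank a rarely matched node with higher uncertainty, which fits the assumption that the prediction cases follow the training distribution.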

@mjohnson541 merged commit 4641127 into main on May 21, 2025
1 check passed
@mjohnson541 deleted the general_improvements branch on May 21, 2025 00:55
@mjohnson541 restored the general_improvements branch on June 29, 2025 01:13
