-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug transformer fitting #385
Merged
Merged
Changes from all commits
Commits
Show all changes
31 commits
Select commit
Hold shift + click to select a range
d44ee76
One transformer set for each fold and for train_valid dataset
stewarthe6 10675c6
Returned all_training_datasets
stewarthe6 71d0859
creating transformers now expects a DeepChem dataset instead of a mod…
stewarthe6 a2520a4
Added fold as a parameter to more functions, 'final' is the name of t…
stewarthe6 18987d3
No more transformers in perf_data, No more transformers in EpochManag…
stewarthe6 4689f9e
Switched back to saving one .pkl for all transformers. The pkl is sti…
stewarthe6 cd14bd9
Fixed issue where _create_*_transformers sometimes would not return a…
stewarthe6 00a74af
Missing model_dataset argument
stewarthe6 9cb3f4d
Removed double nested list
stewarthe6 72172a2
Updated transformer path in test
stewarthe6 9863aef
Removed fold argument and added backwards transformer functionality
stewarthe6 a40dffa
Removed a few 'final' arguments that are no longer used
stewarthe6 6332c60
Removed unused imports
stewarthe6 398cf06
specified fold for embedding features
stewarthe6 218f70d
More tests for perf_data
stewarthe6 78f7e35
Test to make sure transformers are fit correctly on training data only
stewarthe6 804a62b
Added check to make sure that every requested id in the subset has a …
stewarthe6 23bd84f
call get_untransformed_responses instead
stewarthe6 61c18fd
Cache the untransformed response dict
stewarthe6 9c8abc3
Should not have to pass a 'final' argument
stewarthe6 a7ed96a
Weights and y should be the same shape
stewarthe6 b0940ad
dataset transformation moved into generate_predictions()
paulsonak a7eb892
zero out transformed values that are larger than 1e30
paulsonak d1b25d8
get_untransformed_responses returns an array, not a dictionary
stewarthe6 74c1953
sped up and updated the test
stewarthe6 58c454d
Updated transformer test to correctly test the standard deviation and…
stewarthe6 d17bfc4
update large values to be capped at 1e30
paulsonak d5b3b55
Added a test for kfold cross validation transformers
stewarthe6 069fa28
Merge branch 'bug_transformer_fitting' of github.com:ATOMScience-org/…
stewarthe6 3fb2f45
Test for y transformers
stewarthe6 d309761
Removed unused 'fold' parameter. Added documentation for this PR
stewarthe6 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this
get_all_training_datasets
seems like a function that belongs inmodel_datasets
?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I disagree. The structure created by get_all_training_datasets is very specific to how transformers are implemented and I would like all that code in one place. Eventually I think there should be a TransformerManager object that can build all needed transformers given a dataset.