-
-
Notifications
You must be signed in to change notification settings - Fork 211
Additional fixes to PR 777 #967
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Checks manually cancelled. |
|
@PGijsbers wrt to why an experimental To handle mixed-type data frames with missing values, we need to create a transformation which imputes these missing values for both categorical and continuous types. Under the current OpenML setup, we cannot create a flow with two instances of the same class, a Creating an alias class within the testing script has issues while serializing since it has got no |
|
That makes sense (even though it is a bit unfortunate). However, why not consider creating the example on a task which does not have two types of missing values? While all examples implicitly depend on some version(s) of scikit-learn, it feels using an experimental feature has exceptionally narrow support. Old versions ( |
Yeah I was wondering why that was not done here but then I think we need to change the design of the example.
On further checking, this very task is already the case where missing values exist for categorical columns but not numeric columns. Replacing the experimental model with an RF expectedly worked fine. I pushed that version since your concern with this example being compatible across versions is an apt assessment. @mfeurer and I were earlier trying to come up with an example that allows the generic case of handling missing values across types. Given the constraints of the setup, the pipeline with the experimental model was the only way out we could think of. |
mfeurer
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me. I'll approve once I see that the tests are green.
Reference Issue
PR #777.
What does this PR implement/fix? Explain your changes.
Makes the changes/improvements suggested by @PGijsbers that were not addressed prior to merge.