FeatureUnhasher does not support an input_type of dict #236
Comments
As I recall, we added FeatureUnhasher mainly to support HashingVectorizer, so we started with 'string'. At first sight, adding input_type='dict' support is a matter of removing the exception, changing the way ... I don't have immediate plans to implement this feature, but it looks like a good problem for new contributors, so pull requests are welcome! |
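To make the idea above concrete, here is a minimal, self-contained sketch of an unhasher that accepts both input types. It is not eli5's implementation: `ToyUnhasher`, `column_index`, and `column_names` are hypothetical names, and CRC32 stands in for the signed MurmurHash3 that scikit-learn's FeatureHasher uses. The only point it illustrates is that, for dict input, the candidate feature names are the dict keys rather than the tokens themselves.

```python
import zlib
from collections import defaultdict


def column_index(name, n_features):
    # Stand-in hash (assumption): CRC32 modulo the table size.
    # scikit-learn's FeatureHasher uses a signed MurmurHash3 instead.
    return zlib.crc32(name.encode("utf-8")) % n_features


class ToyUnhasher:
    """Hypothetical sketch: remembers which feature names map to each column."""

    def __init__(self, n_features, input_type="string"):
        if input_type not in ("string", "dict"):
            raise ValueError("unsupported input_type: %r" % input_type)
        self.n_features = n_features
        self.input_type = input_type
        self.names_by_column = defaultdict(set)

    def _feature_names(self, sample):
        # 'string' samples are iterables of tokens; 'dict' samples map
        # feature names to values, so only the keys matter here.
        if self.input_type == "dict":
            return sample.keys()
        return sample

    def partial_fit(self, samples):
        for sample in samples:
            for name in self._feature_names(sample):
                idx = column_index(name, self.n_features)
                self.names_by_column[idx].add(name)
        return self

    def column_names(self, unknown_template="FEATURE[%d]"):
        # Colliding names are joined; unseen columns get a placeholder.
        names = []
        for i in range(self.n_features):
            if i in self.names_by_column:
                names.append(" | ".join(sorted(self.names_by_column[i])))
            else:
                names.append(unknown_template % i)
        return names


unhasher = ToyUnhasher(n_features=8, input_type="dict")
unhasher.partial_fit([{"height": 1.8, "weight": 75.0}, {"height": 1.6}])
names = unhasher.column_names()
```

A real change would also have to match how the hasher itself builds feature names from dict entries, which this sketch does not attempt. |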
Is it okay if I work on this issue, provided nobody else is already working on it? |
@kmike as I was going through the tests, I could not find any for FeatureUnhasher. Is that right? |
@coderop2 right; adding them can be a good first step. It is tested only indirectly, by testing InvertableHashingVectorizer which uses FeatureUnhasher internally. |
So first we can add support for input_type='dict' and then add tests for both input types together.
|
@coderop2 yes, this works. Alternatively, one can start by adding tests for the existing FeatureUnhasher, to get their feet wet; this would be a smaller change which can be merged separately. |
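As a rough illustration of what such direct tests could check, here is a self-contained sketch in pytest style. `recover_names` is a hypothetical stand-in for the unhasher, not an eli5 function, and CRC32 is an assumed placeholder hash. Setting the number of columns to 1 forces every name to collide, which makes the collision case deterministic.

```python
import zlib


def recover_names(samples, n_features, input_type="string"):
    """Hypothetical stand-in for an unhasher: column index -> sorted names."""
    columns = {}
    for sample in samples:
        # Dict samples contribute their keys; string samples their tokens.
        names = sample.keys() if input_type == "dict" else sample
        for name in names:
            idx = zlib.crc32(name.encode("utf-8")) % n_features
            columns.setdefault(idx, set()).add(name)
    return {idx: sorted(found) for idx, found in columns.items()}


def test_every_seen_name_is_recovered():
    samples = [{"height": 1.8, "weight": 75.0}, {"age": 30}]
    recovered = recover_names(samples, n_features=16, input_type="dict")
    all_names = {name for names in recovered.values() for name in names}
    assert all_names == {"height", "weight", "age"}


def test_collisions_keep_all_names():
    # With a single column every name collides, so the result is fixed.
    recovered = recover_names([{"a": 1, "b": 2}], n_features=1, input_type="dict")
    assert recovered == {0: ["a", "b"]}


def test_dict_and_string_inputs_agree():
    from_dict = recover_names([{"a": 1, "b": 2}], 8, "dict")
    from_tokens = recover_names([["a", "b"]], 8, "string")
    assert from_dict == from_tokens
```

Tests against the real class would follow the same pattern but build the hasher and unhasher from the library instead of the stand-in. |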
The current implementation only supports an input_type of 'string'. It would be nice to have a FeatureUnhasher that accepts a FeatureHasher with input_type='dict'.