Skip to content

Scikitlearn clustering methods cleanup #70

@ablaom

Description

@ablaom

I think these methods could do with a review. Here are a few things that look like issues to me:

  • Some models do not implement a transform method (presumably because python scikit-learn does not) but could, no? Example: DBSCAN

  • Recall that MLJ makes a distinction between report and fitted_params: the latter is for the learned parameters (in this case what is needed to assign a new observation to a class), and everything else goes in report. It seems that in the scikitlearn clustering wraps everything is just lumped into fitted_params. In particular this has led to inconsistency with the Clustering.jl models KMeans and KMedoids (which separate things correctly, as far as I can tell).

cc: @tlienart

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is needed

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions