K means binning#51
Conversation
sarahmish
left a comment
There was a problem hiding this comment.
According to the issue you initially were calling it Cluster2Float and Float2Cluster. Is there a reason you changed it to Cluster2Scalar and Scalar2Cluster?
| "sigllm.primitives.transformation.Cluster2Scalar", | ||
| "sigllm.primitives.transformation.Cluster2Scalar", |
There was a problem hiding this comment.
What is the purpose of the second Clustr2Scalar? Note that the jupyter notebook in the PR seems to use an older version of the pipeline with only one Clustr2Scalar.
There was a problem hiding this comment.
@sarahmish good catch. I probably confused myself with scalars and floats, but I changed it back to original wording.
I believe there was an update in sigllm that requires y and y_hat to both be passed through scalar2float separately for the pipeline to work. Otherwise one is scaled by decimal and the other is not.
Resolve #56
Added K-Means binning as a way to pre-process floats into scalars.