You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Stepping through the example script for ExcelFormer, I notice that this line fails with my custom dataset.
AFAIK this seems due to CatToNumTransform adding _{i} strings to the end of categorical feature names, but these names not being changed in the output TensorFrame of the CatToNumTransform. Hence, the mutual_info_sort.transformed_stats being passed to ExcelFormer on line 107 contains _{i} categorical column names while the actual TensorFrame does not.
Case in point, calling this snippet to manually rename statistics to their original name fixes the issue:
That fix might not work if the classification task is other than binary though, hence the preferred fix would be for CatToNumTransform to actually rename the column names of the TensorFrames it transforms.
The text was updated successfully, but these errors were encountered:
Hi all!
Stepping through the example script for ExcelFormer, I notice that this line fails with my custom dataset.
AFAIK this seems due to
CatToNumTransform
adding_{i}
strings to the end of categorical feature names, but these names not being changed in the outputTensorFrame
of theCatToNumTransform
. Hence, themutual_info_sort.transformed_stats
being passed toExcelFormer
on line 107 contains_{i}
categorical column names while the actual TensorFrame does not.Case in point, calling this snippet to manually rename statistics to their original name fixes the issue:
That fix might not work if the classification task is other than binary though, hence the preferred fix would be for
CatToNumTransform
to actually rename the column names of theTensorFrame
s it transforms.The text was updated successfully, but these errors were encountered: