Skip to content

What is "Slot" (PFI documentation suggestions) #5954

Open
@torronen

Description

@torronen

I am running the new PFI API (main branch with #5934) for a FastTreeBinary loaded model created by AutoML API.

Main question:
I receive items like "Slot 48416" from MLContext.BinaryClassification.PermutationFeatureImportanceNonCalibrated().
I did not find documentation about how to interpret these items. What do they mean? I am stuck with this issue.

As I understand it comes from the features vector, for slots without a name.
I am confused about why my features vector has these additional items, and how can I backtrack which original feature they belong to? I have about 2000 features in my dataset.

Side items / suggestion for documentation:
I notice there are some logging code in PFI which seem to set the progress of PFI to ProgressHeader, but I could not find documentation on how can I read the progress?
pch.SetHeader(new ProgressHeader("processed slots"), e => e.SetProgress(0, processedCnt));

There is also another GitHub issue about the recommended value for permutation and number of examples, and estimation of the running time. It seems the number of examples maybe has higher running time than O(n) but I have still not understood the source or concept of PFI adequately. It would be also useful to know if increasing number of examples or increasing number of permutations would result in more accurate results. Do I understand correctly the accuracy does increase until number of permutations reaches number of features? Increasing number of examples would increase chance the dataset is adequately represented, is it correct?

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationRelated to documentation of ML.NETenhancementNew feature or requestquestionFurther information is requested

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions