[FEATURE]: Mahalanobis Distance Kennard-Stone (MDKS) Sampler #129

JacksonBurns · 2023-06-09T14:15:47Z

Is your feature request related to a problem? Please describe.

Better interpolative splits for Artificial Neural Networks in particular with Mahalanobis Distance Kennard-Stone here.

Use-cases/examples of this new feature

See linked paper for specific examples, but is reported to generally provide data splits for ANN applications.

Desired solution/workflow

Using the base implementation of the Kennard-Stone sampler, implement this.

Discussion

Unforunately without re-writing the Mahalanobis distance method to accept pre-computed pairwise distances this method will scale at least O(n^3)

JacksonBurns · 2023-07-11T16:47:50Z

This can actually be achieved with something like this:

_, _, _, indexes_train, indexes_val, indexes_test = train_val_test_split(
    responses,
    train_size=0.8,
    val_size=0.1,
    test_size=0.1,
    sampler="kennard_stone",
    random_state=42,
    hopts={"metric": "mahalanobis"},
    return_indices=True,
)

...and it's not terribly slow.

JacksonBurns · 2023-07-17T15:43:08Z

I have confirmed that the Mahalanobis distance is working as intended, although the example above should use sampler="spxy" since (despite the name) MDKS is derived from SPXY (which is itself derived from Kennard Stone). We can just add a row in the README samplers table saying that you can achieve this sampling using SPXY.

Resolves #129 - see discussion there.

JacksonBurns added the enhancement New feature or request label Jun 9, 2023

JacksonBurns mentioned this issue Jul 17, 2023

add readme line to explain how to use MDKS #149

Merged

kspieks closed this as completed in #149 Jul 18, 2023

kspieks added a commit that referenced this issue Jul 18, 2023

add readme line to explain how to use MDKS (#149)

f8ab231

Resolves #129 - see discussion there.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE]: Mahalanobis Distance Kennard-Stone (MDKS) Sampler #129

[FEATURE]: Mahalanobis Distance Kennard-Stone (MDKS) Sampler #129

JacksonBurns commented Jun 9, 2023

JacksonBurns commented Jul 11, 2023

JacksonBurns commented Jul 17, 2023

[FEATURE]: Mahalanobis Distance Kennard-Stone (MDKS) Sampler #129

[FEATURE]: Mahalanobis Distance Kennard-Stone (MDKS) Sampler #129

Comments

JacksonBurns commented Jun 9, 2023

JacksonBurns commented Jul 11, 2023

JacksonBurns commented Jul 17, 2023