Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sub-dataset selection method #2

Open
BreziTasbi opened this issue May 16, 2024 · 0 comments
Open

Sub-dataset selection method #2

BreziTasbi opened this issue May 16, 2024 · 0 comments

Comments

@BreziTasbi
Copy link
Collaborator

To train this contrast classifier, I have access to a vast dataset sourced from NeuroPoly servers and OpenNeuro. To maximize the utility of this data, I aim to create a balanced and diverse dataset to develop a robust model. I particularly want the model to learn the relationship between image content and contrast, rather than the specific characteristics of my sub-dataset and the contrast (such as resolution, orientation, framing).

- Balance among contrasts will be ensured by assigning weights relative to their representation in the dataset (upsampling).

- Data augmentation will simulate variations in framing, orientation, and resolution through random crops, rotations, and downscalings.

- I will estimate the dataset's bias based on different characteristics by evaluating the performance of basic classifiers trained exclusively with these data. The worse these classifiers, the better the dataset.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant