Description
Hi guys,
I'll share with you my practical experiences from trying to use openml
in the last days. Maybe it helps to improve the package :)
- I started by uploading a CSV dataset as a dataset using the web interface at https://www.openml.org/new/data. No matter what I put in and how many fields I fileld out I always faced the error below. It worked using
uploadOMLDataSet()
.
"Error 131: Problem validating uploaded description file - XML does not correspond to XSD schema. Please fill in all required (red) fields, upload a file or give a URL (not both), and avoid spaces in the dataset name."
-
I then tried the CL function
uploadOMLDataSet()
. Two points here: I was expecting to upload a dataset. A dataset for me is a CSV like file. However, the function actually expects what is a "task" object inmlr
. If one is used to work withmlr
, it is not logical to upload amlr
task as a "dataset" toopenml
. But maybeopenML
makes more sense here and themlr
logic lacks. -
Creating an
openML
task usingmakeOMLTask
could really need a better help page. How should the list for argsparameters
andinput
look like? -
Once I created a task using
makeOMLTask
, how can I upload it? There is an upload function for datasets, flows and runs but none for tasks? -
Is it possible to use a different resamp strategies in the web interface then the preselected ones? E.g. I would like to use "repeated spatial CV" from
mlr
or just "5-fold CV". -
How can I "activate" my dataset? E.g. https://www.openml.org/d/41110