Chinese and English NER datasets, English-Chinese machine translation datasets, and Chinese word segmentation datasets.
Updated Feb 3, 2021 - Python
jazznet dataset of piano patterns for music audio machine learning research
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
2D Geometric shapes generator
We currently maintain 488 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please c…
SPREAD is a large-scale synthetic dataset for image- and point-cloud- based tasks in forestry.
A duplicate-free variant of the CIFAR test set.
Marktplaats.nl (Dutch Classifieds) Listing Scraper
Corpus of manually classified comments for machine learning (filtering of offensive comments)
Simple task for mixed image-graph data
Tools for a research course on deep learning in physics
Practice on ML algorithms on different problems and data sets.
Installable python package for the costar dataset.
Scripts to create Music Information Retrieval datasets from streaming services for singer identification tasks