Ten Thousand German News Articles Dataset for Topic Classification
Updated Nov 7, 2022 - Python
NoiseMix - data generation for natural language
A powerful and easy-to-use web scraper for collecting data from the web. Supports scraping of images, text, videos, metadata, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code.
Neural-network-aided diagnosis of schizophrenia via patient-centered text data
This Python script generates n pages of text with bounding boxes (bbox) and their ground-truth labels. It supports various background colors, fonts, etc., and can export the dataset as TFRecord.
DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.