Closed
Description
In a recent StackOverflow answer, our Roadmap was seemingly interpreted it to say that text/NLP is a purely future work item.
The roadmap currently says:
Featurization Improvements
- Text (*)
- Natural language text preprocessing such as tokenization, part-of-speech tagging, and sentence > breaking
- Pre-trained text models that can be used for extracting of semantic or sentiment features from text
- Image (*)
- Image preprocessing such as loading, resizing, and normalization if images
- Image featurization, including industry-standard pre-trained ImageNet neural models, such as ResNet and AlexNet
We should change the roadmap to indicate that further text/NLP techniques like pre-trained WordEmbedding models, improvements to tokenization, etc. are on the roadmap, whereas text handling, in the form of n-grams, already exists.