Background
Currently, Cobrix uses RDDs and .textFile() for handling UTF-8 ASCII files. For other encodings slower processing is used.
This can be improved based on this idea:
https://github.com/apache/spark/pull/21287/files
Feature
Improve performance of non-UTF8 encoded ASCII files processing