This version contains:
- Fix for segmentation fault occurring in case of large datasets. Large datasets are split into smaller chunks, allowing for proper calculation of kmer frequencies
- New command-line option available:
--batch_size
- specifying the maximum number of sequences used in the single step of kmer frequencies calculation - Calculated kmer frequencies are saved in the separate files, allowing for easy re-run of computations in case of interrupt.
This version does not contain any changes to models used for plasmid prediction.