You do not need to list nltk
and pandas
here.
main.py L:[4]
used[sklearn.model_selection]
for [importing KFold class].main.py L:[5]
used[sklearn.metrics]
for [importing confusion_matrix class].main.py L:[34]
used[KFold]
for [defining the kf object for cross validation purposes].main.py L:[232, 235, 236]
used[confusion_matrix]
for [constructing the confusion matrix].
Example usage: use the following command in the current directory.
python3 src/main.py --train data/train.csv --test data/test.csv --output output/test.csv
The data can be found in data/train.txt,and the in-domain test data can be found in data/test.txt.