SCARP2020-ML

The repository for SCARP 2020 on Network Traffic Analytics.

If you use our code, please cite our papers:

[1] Onur Barut, Matthew Grohotolski, Connor DiLeo, Yan Luo, Peilong Li, and Tong Zhang. 2020. Machine Learning Based Malware Detection on Encrypted Traffic: A Comprehensive Performance Study. In 7th International Conference on Networking, Systems and Security (7th NSysS 2020), December 22–24, 2020, Dhaka, Bangladesh. ACM, New York, NY, USA, 9 pages. https://doi.org/10.1145/3428363.3428365

[2] Derek Manning, Peilong Li, Xiaoban Wu, Yan Luo, Tong Zhang, Weigang Li. 2020. ACETA: Accelerating Encrypted Traffic Analytics on Network Edge. In ICC 2020-2020 IEEE International Conference on Communications (ICC), pp. 1-6.

The project is codenamed ANTA for Accelerated Network Traffic Analytics.

This is a joint project by the following individuals from the institutions listed below:

Elizabethtown College (Elizabethtown, PA)
- Dr. Peilong Li - Advisor of the project
- Matthew Grohotolski - Student researcher
- Connor DiLeo - Student researcher
University of Massachusetts Lowell (Lowell, MA)
- Dr. Yan Luo - Advisor of the project
- Onur Barut - Student researcher
Intel Corp.
- Tong Zhang - Co-advisor of the project

Walkthrough Guide

Currently there is one main runnable file (nsyss_benchmark.py). A sample command is shown below, followed by a description of its arguments.

python nsyss_benchmark.py --dataset NetML --model LR daal_LR --kfolds 10

-d, --dataset [DATASET]: Specifies which dataset to use (supported: NetML, CICIDS)
-m, --model [MODELTYPE]: Specifies which model or models to use (supported: LR, daal_LR, kNN, daal_kNN, RF, daal_DF, SVM, daal_SVM, MLP, 1D_CNN, vino_1D_CNN, 2D_CNN, vino_2D_CNN, LSTM, vino_LSTM, CNN+LSTM, vino_CNN+LSTM)
-k, --kfolds [NUMBER]: Specifies how many times to run each model
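The option list above could be parsed along the following lines. This is a minimal sketch, not the parser in nsyss_benchmark.py: the function name `build_parser`, the help strings, and the default of 10 for `--kfolds` are assumptions made for illustration. Only the flag names and the fact that `--model` accepts several values come from the sample command.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags documented above."""
    parser = argparse.ArgumentParser(description="Sketch of the benchmark CLI")
    parser.add_argument("-d", "--dataset", required=True,
                        help="which dataset to use")
    # nargs="+" lets one invocation list several models, as in the sample command.
    parser.add_argument("-m", "--model", nargs="+", required=True,
                        help="one or more model names")
    # The default value here is an assumption, not taken from the repository.
    parser.add_argument("-k", "--kfolds", type=int, default=10,
                        help="how many times to run each model")
    return parser


# Parse the sample command's arguments from an explicit list.
args = build_parser().parse_args(
    ["--dataset", "NetML", "--model", "LR", "daal_LR", "--kfolds", "10"]
)
print(args.dataset, args.model, args.kfolds)
```

Passing an explicit list to `parse_args` keeps the sketch runnable without a command line; a real script would call `parse_args()` with no arguments.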

The command above supports the model types listed below.

Standard Supported Models:

Logistic Regression (LR)
Random Forest (RF)
Support Vector Machine (SVM)
K Nearest Neighbors (kNN)

DAAL Supported Models:

Logistic Regression (daal_LR)
Decision Forest (daal_DF)
Support Vector Machine (daal_SVM)
K Nearest Neighbors (daal_kNN)

Standard AI Supported Models:

Artificial Neural Network (ANN)
Long Short Term Memory Neural Network (LSTM)
Convolutional + Long Short Term Memory Neural Network (CNN+LSTM)
Convolutional 1D Neural Network (1D_CNN)
Convolutional 2D Neural Network (2D_CNN)

OpenVINO AI Supported Models:

VINO Artificial Neural Network (vino_ANN)
VINO Long Short Term Memory Neural Network (vino_LSTM)
VINO Convolutional + Long Short Term Memory Neural Network (vino_CNN+LSTM)
VINO Convolutional 1D Neural Network (vino_1D_CNN)
VINO Convolutional 2D Neural Network (vino_2D_CNN)
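Each standard model in the lists above has an accelerated counterpart (DAAL for the classical models, OpenVINO for the neural networks). The pairing can be written down as a lookup table, which is handy when building a `--model` argument that compares both variants. The dictionary `ACCELERATED` and the helper `with_accelerated` are hypothetical names for this sketch and are not part of the repository.

```python
# Pairs taken from the model lists above; the names of this table and
# helper are illustrative assumptions, not repository code.
ACCELERATED = {
    "LR": "daal_LR",
    "RF": "daal_DF",
    "SVM": "daal_SVM",
    "kNN": "daal_kNN",
    "ANN": "vino_ANN",
    "LSTM": "vino_LSTM",
    "CNN+LSTM": "vino_CNN+LSTM",
    "1D_CNN": "vino_1D_CNN",
    "2D_CNN": "vino_2D_CNN",
}


def with_accelerated(models):
    """Return each baseline model followed by its accelerated counterpart."""
    expanded = []
    for name in models:
        expanded.append(name)
        expanded.append(ACCELERATED[name])
    return expanded


# Build the value for --model from the four classical baselines.
model_arg = " ".join(with_accelerated(["LR", "RF", "SVM", "kNN"]))
print(model_arg)
```

Note that Random Forest (RF) pairs with DAAL's Decision Forest (daal_DF), so the accelerated name is not always the baseline name with a prefix.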

Once a model is run through the command above, it begins training and testing. The profiler also records how much memory is used, along with training and testing times.
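To make the profiling idea concrete, here is a minimal sketch of timing a training or testing call and recording its peak memory using only the Python standard library. It is not the project's profiler: `profile_call` is a hypothetical helper, and `tracemalloc` only sees allocations made through Python's allocator, so memory used inside native libraries may not be counted.

```python
import time
import tracemalloc


def profile_call(fn, *args, **kwargs):
    """Run fn and return (result, elapsed_seconds, peak_bytes).

    Hypothetical helper for illustration only.
    """
    tracemalloc.start()
    start = time.perf_counter()
    try:
        result = fn(*args, **kwargs)
    finally:
        # Collect measurements even if fn raises, then stop tracing.
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
    return result, elapsed, peak


def fake_training_step(n):
    """Stand-in workload; a real run would fit a model here."""
    return sum(x * x for x in range(n))


result, elapsed, peak = profile_call(fake_training_step, 100_000)
print(f"elapsed={elapsed:.4f}s peak={peak} bytes")
```

The same wrapper can be applied separately to the training call and the testing call to obtain one time and one memory figure for each phase.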

Finally, once a model finishes, its output is saved in the results folder, under either the NetML or the CICIDS2017 folder, and is categorized by the start time of the run. The final model folder contains a list of final performance results with average performance statistics.
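The layout described above (a results folder, then a dataset folder, then a folder named after the start time) and the averaging of per-run statistics can be sketched as follows. The timestamp format, the function names `result_dir` and `average_metrics`, and the metric keys are assumptions for illustration, and the numbers are made-up placeholders rather than measured results.

```python
from datetime import datetime
from pathlib import Path
from statistics import mean


def result_dir(root, dataset, started_at):
    """Build <root>/<dataset>/<start time>; the time format is an assumption."""
    return Path(root) / dataset / started_at.strftime("%Y%m%d-%H%M%S")


def average_metrics(fold_metrics):
    """Average each metric across runs; every dict must share the same keys."""
    keys = fold_metrics[0].keys()
    return {key: mean(m[key] for m in fold_metrics) for key in keys}


# Placeholder values only, chosen to be exact in floating point.
folds = [
    {"accuracy": 0.75, "train_seconds": 2.0},
    {"accuracy": 0.875, "train_seconds": 4.0},
]

out_dir = result_dir("results", "NetML", datetime(2020, 12, 22, 9, 30, 0))
summary = average_metrics(folds)
print(out_dir, summary)
```

Keying the folder on the start time means repeated runs of the same model never overwrite each other.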

Some benchmark commands are also provided to test various aspects of the program. See the example commands below.

With NetML dataset:

Run all ML / DAAL models: python nsyss_benchmark.py --dataset NetML --model LR daal_LR RF daal_DF SVM daal_SVM kNN daal_kNN --kfolds 10
Run all DL / VINO models: python nsyss_benchmark.py --dataset NetML --model ANN vino_ANN LSTM vino_LSTM CNN+LSTM vino_CNN+LSTM 1D_CNN vino_1D_CNN 2D_CNN vino_2D_CNN --kfolds 10

With CICIDS2017 dataset:

Run all ML / DAAL models: python nsyss_benchmark.py --dataset CICIDS2017 --model LR daal_LR RF daal_DF SVM daal_SVM kNN daal_kNN --kfolds 10
Run all DL / VINO models: python nsyss_benchmark.py --dataset CICIDS2017 --model ANN vino_ANN LSTM vino_LSTM CNN+LSTM vino_CNN+LSTM 1D_CNN vino_1D_CNN 2D_CNN vino_2D_CNN --kfolds 10

Additional Resources

daal4py Documentation Home

https://intelpython.github.io/daal4py/index.html

Intel DAAL Developer Guide

https://software.intel.com/en-us/download/intel-data-analytics-acceleration-library-developer-guide

OpenVINO Documentation Home

https://docs.openvinotoolkit.org/latest/index.html

OpenVINO Developer Guide

https://docs.openvinotoolkit.org/latest/openvino_docs_MO_DG_Deep_Learning_Model_Optimizer_DevGuide.html
