Towards a Neural Language Model for Signature Extraction

Description

This repository contains the source code and data for the experiments that were presented in the paper "Towards a Neural Language Model for Signature Extraction from Forensic Logs".

DOI: https://doi.org/10.1109/ISDFS.2017.7916497

Preparation of experiment environment:

Requirements: python2.7, pip

Create virtual environment and activate it (optional)

pip install virtualenv
virtualenv exp
. exp/bin/activate.fish  # fish shell; for bash or zsh use: . exp/bin/activate

Install dependencies

pip install keras==2.0.4
pip install scikit-learn==0.18.1
pip install tensorflow==1.1.0  # CPU version; the GPU package is tensorflow-gpu
pip install h5py==2.7.1  # for saving Keras models
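Because the experiments depend on exact library versions, it can help to confirm what is actually installed before running anything. The following is a minimal sketch, not part of the repository: the helper names `installed_version` and `find_mismatches` are hypothetical, and it assumes each package exposes a `__version__` attribute and that TensorFlow reports its version as `1.1.0`.

```python
import importlib

# Pinned versions taken from the install commands above. Keys are import
# names, so scikit-learn appears as "sklearn". Assumption: tensorflow
# reports "1.1.0" as its version string.
PINNED = {
    "keras": "2.0.4",
    "sklearn": "0.18.1",
    "tensorflow": "1.1.0",
    "h5py": "2.7.1",
}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def find_mismatches(pinned, lookup=installed_version):
    """Map each module whose version differs from its pin to (expected, found)."""
    mismatches = {}
    for name, expected in sorted(pinned.items()):
        found = lookup(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches
```

Calling `find_mismatches(PINNED)` inside the activated environment returns an empty dictionary when every pinned package is present at the expected version.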

Run the experiments

Experiments should be run in order. For example, to reproduce the neural language model experiment, run the script whose name starts with 100, then 101, and so on.
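The ordering rule can be sketched in Python as follows. This is an illustration only: the helpers `scripts_in_series` and `run_commands` are hypothetical and not part of the repository. It assumes that every script follows the three-digit-prefix naming scheme, that the 100 to 199 range covers the neural language model experiment, and that the scripts need no extra arguments.

```python
import re


def scripts_in_series(filenames, first, last):
    """Return Python scripts whose numeric prefix lies in [first, last], ascending."""
    numbered = []
    for name in filenames:
        # Match names such as "101_train-bilstm-charmodel.py".
        match = re.match(r"(\d{3})_.*\.py$", name)
        if match and first <= int(match.group(1)) <= last:
            numbered.append((int(match.group(1)), name))
    return [name for _, name in sorted(numbered)]


def run_commands(filenames, first, last, python="python"):
    """Build one command (as an argument list) per script, in execution order."""
    return [[python, name] for name in scripts_in_series(filenames, first, last)]
```

Each returned command could then be passed to `subprocess.check_call` from the repository root, for example with `os.listdir(".")` as the list of file names, so that a failing script stops the sequence.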

To evaluate:

Run the 900 script (900_evaluate_correct_signature_assignment.py). The -e parameter specifies the directory where the results should be stored.
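As a rough sketch, the evaluation call could be assembled like this. The helper `evaluation_command` is hypothetical, the directory `c100` is only a placeholder value, and the script may accept or require other parameters that are not described here.

```python
def evaluation_command(results_dir,
                       script="900_evaluate_correct_signature_assignment.py",
                       python="python"):
    """Build the argument list for the evaluation script.

    Only the -e parameter is described in this README; any other options
    the script may take are unknown and therefore not included.
    """
    return [python, script, "-e", results_dir]


# Placeholder directory name; substitute the directory of your experiment.
command = evaluation_command("c100")
```

The resulting list can be handed to `subprocess.check_call(command)` from the repository root.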

Citation

If you plan to use this work, please use the following citation:

@inproceedings{thaler2017towards,
  title={Towards a neural language model for signature extraction from forensic logs},
  author={Thaler, Stefan and Menkovski, Vlado and Petkovic, Milan},
  booktitle={Digital Forensic and Security (ISDFS), 2017 5th International Symposium on},
  pages={1--6},
  year={2017},
  organization={IEEE}
}

Work used in this paper:

IPLoM Implementation:

Paper of IPLoM:

Title: 2012, Makanju et al., "A lightweight algorithm for message type extraction in system application logs"
DOI: http://dx.doi.org/10.1109/TKDE.2011.138

Paper that provided IPLoM source code:

Title: 2016, He et al., "An Evaluation Study on Log Parsing and Its Use in Log Mining"
Link: http://jiemingzhu.github.io/pub/pjhe_dsn2016.pdf
SourceCode: https://github.com/cuhk-cse/logparser/commit/d3fe123235899a2cf2d454434a3eb1a1222f03bd

LogCluster implementation

Title: 2015, Vaarandi et al., "LogCluster - A Data Clustering and Pattern Mining Algorithm for Event Logs"
SourceCode: https://github.com/ristov/logcluster/commit/eadbf25df94257dc3cf72bb79e672d257bbce616

Repository contents:

approaches
c010
c020/results
c100
c110
c120
data
exp
helpers
.gitignore
000_finalize_test_tags.py
001_extract-true-signatures.py
002_assign_retrieved_logs.py
010_run_IPLoM.py
011_convert_results.py
012_retrieve_logs_using_signatures.py
020_logclusterpl-commands.txt
020_run_logcluster.pl
021_convert_results.py
022_retrieve_logs_using_signatures.py
100_crossvalidate-bilstm-charmodel.py
101_train-bilstm-charmodel.py
102_tag_testlogs-bilstm-charmodel.py
110_extract_signatures-vanilla.py
111_retrieve_logs_using_signatures.py
120_extract-signatures-bilstm-charmodel-heuristic.py
900_evaluate_correct_signature_assignment.py
LICENSE
__init__.py
out.logsig
readme.md
