Skip to content

Commit 718373a

Browse files
committed
change readme
1 parent 9abeffa commit 718373a

File tree

2 files changed

+22
-19
lines changed

2 files changed

+22
-19
lines changed

README.md

Lines changed: 22 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,25 @@
11
# SimEngine
2-
SimEngine is an open-source similarity measurement and feature selection repository in Python. It is build upon several standard math libraries(e.g. math, Numpy) and machine learning libraries (scikit-learn, Scipy).
3-
SimEngine could be used to determine the similarity between proxy and parent applications. It also includes algorithms to facilitate feature selection, as minimizing the number of features is very important to reduce the data collected and thus lower the number of application runs.
4-
With the help of the quantitative similarity measurement we have developed in SimEngine, users are guided to choose proper proxy applications for particular uses. Besides quantifying fidelity of proxy applications, similarity measurement approaches in SimEngine can also be applied to various HPC problems, such as compiler optimization, code refactoring, and application input sensitivity.
2+
SimEngine is an open-source similarity measurement and feature selection repository in Python. It is build upon several standard math libraries(e.g. math, Numpy) and machine learning libraries (scikit-learn, Scipy).
3+
SimEngine could be used to determine the similarity between proxy and parent applications. It also includes algorithms to facilitate feature selection, as minimizing the number of features is very important to reduce the data collected and thus lower the number of application runs.
4+
With the help of the quantitative similarity measurement we have developed in SimEngine, users are guided to choose proper proxy applications for particular uses. Besides quantifying fidelity of proxy applications, similarity measurement approaches in SimEngine can also be applied to various HPC problems, such as compiler optimization, code refactoring, and application input sensitivity.
55

66
# Prerequisites:
7-
Python 3
8-
Numpy
9-
Scipy
10-
Scikit-learn
7+
Python 3
8+
Numpy
9+
Scipy
10+
Scikit-learn
11+
12+
# How to run SimEngine
13+
If you want to process the raw data, go to step A. If you want to start from the processed CSV files, go to step B
14+
15+
## A.Start from the raw data
16+
Create the accumulated CSV and delta CSV
17+
1 Go to directory dataprocess. Open dataprep.sh, change TopDataPrep (at the beginning of the file) to {path to the raw data}
18+
2 chmod +x dataprep.sh
19+
3 ./dataprep.sh # It may take a long time (e.g. one or two days)to build two csv directory (csv_acc and csv_del)
20+
21+
## B.Use the existing CSV directory to analysis
22+
1 Go to directory engine
23+
2 python main.py
24+
25+

dataprocess/README.md

Lines changed: 0 additions & 12 deletions
This file was deleted.

0 commit comments

Comments
 (0)