GitHub - Yuhala/secv: Secure code partitioning via multi-language secure values

SecV: Secure Code Partitioning via Multi-Language Secure Values

SecV is a multi-language partitioning approach for TEE software, e.g., with Intel SGX. It is based on GraalVM's Truffle framework, an open-source Java library for building programming language implementations. SecV proposes secure AST nodes which can be used to annotate sensitive information in a language-agnostic fashion. The corresponding code is then analyzed with Polytaint, a Truffle-based program analyzer we developed, which splits the program into trusted and untrusted parts that run inside and outside an SGX TEE, respectively.

This PoC implementation accompanies our Middleware'23 paper:

@inproceedings{10.1145/3590140.3629116,
author = {Yuhala, Peterson and Felber, Pascal and Guiroux, Hugo and Lozi, Jean-Pierre and Tchana, Alain and Schiavoni, Valerio and Thomas, Ga\"{e}l},
title = {SecV: Secure Code Partitioning via Multi-Language Secure Values},
year = {2023},
isbn = {9798400701771},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3590140.3629116},
doi = {10.1145/3590140.3629116},
booktitle = {Proceedings of the 24th International Middleware Conference},
pages = {207–219},
numpages = {13},
keywords = {GraalVM, Intel SGX, Java, Managed Execution Environments, Truffle, Trusted Execution Environments},
location = {Bologna, Italy},
series = {Middleware '23}
}

Polytaint tool.

The Polytaint tool is a Truffle instrumentation-based taint tracking tool. It analyses polyglot programs to find the source sections (i.e., methods/functions) which access secure types defined by the SecureL Truffle language implementation.
Polytaint mainly instruments function call, variable write, and variable read nodes to track the tainted (trusted) functions.
The results of the run-time analysis are then transferred to the Partitioning module, which partitions the application into two parts: a trusted Java native image application, which exposes the trusted functions as GraalVM entry points, and an untrusted Java native image application, which exposes the functions/methods that do not access secure types.
These images are then built with the GraalVM native image tool and used to build the final Intel SGX application.
More info on the inner workings of this tool will be provided later on.

How to analyse programs with Polytaint.

Polytaint analyses programs in two modes: full and part.
In full mode, no instrumentation is done. A single Java file is created in the output folder: Trusted.java. This Java program simply runs the full guest language code snippet using the Java polyglot API. The file is copied to substratevm, where it is used to create a native image which runs entirely inside an SGX enclave.
In part mode, the program (e.g., JS or Python) is run on the JVM and instrumented accordingly. That is, program variables and functions/methods that access or modify SecureL types are registered as the program runs. The analysis divides functions into three categories:

1. Trusted functions: functions which explicitly manipulate secL types or take secure variables as parameters. NB: A secure variable is either a variable which receives its value directly from a secL scope via the polyglot API, or one whose value originates implicitly from another secL variable. Trusted functions are exported as static Java methods in Trusted.java. They have proxies in Untrusted.java which perform Intel SGX transitions.

2. Untrusted functions: functions which neither manipulate secure variables explicitly nor take them as input. These are exported as static Java methods in Untrusted.java. The primary argument for this design choice is TCB reduction. They have proxies in Trusted.java which perform Intel SGX transitions.

3. Neutral functions: functions which do not explicitly declare/manipulate secure types in their body but take them as input. They are exported fully in both Trusted.java and Untrusted.java. The primary arguments for this design choice are security (no secure type should be sent out) and performance (no transition needed).
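
The definition of a secure variable in item 1 (a value taken directly from a secL scope, or derived from another secure variable) amounts to a taint-propagation rule. The Python sketch below illustrates that rule on a simplified list of assignments. It is not Polytaint's implementation, which instruments Truffle AST nodes at run time. The function name `secure_variables`, the `(target, sources)` assignment format, and the example variable names are assumptions made for illustration only.

```python
def secure_variables(direct_secure, assignments):
    """Return the set of variables considered secure.

    direct_secure: names that receive their value directly from a secL scope.
    assignments:   (target, sources) pairs, meaning `target` is computed
                   from the variables listed in `sources`.
    Both inputs are a hypothetical, simplified view of a program.
    """
    secure = set(direct_secure)
    changed = True
    # Iterate to a fixed point so that taint flows through chains of
    # assignments regardless of the order in which they are listed.
    while changed:
        changed = False
        for target, sources in assignments:
            if target not in secure and any(s in secure for s in sources):
                secure.add(target)
                changed = True
    return secure


# Hypothetical example: `secret` comes from a secL scope, `total` is derived
# from it, `label` depends on `total`, and `counter` never touches secure data.
example = secure_variables(
    {"secret"},
    [("label", ["total"]), ("total", ["secret", "counter"]), ("counter", [])],
)
print(sorted(example))
```

The sketch ignores statement order and control flow, so it over-approximates what a run-time analysis would observe on a single execution.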

At the end of the analysis, the results (i.e., the trusted, untrusted, and neutral functions) are sent to the partitioning module. It creates two Java files, Trusted.java and Untrusted.java, as well as other useful files (C/C++ headers, EDL files, etc.) used by the sgx module to build the final SGX application. The Java files export the corresponding guest language functions as Java methods using the Polyglot API.

The Java files are copied to substratevm and used to build two native images which will run inside (using Trusted.java) and outside the enclave (using Untrusted.java).

Running the tool

To run the polytaint tool, use the runTaintTrack.sh script as follows: ./runTaintTrack.sh <guestLanguage> <programFile> <imageType>.
To partition a program, run taint tracking with imageType set to part. For example:

./runTaintTrack.sh js polyglot.js part

The above command analyses the program in polyglot.js with our polytaint instrumentation tool and determines which functions are Trusted, Untrusted, and Neutral. The result is printed at the end of the analysis. The two Java programs are then created as described above and copied to substratevm in the graal folder. The other generated files are moved to the appropriate module.
To build the partitioned native image, move to the graal folder and run the script: ./build_polytaint_images.sh. This builds two relocatable .o images corresponding to the Trusted.java and Untrusted.java native images. These object files are moved to the sgx-module in the sgx folder.
To build the corresponding SGX application, move to the sgx folder and run make clean && make. Run the program with ./app.
To run the program in polyglot.js entirely inside the enclave, we don't need to partition it, so we use the command:

./runTaintTrack.sh js polyglot.js full

This skips taint analysis and just creates a single Java application with the JavaScript program embedded. To build the corresponding native image, move into the graal folder and run: ./build_full_poly_image.sh. A relocatable native image file, main.o, is created and moved to the sgx module.
Similarly, to build the corresponding SGX application, move to the sgx folder and run make clean && make. Run the program with ./app.
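
The two workflows above differ only in the image type and the build script. The Python sketch below lists the steps for each mode side by side. It only assembles the commands named in this section and does not execute anything. The function name `workflow_steps` is hypothetical, and because this section does not state where the graal and sgx folders sit relative to the script, the sketch records folder names only.

```python
def workflow_steps(guest_language, program_file, image_type):
    """Return (folder, command) pairs for the given image type.

    "script directory" stands for wherever runTaintTrack.sh lives; the
    other folder names are the ones used in the text above.
    """
    if image_type == "part":
        build_script = "./build_polytaint_images.sh"
    elif image_type == "full":
        build_script = "./build_full_poly_image.sh"
    else:
        raise ValueError("image_type must be 'part' or 'full'")

    return [
        ("script directory",
         f"./runTaintTrack.sh {guest_language} {program_file} {image_type}"),
        ("graal", build_script),
        ("sgx", "make clean && make"),
        ("sgx", "./app"),
    ]


# Print the steps for partitioning polyglot.js.
for folder, command in workflow_steps("js", "polyglot.js", "part"):
    print(f"[{folder}] {command}")
```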

MISC information

The partitioning tool uses regular expressions to parse out function definitions. For JS code, this is more or less feasible. For Python source code, however, it is very difficult to extract the function body with a regex. To simplify the process, we introduce a magic string, func_end=1, at the end of Python function definitions. This is a valid Python assignment statement, but we are not interested in its effect. It simply makes things easier for us when parsing out function definitions. The statement is removed from the final partitioned program.
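
To show why the marker helps, here is a minimal Python sketch of regex-based extraction that relies on func_end=1 as an end delimiter. The pattern, the helper names `extract_functions` and `strip_markers`, and the sample functions are illustrative assumptions. The tool's actual regular expressions may differ, and the sketch only handles simple top-level definitions.

```python
import re

# Matches a top-level `def` up to and including the `func_end=1` marker.
# This pattern is an illustrative assumption, not the tool's actual regex.
FUNC_PATTERN = re.compile(
    r"^def\s+(?P<name>\w+)\s*\((?P<params>[^)]*)\)\s*:[ \t]*\n"
    r"(?P<body>.*?)"
    r"^[ \t]*func_end\s*=\s*1[ \t]*$",
    re.MULTILINE | re.DOTALL,
)

# Matches a whole marker line, including its trailing newline if present.
MARKER_LINE = re.compile(r"^[ \t]*func_end\s*=\s*1[ \t]*$\n?", re.MULTILINE)


def extract_functions(source):
    """Map each function name to its body text, without the marker line."""
    return {m.group("name"): m.group("body") for m in FUNC_PATTERN.finditer(source)}


def strip_markers(source):
    """Remove every `func_end=1` marker line from the source."""
    return MARKER_LINE.sub("", source)


# Hypothetical input; placing the marker as the last indented line of each
# function is an assumption about how the marker is written.
SAMPLE = '''def add(a, b):
    result = a + b
    return result
    func_end=1

def greet(name):
    return "hello " + name
    func_end=1
'''

functions = extract_functions(SAMPLE)
print(sorted(functions))
print(strip_markers(SAMPLE))
```

Without the marker, a regex has no reliable way to tell where an indentation-delimited body ends. With it, a lazy match up to the first marker line is enough.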

Possible bugs

If you install a language locally with gu and use the -f (force) option to skip version checks, the polyglot API may fail to detect the language even though gu list reports it as installed. This is a weird issue that I have yet to understand. Nonetheless, everything works correctly when the language (e.g., SecureL) is installed with: gu install -L secL-component.jar.
