The library calls exec() directly on model-generated code. The warning above the call says it should be commented out by default, but it isn't.
human-eval/human_eval/execution.py
Lines 40 to 50 in 6d43fb9
```python
# WARNING
# This program exists to execute untrusted model-generated code. Although
# it is highly unlikely that model-generated code will do something overtly
# malicious in response to this test suite, model-generated code may act
# destructively due to a lack of model capability or alignment.
# Users are strongly encouraged to sandbox this evaluation suite so that it
# does not perform destructive actions on their host or network. For more
# information on how OpenAI sandboxes its code, see the accompanying paper.
# Once you have read this disclaimer and taken appropriate precautions,
# uncomment the following line and proceed at your own risk:
exec(check_program, exec_globals)
```
The best solution would be to provide at least one sandbox integration as a reasonable default. I'm open to contributing this.
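To illustrate the kind of integration I have in mind, here is a minimal sketch (not code from this repository, and assuming a POSIX host): instead of calling exec() in-process, run the check program in a separate interpreter process with CPU and memory rlimits. The `run_sandboxed` name and its parameters are hypothetical. Note that a subprocess with rlimits is *not* a full sandbox — it gives no filesystem or network isolation — so a real default would likely layer this under a container or gVisor.

```python
import os
import resource
import subprocess
import sys
import tempfile


def run_sandboxed(code: str, timeout: float = 5.0, memory_mb: int = 256):
    """Run untrusted code in a child interpreter with resource limits.

    Sketch only: rlimits cap CPU time and address space, but do NOT
    provide filesystem or network isolation.
    """

    def set_limits():
        # Applied in the child just before exec(): cap memory and CPU time.
        limit_bytes = memory_mb * 1024 * 1024
        resource.setrlimit(resource.RLIMIT_AS, (limit_bytes, limit_bytes))
        resource.setrlimit(resource.RLIMIT_CPU, (int(timeout), int(timeout)))

    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(code)
        path = f.name
    try:
        return subprocess.run(
            [sys.executable, "-I", path],  # -I: isolated mode (ignores env, user site)
            capture_output=True,
            text=True,
            timeout=timeout,
            preexec_fn=set_limits,
        )
    finally:
        os.unlink(path)
```

A well-behaved program runs normally, while one that tries to allocate far past the memory cap dies with a MemoryError instead of exhausting the host.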